Explaining Mortgage Defaults Using Shap and Lasso
No Thumbnail Available
Date
2024
Authors
Ozturkkal, Belma
Wahlstrom, Ranik Raaen
Journal Title
Journal ISSN
Volume Title
Publisher
Springer
Open Access Color
OpenAIRE Downloads
OpenAIRE Views
Abstract
We utilize machine learning methods to model the credit risk of mortgages in a significant emerging market. For this purpose, we investigate a multitude of variables that explain the characteristics of the loans, the demographics of the borrowers, and macroeconomic factors. We employ SHapley Additive exPlanations (SHAP) values in conjunction with five different tree-based machine learning methods, as well as the least absolute shrinkage and selection operator (LASSO) in conjunction with logistic regressions. Our findings, which are robust across two sampling schemes, reveal that while demographic variables are significant and important, loan-specific and macroeconomic variables are the most crucial in explaining mortgage defaults. As existing literature on mortgage default has primarily focused on advanced markets, we aim to bridge this gap by concentrating on emerging market data. We also share our code, which we hope will encourage others to utilize the methods we have applied.
Description
Keywords
Mortgage Loan, Default Risk, Tree-Based Methods, Lasso, Shap, Emerging Market, C40, C52, C65, G21
Turkish CoHE Thesis Center URL
Fields of Science
Citation
0
WoS Q
Q2
Scopus Q
Q2