Explaining Mortgage Defaults Using Shap and Lasso
No Thumbnail Available
Date
2024
Journal Title
Journal ISSN
Volume Title
Publisher
Springer
Open Access Color
HYBRID
Green Open Access
Yes
OpenAIRE Downloads
OpenAIRE Views
Publicly Funded
No
Abstract
We utilize machine learning methods to model the credit risk of mortgages in a significant emerging market. For this purpose, we investigate a multitude of variables that explain the characteristics of the loans, the demographics of the borrowers, and macroeconomic factors. We employ SHapley Additive exPlanations (SHAP) values in conjunction with five different tree-based machine learning methods, as well as the least absolute shrinkage and selection operator (LASSO) in conjunction with logistic regressions. Our findings, which are robust across two sampling schemes, reveal that while demographic variables are significant and important, loan-specific and macroeconomic variables are the most crucial in explaining mortgage defaults. As existing literature on mortgage default has primarily focused on advanced markets, we aim to bridge this gap by concentrating on emerging market data. We also share our code, which we hope will encourage others to utilize the methods we have applied.
Description
Keywords
Mortgage Loan, Default Risk, Tree-Based Methods, Lasso, Shap, Emerging Market, C40, C52, C65, G21
Turkish CoHE Thesis Center URL
Fields of Science
0502 economics and business, 05 social sciences
Citation
WoS Q
Q2
Scopus Q
Q2

OpenCitations Citation Count
N/A
Source
Computational Economics
Volume
66
Issue
Start Page
3291
End Page
3325
PlumX Metrics
Citations
Scopus : 2
Captures
Mendeley Readers : 17
Google Scholar™

OpenAlex FWCI
0.99906764
Sustainable Development Goals
3
GOOD HEALTH AND WELL-BEING

7
AFFORDABLE AND CLEAN ENERGY

9
INDUSTRY, INNOVATION AND INFRASTRUCTURE

11
SUSTAINABLE CITIES AND COMMUNITIES

15
LIFE ON LAND

17
PARTNERSHIPS FOR THE GOALS


