Explaining Mortgage Defaults Using Shap and Lasso

No Thumbnail Available

Date

2024

Authors

Ozturkkal, Belma
Wahlstrom, Ranik Raaen

Journal Title

Journal ISSN

Volume Title

Publisher

Springer

Open Access Color

OpenAIRE Downloads

OpenAIRE Views

Research Projects

Organizational Units

Journal Issue

Abstract

We utilize machine learning methods to model the credit risk of mortgages in a significant emerging market. For this purpose, we investigate a multitude of variables that explain the characteristics of the loans, the demographics of the borrowers, and macroeconomic factors. We employ SHapley Additive exPlanations (SHAP) values in conjunction with five different tree-based machine learning methods, as well as the least absolute shrinkage and selection operator (LASSO) in conjunction with logistic regressions. Our findings, which are robust across two sampling schemes, reveal that while demographic variables are significant and important, loan-specific and macroeconomic variables are the most crucial in explaining mortgage defaults. As existing literature on mortgage default has primarily focused on advanced markets, we aim to bridge this gap by concentrating on emerging market data. We also share our code, which we hope will encourage others to utilize the methods we have applied.

Description

Keywords

Mortgage Loan, Default Risk, Tree-Based Methods, Lasso, Shap, Emerging Market, C40, C52, C65, G21

Turkish CoHE Thesis Center URL

Fields of Science

Citation

0

WoS Q

Q2

Scopus Q

Q2

Source

Volume

Issue

Start Page

End Page