Alternative Credit Scoring and Classification Employing Machine Learning Techniques on a Big Data Platform

gdc.relation.journal UBMK 2019 - Proceedings, 4th International Conference on Computer Science and Engineering en_US
dc.contributor.author Hindistan, Yavuz Selim
dc.contributor.author Kiyakoğlu, Burhan Yasin
dc.contributor.author Rezaeinazhad, Arash Mohammadian
dc.contributor.author Korkmaz, Halil Ergun
dc.contributor.author Dağ, Hasan
dc.date.accessioned 2021-02-19T18:52:27Z
dc.date.available 2021-02-19T18:52:27Z
dc.date.issued 2019
dc.description.abstract With the bloom of financial technology and innovations aiming to deliver a high standard of financial services, banks and credit service companies, along with other financial institutions, use the most recent technologies available in a variety of ways from addressing the information asymmetry, matching the needs of borrowers and lenders, to facilitating transactions using payment services. In the long list of FinTechs, one of the most attractive platforms is the Peer-to-Peer (P2P) lending which aims to bring the investors and borrowers hand in hand, leaving out the traditional intermediaries like banks. The main purpose of a financial institution as an intermediary is of controlling risk and P2P lending platforms innovate and use new ways of risk assessment. In the era of Big Data, using a diverse source of information from spending behaviors of customers, social media behavior, and geographic information along with traditional methods for credit scoring prove to have new insights for the proper and more accurate credit scoring. In this study, we investigate the machine learning techniques on big data platforms, analyzing the credit scoring methods. It has been concluded that on a HDFS (Hadoop Distributed File System) environment, Logistic Regression performs better than Decision Tree and Random Forest for credit scoring and classification considering performance metrics such as accuracy, precision and recall, and the overall run time of algorithms. Logistic Regression also performs better in time in a single node HDFS configuration compared to a non-HDFS configuration. en_US
dc.identifier.citationcount 3
dc.identifier.doi 10.1109/UBMK.2019.8907113 en_US
dc.identifier.isbn 978-172813964-7
dc.identifier.scopus 2-s2.0-85076215629 en_US
dc.identifier.uri https://hdl.handle.net/20.500.12469/3960
dc.language.iso en en_US
dc.publisher Institute of Electrical and Electronics Engineers Inc. en_US
dc.relation.ispartof 2019 4th International Conference on Computer Science and Engineering (UBMK)
dc.rights info:eu-repo/semantics/closedAccess en_US
dc.subject Big data en_US
dc.subject Credit Risk Scoring en_US
dc.subject Crowd-funding en_US
dc.subject Hadoop en_US
dc.subject Machine Learning en_US
dc.subject P2P en_US
dc.subject Peer-to-Peer lending en_US
dc.title Alternative Credit Scoring and Classification Employing Machine Learning Techniques on a Big Data Platform en_US
dc.type Book Part en_US
dspace.entity.type Publication
gdc.author.institutional Dağ, Hasan
gdc.author.institutional Kiyakoğlu, Burhan Yasin en_US
gdc.author.institutional Rezaeinazhad, Arash Mohammadian en_US
gdc.author.institutional Korkmaz, Halil Ergun en_US
gdc.author.institutional Daǧ, Hasan en_US
gdc.bip.impulseclass C5
gdc.bip.influenceclass C5
gdc.bip.popularityclass C4
gdc.coar.access metadata only access
gdc.coar.type text::book::book part
gdc.description.endpage 734 en_US
gdc.description.publicationcategory Kitap Bölümü - Uluslararası en_US
gdc.description.scopusquality N/A
gdc.description.startpage 731 en_US
gdc.description.wosquality N/A
gdc.identifier.openalex W2990454259
gdc.identifier.wos WOS:000609879900138 en_US
gdc.oaire.diamondjournal false
gdc.oaire.impulse 2.0
gdc.oaire.influence 2.7595506E-9
gdc.oaire.isgreen false
gdc.oaire.keywords Machine Learning
gdc.oaire.keywords Big data
gdc.oaire.keywords P2P
gdc.oaire.keywords Hadoop
gdc.oaire.keywords Crowd-funding
gdc.oaire.keywords Credit Risk Scoring
gdc.oaire.keywords Peer-to-Peer lending
gdc.oaire.popularity 4.2934354E-9
gdc.oaire.publicfunded false
gdc.oaire.sciencefields 0202 electrical engineering, electronic engineering, information engineering
gdc.oaire.sciencefields 02 engineering and technology
gdc.openalex.fwci 2.173
gdc.openalex.normalizedpercentile 0.82
gdc.opencitations.count 4
gdc.plumx.crossrefcites 3
gdc.plumx.mendeley 61
gdc.plumx.scopuscites 7
gdc.scopus.citedcount 7
gdc.wos.citedcount 5
relation.isAuthorOfPublication e02bc683-b72e-4da4-a5db-ddebeb21e8e7
relation.isAuthorOfPublication.latestForDiscovery e02bc683-b72e-4da4-a5db-ddebeb21e8e7
relation.isOrgUnitOfPublication ff62e329-217b-4857-88f0-1dae00646b8c
relation.isOrgUnitOfPublication acb86067-a99a-4664-b6e9-16ad10183800
relation.isOrgUnitOfPublication b20623fc-1264-4244-9847-a4729ca7508c
relation.isOrgUnitOfPublication.latestForDiscovery ff62e329-217b-4857-88f0-1dae00646b8c

Files