Alternative Credit Scoring and Classification Employing Machine Learning Techniques on a Big Data Platform

dc.contributor.authorDağ, Hasan
dc.contributor.authorKiyakoğlu, Burhan Yasin
dc.contributor.authorRezaeinazhad, Arash Mohammadian
dc.contributor.authorKorkmaz, Halil Ergun
dc.contributor.authorDağ, Hasan
dc.date.accessioned2021-02-19T18:52:27Z
dc.date.available2021-02-19T18:52:27Z
dc.date.issued2019
dc.description.abstractWith the bloom of financial technology and innovations aiming to deliver a high standard of financial services, banks and credit service companies, along with other financial institutions, use the most recent technologies available in a variety of ways from addressing the information asymmetry, matching the needs of borrowers and lenders, to facilitating transactions using payment services. In the long list of FinTechs, one of the most attractive platforms is the Peer-to-Peer (P2P) lending which aims to bring the investors and borrowers hand in hand, leaving out the traditional intermediaries like banks. The main purpose of a financial institution as an intermediary is of controlling risk and P2P lending platforms innovate and use new ways of risk assessment. In the era of Big Data, using a diverse source of information from spending behaviors of customers, social media behavior, and geographic information along with traditional methods for credit scoring prove to have new insights for the proper and more accurate credit scoring. In this study, we investigate the machine learning techniques on big data platforms, analyzing the credit scoring methods. It has been concluded that on a HDFS (Hadoop Distributed File System) environment, Logistic Regression performs better than Decision Tree and Random Forest for credit scoring and classification considering performance metrics such as accuracy, precision and recall, and the overall run time of algorithms. Logistic Regression also performs better in time in a single node HDFS configuration compared to a non-HDFS configuration.en_US
dc.identifier.citation3
dc.identifier.doi10.1109/UBMK.2019.8907113en_US
dc.identifier.endpage734en_US
dc.identifier.isbn978-172813964-7
dc.identifier.scopus2-s2.0-85076215629en_US
dc.identifier.scopusqualityN/A
dc.identifier.startpage731en_US
dc.identifier.urihttps://hdl.handle.net/20.500.12469/3960
dc.identifier.wosWOS:000609879900138en_US
dc.identifier.wosqualityN/A
dc.institutionauthorHindistan, Yavuz Selimen_US
dc.institutionauthorKiyakoğlu, Burhan Yasinen_US
dc.institutionauthorRezaeinazhad, Arash Mohammadianen_US
dc.institutionauthorKorkmaz, Halil Ergunen_US
dc.institutionauthorDaǧ, Hasanen_US
dc.language.isoenen_US
dc.publisherInstitute of Electrical and Electronics Engineers Inc.en_US
dc.relation.journalUBMK 2019 - Proceedings, 4th International Conference on Computer Science and Engineeringen_US
dc.relation.publicationcategoryKitap Bölümü - Uluslararasıen_US
dc.rightsinfo:eu-repo/semantics/closedAccessen_US
dc.subjectBig dataen_US
dc.subjectCredit Risk Scoringen_US
dc.subjectCrowd-fundingen_US
dc.subjectHadoopen_US
dc.subjectMachine Learningen_US
dc.subjectP2Pen_US
dc.subjectPeer-to-Peer lendingen_US
dc.titleAlternative Credit Scoring and Classification Employing Machine Learning Techniques on a Big Data Platformen_US
dc.typeBook Parten_US
dspace.entity.typePublication
relation.isAuthorOfPublicatione02bc683-b72e-4da4-a5db-ddebeb21e8e7
relation.isAuthorOfPublication.latestForDiscoverye02bc683-b72e-4da4-a5db-ddebeb21e8e7

Files