Striking a Balance: Evaluating Credit Risk with Traditional and Machine Learning Models
DOI:
https://doi.org/10.61506/01.00425Keywords:
Credit Risk, Credit Scoring, Machine Learning Model, Traditional Model, DefaultAbstract
This research assesses machine learning models' validity, clarity, and equity, compared to classical models and especially logistic regression in credit risk evaluation. In the traditional model of data management, efficiency and the accuracy of information are challenges; an issue of machine learning models is model selection and multicollinearity. The study intends to help financial institutions establish the best strategy for their needs. Furthermore, it delves into the effect of heterogeneous data sources on the credit risk model using machine learning. The research analyses the implications of using machine learning in assessing credit risk. Interestingly, focusing on peer-to-peer lending platforms, the research aims to deal with the need for more attention to combining machine learning and traditional models in the literature. The deductive method is the application of inferential analyses, the Traditional model is logistic regression, and the Machine Learning model is a neural network (CNN model) based on secondary data from the Kaggle peer-to-peer lending dataset. With likely findings expected to comprise prediction of the probability of default and better availability of loans, risk analysis leads to formulated lending decisions managing a financial portfolio.
References
Alonso Robisco, A. and Carbó Martínez, J.M. (2022). Measuring the model risk-adjusted performance of machine learning algorithms in credit default prediction. Financial Innovation, 8(1). DOI: https://doi.org/10.1186/s40854-022-00366-1
Alonso, A. and Carbó, J.M. (2020). Machine Learning in Credit Risk: Measuring the Dilemma Between Prediction and Supervisory Cost. DOI: https://doi.org/10.2139/ssrn.3724374
Amaro, M.M. (2020). Credit scoring: comparison of non‐parametric techniques against logistic regression.
Ariza-Garzon, M.J., Arroyo, J., Caparrini, A. and Segovia-Vargas, M.-J. (2020). Explainability of a Machine Learning Granting Scoring Model in Peer-to-Peer Lending. IEEE Access, 8, pp.64873–64890. DOI: https://doi.org/10.1109/ACCESS.2020.2984412
Bojer, C.S. and Meldgaard, J.P. (2020). Kaggle Forecasting competitions: an Overlooked Learning Opportunity. International Journal of Forecasting, 37(2). DOI: https://doi.org/10.1016/j.ijforecast.2020.07.007
Breeden, J. (2021). A Survey of Machine Learning in Credit Risk. DOI: https://doi.org/10.21314/JCR.2021.008
Doumpos, M., Christos Lemonakis, Dimitrios Niklis and Constantin Zopounidis (2019). Analytical Techniques in the Assessment of Credit Risk. EURO advanced tutorials on operational research. DOI: https://doi.org/10.1007/978-3-319-99411-6
Dumitrescu, E., Hué, S., Hurlin, C. and Tokpavi, S. (2021). Machine Learning for Credit Scoring: Improving Logistic Regression with Non-Linear Decision-Tree Effects. European Journal of Operational Research, 297(3). DOI: https://doi.org/10.1016/j.ejor.2021.06.053
Fullerton, A.S. and Anderson, K.F. (2021). Ordered regression models: A tutorial. Prevention Science, pp.1-13.
Heydarian, M., Doyle, T.E. and Samavi, R. (2022). MLCM: Multi-Label Confusion Matrix. IEEE Access, 10, pp.19083–19095. DOI: https://doi.org/10.1109/ACCESS.2022.3151048
Hussin Adam Khatir, A.A. and Bee, M. (2022). Machine learning models and data-balancing techniques for credit scoring: What is the best combination?. Risks, 10(9), p.169. DOI: https://doi.org/10.3390/risks10090169
Kigo, S.N., Omondi, E.O. and Omolo, B.O. (2023). Assessing predictive performance of supervised machine learning algorithms for a diamond pricing model. Scientific Reports, 13(1), p.17315. DOI: https://doi.org/10.1038/s41598-023-44326-w
Koskimäki, M. (2021). Default prediction in peer-to-peer lending and country comparison. lutpub.lut.fi.
Mhlanga, D. (2021). Financial inclusion in emerging economies: Applying machine learning and artificial intelligence in credit risk assessment. International journal of economic studies, 9(3), p.39. DOI: https://doi.org/10.3390/ijfs9030039
Naik, K.S. (2021). Predicting Credit Risk for Unsecured Lending: A Machine Learning Approach. arXiv preprint arXiv:2110.02206.
Naili, M. and Lahrichi, Y. (2020). The determinants of banks’ credit risk: Review of the literature and future research agenda. International Journal of Finance & Economics, 27(1). Alonso Robisco, A. and Carbó Martínez, J.M. (2022). Measuring the model risk-adjusted performance of machine learning algorithms in credit default prediction. Financial Innovation, 8(1). DOI: https://doi.org/10.1186/s40854-022-00366-1
Noriega, J.P., Rivera, L.A. and Herrera, J.A. (2023). Machine Learning for Credit Risk Prediction: A Systematic Literature Review. Data, 8(11), p.169. DOI: https://doi.org/10.3390/data8110169
Quaranta, L., Calefato, F. and Lanubile, F. (2021), May. Kgtorrent: A dataset of python jupyter notebooks from Kaggle. In 2021 IEEE/ACM 18th International Conference on Mining Software Repositories (MSR) (pp. 550-554). IEEE. DOI: https://doi.org/10.1109/MSR52588.2021.00072
Razali, N.A.M., Shamsaimon, N., Ishak, K.K., Ramli, S., Amran, M.F.M. and Sukardi, S. (2021). Gap, techniques, and evaluation: traffic flow prediction using machine learning and deep learning. Journal of Big Data, 8(1), pp.1-25. DOI: https://doi.org/10.1186/s40537-021-00542-7
Wang, Y., Zhang, Y., Lu, Y., and Yu, X. (2020). A Comparative Assessment of Credit Risk Model Based on Machine Learning——a case study of bank loan data. Procedia Computer Science, 174, pp.141-149. DOI: https://doi.org/10.1016/j.procs.2020.06.069