Algorithm comparison for data mining classification: assessing bank customer credit scoring default risk
Rating consumer credit risk involves assessing credit application risks. Thus, every business must appropriately identify debtors and non-debtors. This study uses machine learning approaches to simulate consumer credit risk and compares the results to the logistic model, determining if machine learn...
| Main Authors: | , |
|---|---|
| Format: | Article |
| Language: | English |
| Published: |
Penerbit Universiti Kebangsaan Malaysia
2024
|
| Online Access: | http://journalarticle.ukm.my/25733/ http://journalarticle.ukm.my/25733/1/13.pdf |
| Summary: | Rating consumer credit risk involves assessing credit application risks. Thus, every business must appropriately identify debtors and non-debtors. This study uses machine learning approaches to simulate consumer credit risk and compares the results to the logistic model, determining if machine learning improves client default ratings. The study examines how customer attributes affect virtual experiences. Despite advances in machine learning models for credit assessment, unbalanced datasets and some algorithms’ failure to explain forecasts remain major issues. This study used 2005 Taiwanese credit card consumers’ education, age, marital status, payment history, and sex. The default experience is modeled using Logistic Regression, K neighbors, Support Vector Machine, Decision Tree, Random Forest, Ada Boost Classifier, and Gradient Boosting. The models’ Accuracy, precision, recall, receiver operating characteristic (ROC) curve, and precision-recall curve were evaluated. Random Forest’s 97% ROC metric rating outperformed all other accuracy metrics. The logistic model underperformed, while machine learning improved the default categorization. |
|---|