Consumer credit risk: Individual probability estimates using machine learning

被引:115
|
作者
Kruppa, Jochen [1 ]
Schwarz, Alexandra [2 ]
Arminger, Gerhard [2 ]
Ziegler, Andreas [1 ]
机构
[1] Univ Lubeck, Univ Klinikum Schleswig Holstein, Inst Med Biometrie & Stat, D-23562 Lubeck, Germany
[2] Univ Wuppertal, Schumpeter Sch Business & Econ, D-42097 Wuppertal, Germany
关键词
Probability estimation; Random forest; Credit scoring; Probability machines; Logistic regression; Machine learning; IMPROVED CONFIDENCE-INTERVALS; CLASSIFICATION ALGORITHMS; RANDOM FORESTS; CONVERGENCE; PERFORMANCE; CONSISTENCY; PREDICTION; REGRESSION;
D O I
10.1016/j.eswa.2013.03.019
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Consumer credit scoring is often considered a classification task where clients receive either a good or a bad credit status. Default probabilities provide more detailed information about the creditworthiness of consumers, and they are usually estimated by logistic regression. Here, we present a general framework for estimating individual consumer credit risks by use of machine learning methods. Since a probability is an expected value, all nonparametric regression approaches which are consistent for the mean are consistent for the probability estimation problem. Among others, random forests (RF), k-nearest neighbors (kNN), and bagged k-nearest neighbors (bNN) belong to this class of consistent nonparametric regression approaches. We apply the machine learning methods and an optimized logistic regression to a large dataset of complete payment histories of short-termed installment credits. We demonstrate probability estimation in Random Jungle, an RF package written in C++ with a generalized framework for fast tree growing, probability estimation, and classification. We also describe an algorithm for tuning the terminal node size for probability estimation. We demonstrate that regression RF outperforms the optimized logistic regression model, kNN, and bNN on the test data of the short-term installment credits. (c) 2013 Elsevier Ltd. All rights reserved.
引用
收藏
页码:5125 / 5131
页数:7
相关论文
共 50 条
  • [1] Credit Scoring to Classify Consumer Loan Using Machine Learning
    Natasha, Azaria
    Prastyo, Dedy Dwi
    Suhartono
    2ND INTERNATIONAL CONFERENCE ON SCIENCE, MATHEMATICS, ENVIRONMENT, AND EDUCATION, 2019, 2019, 2194
  • [2] Consumer credit-risk models via machine-learning algorithms
    Khandani, Amir E.
    Kim, Adlar J.
    Lo, Andrew W.
    JOURNAL OF BANKING & FINANCE, 2010, 34 (11) : 2767 - 2787
  • [3] Credit Risk Analysis Using Machine Learning Algorithms
    Kalayci, Sacide
    Kamasak, Mustafa
    Arslan, Secil
    2018 26TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE (SIU), 2018,
  • [4] Sharp Probability Tail Estimates for Portfolio Credit Risk
    Collamore, Jeffrey F. F.
    de Silva, Hasitha
    Vidyashankar, Anand N. N.
    RISKS, 2022, 10 (12)
  • [5] Credit Risk Analysis Using Machine Learning Techniques
    Shiv, S. J.
    Murthy, Srinivasa
    Challuru, Krishnaprasad
    2018 FOURTEENTH INTERNATIONAL CONFERENCE ON INFORMATION PROCESSING (ICINPRO) - 2018, 2018, : 214 - 218
  • [6] Credit Risk Assessment Using Machine Learning Algorithms
    Attigeri, Girija V.
    Pai, M. M. Manohara
    Pai, Radhika M.
    ADVANCED SCIENCE LETTERS, 2017, 23 (04) : 3649 - 3653
  • [7] Predicting of Credit Risk Using Machine Learning Algorithms
    Antony, Tisa Maria
    Kumar, B. Sathish
    ARTIFICIAL INTELLIGENCE: THEORY AND APPLICATIONS, VOL 1, AITA 2023, 2024, 843 : 99 - 114
  • [8] Credit Risk Prediction Using Machine Learning and Deep Learning: A Study on Credit Card Customers
    Chang, Victor
    Sivakulasingam, Sharuga
    Wang, Hai
    Wong, Siu Tung
    Ganatra, Meghana Ashok
    Luo, Jiabin
    RISKS, 2024, 12 (11)
  • [9] Credit Risk Analysis Using Machine and Deep Learning Models
    Addo, Peter Martey
    Guegan, Dominique
    Hassani, Bertrand
    RISKS, 2018, 6 (02):
  • [10] Credit Risk Analysis Using Machine-Learning Algorithms
    Alagoz, Gokhan
    Canakoglu, Ethem
    29TH IEEE CONFERENCE ON SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS (SIU 2021), 2021,