Consumer credit risk: Individual probability estimates using machine learning

被引:115
|
作者
Kruppa, Jochen [1 ]
Schwarz, Alexandra [2 ]
Arminger, Gerhard [2 ]
Ziegler, Andreas [1 ]
机构
[1] Univ Lubeck, Univ Klinikum Schleswig Holstein, Inst Med Biometrie & Stat, D-23562 Lubeck, Germany
[2] Univ Wuppertal, Schumpeter Sch Business & Econ, D-42097 Wuppertal, Germany
关键词
Probability estimation; Random forest; Credit scoring; Probability machines; Logistic regression; Machine learning; IMPROVED CONFIDENCE-INTERVALS; CLASSIFICATION ALGORITHMS; RANDOM FORESTS; CONVERGENCE; PERFORMANCE; CONSISTENCY; PREDICTION; REGRESSION;
D O I
10.1016/j.eswa.2013.03.019
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Consumer credit scoring is often considered a classification task where clients receive either a good or a bad credit status. Default probabilities provide more detailed information about the creditworthiness of consumers, and they are usually estimated by logistic regression. Here, we present a general framework for estimating individual consumer credit risks by use of machine learning methods. Since a probability is an expected value, all nonparametric regression approaches which are consistent for the mean are consistent for the probability estimation problem. Among others, random forests (RF), k-nearest neighbors (kNN), and bagged k-nearest neighbors (bNN) belong to this class of consistent nonparametric regression approaches. We apply the machine learning methods and an optimized logistic regression to a large dataset of complete payment histories of short-termed installment credits. We demonstrate probability estimation in Random Jungle, an RF package written in C++ with a generalized framework for fast tree growing, probability estimation, and classification. We also describe an algorithm for tuning the terminal node size for probability estimation. We demonstrate that regression RF outperforms the optimized logistic regression model, kNN, and bNN on the test data of the short-term installment credits. (c) 2013 Elsevier Ltd. All rights reserved.
引用
收藏
页码:5125 / 5131
页数:7
相关论文
共 50 条
  • [21] Machine Learning Analysis of Mortgage Credit Risk
    Pillai, Sivakumar G.
    Woodbury, Jennifer
    Dikshit, Nikhil
    Leider, Avery
    Tappert, Charles C.
    PROCEEDINGS OF THE FUTURE TECHNOLOGIES CONFERENCE (FTC) 2019, VOL 1, 2020, 1069 : 107 - 123
  • [22] Two-stage consumer credit risk modelling using heterogeneous ensemble learning
    Papouskova, Monika
    Hajek, Petr
    DECISION SUPPORT SYSTEMS, 2019, 118 : 33 - 45
  • [23] ON THE AGGREGATION OF INDIVIDUAL PROBABILITY ESTIMATES
    BORDLEY, RF
    WOLFF, RW
    MANAGEMENT SCIENCE, 1981, 27 (08) : 959 - 964
  • [24] Credit Risk Assessment and Fraud Detection in Financial Transactions Using Machine Learning
    Malik, Pankaj
    Chourasia, Ankita
    Pandit, Rakesh
    Bawane, Sheetal
    Surana, Jayesh
    JOURNAL OF ELECTRICAL SYSTEMS, 2024, 20 (03) : 2061 - 2069
  • [25] Credit Default Risk Analysis Using Machine Learning Algorithms with Hyperparameter Optimization
    Inga, Juan
    Sacoto-Cabrera, Erwin
    INTELLIGENT TECHNOLOGIES: DESIGN AND APPLICATIONS FOR SOCIETY, CITIS 2022, 2023, 607 : 81 - 95
  • [26] Assessing credit risk of commercial customers using hybrid machine learning algorithms
    Machado, Marcos Roberto
    Karray, Salma
    EXPERT SYSTEMS WITH APPLICATIONS, 2022, 200
  • [27] Credit risk assessment using statistical and machine learning: basic methodology and risk modeling applications
    Galindo, J.
    Tamayo, P.
    Computational Economics, 2000, 15 (1-2) : 107 - 143
  • [28] Demand Forecasting of Individual Probability Density Functions with Machine Learning
    Wick F.
    Kerzel U.
    Hahn M.
    Wolf M.
    Singhal T.
    Stemmer D.
    Ernst J.
    Feindt M.
    Operations Research Forum, 2 (3)
  • [29] Credit Risk Prediction Based on Machine Learning Methods
    Li, Yu
    14TH INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND EDUCATION (ICCSE 2019), 2019, : 1011 - 1013
  • [30] ANALYSING MACHINE LEARNING METHODS AND CREDIT RISK ASSESSMENT
    Coelho, Felipe Fernandes
    de Lima Amorim, Daniel Penido
    de Camargos, Marcos Antonio
    REVISTA GESTAO & TECNOLOGIA-JOURNAL OF MANAGEMENT AND TECHNOLOGY, 2021, 21 (01): : 89 - 116