Consumer credit risk: Individual probability estimates using machine learning

被引:115
|
作者
Kruppa, Jochen [1 ]
Schwarz, Alexandra [2 ]
Arminger, Gerhard [2 ]
Ziegler, Andreas [1 ]
机构
[1] Univ Lubeck, Univ Klinikum Schleswig Holstein, Inst Med Biometrie & Stat, D-23562 Lubeck, Germany
[2] Univ Wuppertal, Schumpeter Sch Business & Econ, D-42097 Wuppertal, Germany
关键词
Probability estimation; Random forest; Credit scoring; Probability machines; Logistic regression; Machine learning; IMPROVED CONFIDENCE-INTERVALS; CLASSIFICATION ALGORITHMS; RANDOM FORESTS; CONVERGENCE; PERFORMANCE; CONSISTENCY; PREDICTION; REGRESSION;
D O I
10.1016/j.eswa.2013.03.019
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Consumer credit scoring is often considered a classification task where clients receive either a good or a bad credit status. Default probabilities provide more detailed information about the creditworthiness of consumers, and they are usually estimated by logistic regression. Here, we present a general framework for estimating individual consumer credit risks by use of machine learning methods. Since a probability is an expected value, all nonparametric regression approaches which are consistent for the mean are consistent for the probability estimation problem. Among others, random forests (RF), k-nearest neighbors (kNN), and bagged k-nearest neighbors (bNN) belong to this class of consistent nonparametric regression approaches. We apply the machine learning methods and an optimized logistic regression to a large dataset of complete payment histories of short-termed installment credits. We demonstrate probability estimation in Random Jungle, an RF package written in C++ with a generalized framework for fast tree growing, probability estimation, and classification. We also describe an algorithm for tuning the terminal node size for probability estimation. We demonstrate that regression RF outperforms the optimized logistic regression model, kNN, and bNN on the test data of the short-term installment credits. (c) 2013 Elsevier Ltd. All rights reserved.
引用
收藏
页码:5125 / 5131
页数:7
相关论文
共 50 条
  • [41] Credit Risk Scoring Analysis Based on Machine Learning Models
    Qiu, Ziyue
    Li, Yuming
    Ni, Pin
    Li, Gangmin
    2019 6TH INTERNATIONAL CONFERENCE ON INFORMATION SCIENCE AND CONTROL ENGINEERING (ICISCE 2019), 2019, : 220 - 224
  • [42] Machine Learning for Credit Risk Prediction: A Systematic Literature Review
    Noriega, Jomark Pablo
    Rivera, Luis Antonio
    Herrera, Jose Alfredo
    DATA, 2023, 8 (11)
  • [43] Responsible Credit Risk Assessment with Machine Learning and Knowledge Acquisition
    Charles Guan
    Hendra Suryanto
    Ashesh Mahidadia
    Michael Bain
    Paul Compton
    Human-Centric Intelligent Systems, 2023, 3 (3): : 232 - 243
  • [44] Explainable Ensemble Machine Learning Method for Credit Risk Classification
    Ben Ghozzi, Sirine
    Ben HajKacem, Mohamed Aymen
    Essoussi, Nadia
    2024 INTERNATIONAL CONFERENCE ON INNOVATIONS IN INTELLIGENT SYSTEMS AND APPLICATIONS, INISTA, 2024,
  • [45] Use of machine learning techniques in bank credit risk analysis
    Fenerich, Amanda
    Arns Steiner, Maria Teresinha
    Steiner Neto, Pedro Jose
    Tochetto, Edevar
    Tsutsumi, Diego
    Assef, Fernanda Medeiros
    dos Santos, Bruno Samways
    REVISTA INTERNACIONAL DE METODOS NUMERICOS PARA CALCULO Y DISENO EN INGENIERIA, 2020, 36 (03):
  • [46] The generalized Vasicek credit risk model: A Machine Learning approach
    Garcia-Cespedes, Ruben
    Moreno, Manuel
    FINANCE RESEARCH LETTERS, 2022, 47
  • [47] Machine learning-driven credit risk: a systemic review
    Shi, Si
    Tse, Rita
    Luo, Wuman
    D'Addona, Stefano
    Pau, Giovanni
    NEURAL COMPUTING & APPLICATIONS, 2022, 34 (17): : 14327 - 14339
  • [48] Machine learning-driven credit risk: a systemic review
    Si Shi
    Rita Tse
    Wuman Luo
    Stefano D’Addona
    Giovanni Pau
    Neural Computing and Applications, 2022, 34 : 14327 - 14339
  • [49] Credit Modelling using Hybrid Machine Learning Technique
    Dahiya, Shashi
    Handa, S. S.
    Singh, N. P.
    2015 INTERNATIONAL CONFERENCE ON SOFT COMPUTING TECHNIQUES AND IMPLEMENTATIONS (ICSCTI), 2015,
  • [50] Detecting Credit Card Fraud using Machine Learning
    Almuteer A.H.
    Aloufi A.A.
    Alrashidi W.O.
    Alshobaili J.F.
    Ibrahim D.M.
    International Journal of Interactive Mobile Technologies, 2021, 15 (24) : 108 - 122