Tree-based heterogeneous cascade ensemble model for credit scoring

Cited by: 12
Authors
Liu, Wanan [1]
Fan, Hong [1]
Xia, Meng [2]
Affiliations
[1] Donghua Univ, Glorious Sun Sch Business & Management, Shanghai 200051, Peoples R China
[2] Donghua Univ, Coll Informat Sci & Technol, Shanghai, Peoples R China
Funding
National Natural Science Foundation of China; Natural Science Foundation of Shanghai
Keywords
Credit scoring; Ensemble algorithm; Heterogeneous deep forest; Weighted voting mechanism; Interpretability; ART CLASSIFICATION ALGORITHMS; BANKRUPTCY PREDICTION; FEATURE-SELECTION; IMPACT; PERFORMANCE; MACHINES;
DOI
10.1016/j.ijforecast.2022.07.007
Chinese Library Classification number
F [Economics]
Discipline classification code
02
Abstract
Credit scoring is an important tool that helps banks and lending companies guard against commercial risk, and it provides good conditions for building personal credit. Ensemble algorithms have shown appealing progress in improving credit scoring. In this study, to meet the challenge of large-scale credit scoring, we propose a heterogeneous deep forest model (Heter-DF), whose design considers base learner selection, encouragement of diversity among base learners, and ensemble strategies. Heter-DF is designed as a scalable cascading framework that can increase its complexity with the scale of the credit dataset. Moreover, each level of Heter-DF is built from multiple heterogeneous tree-based ensemble base learners, avoiding homogeneous predictions within the ensemble framework. In addition, a weighted voting mechanism is introduced to highlight important information and suppress irrelevant features, making Heter-DF a robust model for credit scoring. Experimental results on four credit scoring datasets and six evaluation metrics show that the cascading framework is a good choice for ensembling tree-based base learners. A comparison between homogeneous and heterogeneous ensembles further demonstrates the effectiveness of Heter-DF. Experiments on training sets of different sizes indicate that Heter-DF is a scalable framework that not only handles large-scale credit scoring but also works where small-scale credit scoring is desirable. Finally, based on the good interpretability of a tree-based structure, the global interpretation of Heter-DF is preliminarily explored. (c) 2022 International Institute of Forecasters. Published by Elsevier B.V. All rights reserved.
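The abstract names two mechanisms, a cascade of levels and a weighted vote over heterogeneous base learners, without giving their formulas. The following is a minimal sketch of what such steps could look like, not the authors' implementation. It assumes that each base learner's weight is proportional to a validation score, and that each level passes the original features plus all learners' class probabilities to the next level, as in generic deep-forest cascades. The names `weighted_vote` and `augment_features` are hypothetical.

```python
from typing import List, Sequence


def weighted_vote(probas: Sequence[Sequence[float]],
                  scores: Sequence[float]) -> List[float]:
    """Combine per-learner class probabilities with score-proportional weights.

    ASSUMPTION: weights come from validation scores; the abstract does not
    say how Heter-DF computes its voting weights.
    """
    if not probas or len(probas) != len(scores):
        raise ValueError("need one score per probability vector")
    total = sum(scores)
    if total <= 0:
        raise ValueError("scores must sum to a positive value")
    weights = [s / total for s in scores]  # normalize so weights sum to 1
    n_classes = len(probas[0])
    return [sum(w * p[c] for w, p in zip(weights, probas))
            for c in range(n_classes)]


def augment_features(x: Sequence[float],
                     probas: Sequence[Sequence[float]]) -> List[float]:
    """Build the next level's input: original features plus every learner's
    class-probability vector (ASSUMPTION: generic deep-forest cascade)."""
    out = list(x)
    for p in probas:
        out.extend(p)
    return out


# Three hypothetical base learners scoring one applicant as [good, bad].
level_probas = [[0.9, 0.1], [0.4, 0.6], [0.2, 0.8]]
val_scores = [0.5, 0.25, 0.25]

combined = weighted_vote(level_probas, val_scores)
next_input = augment_features([1.0, 2.0], level_probas[:2])
```

In this sketch a learner with a higher validation score pulls the combined probability toward its own prediction, which is one simple way to keep a weak learner from dominating a level's output.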
Pages: 1593-1614
Page count: 22