Tree-based heterogeneous cascade ensemble model for credit scoring

被引:12
|
作者
Liu, Wanan [1 ]
Fan, Hong [1 ]
Xia, Meng [2 ]
机构
[1] Donghua Univ, Glorious Sun Sch Business & Management, Shanghai 200051, Peoples R China
[2] Donghua Univ, Coll Informat Sci & Technol, Shanghai, Peoples R China
基金
中国国家自然科学基金; 上海市自然科学基金;
关键词
Credit scoring; Ensemble algorithm; Heterogeneous deep forest; Weighted voting mechanism; Interpretability; ART CLASSIFICATION ALGORITHMS; BANKRUPTCY PREDICTION; FEATURE-SELECTION; IMPACT; PERFORMANCE; MACHINES;
D O I
10.1016/j.ijforecast.2022.07.007
中图分类号
F [经济];
学科分类号
02 ;
摘要
Credit scoring is an important tool to guard against commercial risks for banks and lending companies and provides good conditions for the construction of individual personal credit. Ensemble algorithms have shown appealing progress for the improvement of credit scoring. In this study, to meet the challenge of large-scale credit scoring, we propose a heterogeneous deep forest model (Heter-DF), which is established based on considerations ranging from base learner selection, encouragement of the diversity of base learners, and ensemble strategies, for credit scoring. Heter-DF is designed as a scalable cascading framework that can increase its complexity with the scale of the credit dataset. Moreover, each level of Heter-DF is built by multiple heterogeneous tree-based ensembled base learners, avoiding the homogeneous prediction of the ensemble framework. In addition, a weighted voting mechanism is introduced to highlight important information and suppress irrelevant features, making Heter-DF a robust model for credit scoring. Experimental results on four credit scoring datasets and six evaluation metrics show that the cascading framework a good choice for the ensemble of tree-based base learners. A comparison among homogeneous ensembles and heterogeneous ensembles further demonstrates the effectiveness of Heter-DF. Experiments on different training sets indicate that Heter-DF is a scalable framework which not only deals with large-scale credit scoring but also satisfies the condition where small-scale credit scoring is desirable. Finally, based on the good interpretability of a tree-based structure, the global interpretation of Heter-DF is preliminarily explored. (c) 2022 International Institute of Forecasters. Published by Elsevier B.V. All rights reserved.
引用
收藏
页码:1593 / 1614
页数:22
相关论文
共 50 条
  • [31] INTELLIGENT TREE-BASED ENSEMBLE APPROACHES FOR PHISHING WEBSITE DETECTION
    Alsariera, Yazan A.
    Balogun, Abdullateef O.
    Adeyemo, Victor E.
    Tarawneh, Omar H.
    Mojeed, Hammed A.
    JOURNAL OF ENGINEERING SCIENCE AND TECHNOLOGY, 2022, 17 (01): : 563 - 582
  • [32] A tree-based varying coefficient model
    Zakrisson, Henning
    Lindholm, Mathias
    COMPUTATIONAL STATISTICS, 2025,
  • [33] Tree-Based Ensemble Learning Techniques in the Analysis of Parkinsonian Syndromes
    Gorriz, J. M.
    Ramirez, J.
    Moreno-Caballero, M.
    Martinez-Murcia, F. J.
    Ortiz, A.
    Illan, I. A.
    Segovia, F.
    Salas-Gonzalez, D.
    Gomez-Rio, M.
    MEDICAL IMAGE UNDERSTANDING AND ANALYSIS (MIUA 2017), 2017, 723 : 459 - 469
  • [34] A novel hybrid ensemble model based on tree-based method and deep learning method for default prediction
    He, Hongliang
    Fan, Yanli
    EXPERT SYSTEMS WITH APPLICATIONS, 2021, 176
  • [35] A Credit Scoring Model Based on Integrated Mixed Sampling and Ensemble Feature Selection: RBR XGB _
    Lin, Xiaobing
    Wu, Zhe
    Chen, Jianfa
    Huang, Lianfen
    Shi, Zhiyuan
    JOURNAL OF INTERNET TECHNOLOGY, 2022, 23 (05): : 1061 - 1068
  • [36] Application of new deep genetic cascade ensemble of SVM classifiers to predict the Australian credit scoring
    Plawiak, Pawel
    Abdar, Moloud
    Acharya, U. Rajendra
    APPLIED SOFT COMPUTING, 2019, 84
  • [37] Classifier selection and clustering with fuzzy assignment in ensemble model for credit scoring
    Zhang, Haoting
    He, Hongliang
    Zhang, Wenyu
    NEUROCOMPUTING, 2018, 316 : 210 - 221
  • [38] A novel deep ensemble model for imbalanced credit scoring in internet finance
    Xiao, Jin
    Zhong, Yu
    Jia, Yanlin
    Wang, Yadong
    Li, Ruoyi
    Jiang, Xiaoyi
    Wang, Shouyang
    INTERNATIONAL JOURNAL OF FORECASTING, 2024, 40 (01) : 348 - 372
  • [39] An ensemble tree-based machine learning model for predicting the uniaxial compressive strength of travertine rocks
    Barzegar, Rahim
    Sattarpour, Masoud
    Deo, Ravinesh
    Fijani, Elham
    Adamowski, Jan
    NEURAL COMPUTING & APPLICATIONS, 2020, 32 (13): : 9065 - 9080
  • [40] Decision Tree-Based Ensemble Model for Predicting National Greenhouse Gas Emissions in Saudi Arabia
    Rahman, Muhammad Muhitur
    Shafiullah, Md
    Alam, Md Shafiul
    Rahman, Mohammad Shahedur
    Alsanad, Mohammed Ahmed
    Islam, Mohammed Monirul
    Islam, Md Kamrul
    Rahman, Syed Masiur
    APPLIED SCIENCES-BASEL, 2023, 13 (06):