A novel tree-based dynamic heterogeneous ensemble method for credit scoring

被引:54
|
作者
Xia, Yufei [1 ]
Zhao, Junhao [2 ]
He, Lingyun [3 ]
Li, Yinguo [1 ]
Niu, Mengyi [4 ]
机构
[1] Jiangsu Normal Univ, Business Sch, Xuzhou 221116, Jiangsu, Peoples R China
[2] Jiangsu Normal Univ, Sino Russian Inst, Xuzhou 221116, Jiangsu, Peoples R China
[3] China Univ Min & Technol, Sch Econ & Management, Xuzhou 221116, Jiangsu, Peoples R China
[4] Jiangsu Normal Univ, Law Sch, Xuzhou 221116, Jiangsu, Peoples R China
基金
中国国家自然科学基金;
关键词
Credit scoring; Selective ensemble; Random forests; Gradient boosting decision tree; Machine learning; NEURAL-NETWORK ENSEMBLE; ART CLASSIFICATION ALGORITHMS; RISK-ASSESSMENT; BANKRUPTCY PREDICTION; GENETIC ALGORITHM; REJECT INFERENCE; MODEL; CLASSIFIERS; SELECTION; DIVERSITY;
D O I
10.1016/j.eswa.2020.113615
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Ensemble models have been extensively applied to credit scoring. However, advanced tree-based classifiers have been seldom utilized as components of ensemble models. Moreover, few studies have considered dynamic ensemble selection. To fill the research gap, this paper aims to develop a novel tree-based overfitting-cautious heterogeneous ensemble model (i.e., OCHE) for credit scoring which departs from existing literature on base models and ensemble selection strategy. Regarding base models, tree-based techniques are employed to acquire a balance between predictive accuracy and computational cost. In terms of ensemble selection, the proposed method can assign weights to base models dynamically according to the overfitting measure. Validated on five public datasets, the proposed approach is compared with several popular benchmark models and selection strategies on predictive accuracy and computational cost measures. For predictive accuracy, the proposed approach outperforms the benchmark models significantly in most cases based on the non-parametric significance test. It also performs marginally better than several state-of-the-art studies. Our proposal remains robust in several scenarios. In terms of computational cost, the proposed method provides acceptable performance and benefits from GPU acceleration considerably. (C) 2020 Elsevier Ltd. All rights reserved.
引用
收藏
页数:18
相关论文
共 50 条
  • [21] The Predictive Capability of a Novel Ensemble Tree-Based Algorithm for Assessing Groundwater Potential
    Park, Soyoung
    Kim, Jinsoo
    SUSTAINABILITY, 2021, 13 (05) : 1 - 19
  • [22] An interpretable decision tree ensemble model for imbalanced credit scoring datasets
    My, Bui T. T.
    Ta, Bao Q.
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2023, 45 (06) : 10853 - 10864
  • [23] Dynamic ensemble classification for credit scoring using soft probability
    Feng, Xiaodong
    Xiao, Zhi
    Zhong, Bo
    Qiu, Jing
    Dong, Yuanxiang
    APPLIED SOFT COMPUTING, 2018, 65 : 139 - 151
  • [24] Novel Credal Decision Tree-Based Ensemble Approaches for Predicting the Landslide Susceptibility
    Arabameri, Alireza
    Karimi-Sangchini, Ebrahim
    Pal, Subodh Chandra
    Saha, Asish
    Chowdhuri, Indrajit
    Lee, Saro
    Tien Bui, Dieu
    REMOTE SENSING, 2020, 12 (20) : 1 - 27
  • [25] A heterogeneous ensemble credit scoring model based on adaptive classifier selection: An application on imbalanced data
    Zhang, Tong
    Chi, Guotai
    INTERNATIONAL JOURNAL OF FINANCE & ECONOMICS, 2021, 26 (03) : 4372 - 4385
  • [26] EnHAT-Synergy of a tree-based Ensemble with Hoeffding Adaptive Tree for dynamic data streams mining
    Weinberg, Abraham Itzhak
    Last, Mark
    INFORMATION FUSION, 2023, 89 : 397 - 404
  • [27] Tree-Based Ensemble Multi-Task Learning Method for Classification and Regression
    Simm, Jaak
    Magrans De Abril, Ildefons
    Sugiyama, Masashi
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2014, E97D (06) : 1677 - 1681
  • [28] A Novel Enterprise Credit Scoring Method Based On Random Forest
    Wu Jing
    Dong Huailin
    Wu Qingfeng
    Wang Wei
    ICCSE 2008: PROCEEDINGS OF THE THIRD INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE & EDUCATION: ADVANCED COMPUTER TECHNOLOGY, NEW EDUCATION, 2008, : 188 - 192
  • [29] Development of an Ensemble Decision Tree-Based Power System Dynamic Security State Predictor
    Mukherjee, Rituparna
    De, Abhinandan
    IEEE SYSTEMS JOURNAL, 2020, 14 (03): : 3836 - 3843
  • [30] Ensemble classification based on supervised clustering for credit scoring
    Xiao, Hongshan
    Xiao, Zhi
    Wang, Yu
    APPLIED SOFT COMPUTING, 2016, 43 : 73 - 86