An interpretable decision tree ensemble model for imbalanced credit scoring datasets

被引:0
|
作者
My, Bui T. T. [1 ,2 ]
Ta, Bao Q. [3 ]
机构
[1] Ho Chi Minh Univ Banking, Dept Math Econ, Ho Chi Minh City, Vietnam
[2] Univ Econ Ho Chi Minh City, Fac Math & Stat, Ho Chi Minh City, Vietnam
[3] Vietnam Natl Univ, Int Univ, Dept Math, Ho Chi Minh City, Vietnam
关键词
Classifiers; credit scoring; decision tree; ensemble classifiers; imbalanced data; ART CLASSIFICATION ALGORITHMS; NEURAL-NETWORKS; SMOTE;
D O I
10.3233/JIFS-230825
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Credit scoring is a typical example of imbalanced classification, which poses a challenge to conventional machine learning algorithms and statistical classifiers when attempting to accurately predict outcomes for defaulting customers. In this paper, we propose a credit scoring classifier called Decision Tree Ensemble model (DTE). This model effectively addresses the challenge of imbalanced data and identifies significant features that influence the likelihood of credit status. An experiment demonstrates that DTE exhibits superior performance metrics in comparison to well-known based-tree ensemble classifiers such as Bagging, Random Forest, and AdaBoost, particularly when integrated with resampling techniques for handling imbalanced data.
引用
收藏
页码:10853 / 10864
页数:12
相关论文
共 50 条
  • [1] Interpretable machine learning for imbalanced credit scoring datasets
    Chen, Yujia
    Calabrese, Raffaella
    Martin-Barragan, Belen
    [J]. EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 2024, 312 (01) : 357 - 372
  • [2] Empirical Analysis of Ensemble Learning for Imbalanced Credit Scoring Datasets: A Systematic Review
    Lenka, Sudhansu R.
    Bisoy, Sukant Kishoro
    Priyadarshini, Rojalina
    Sain, Mangal
    [J]. WIRELESS COMMUNICATIONS & MOBILE COMPUTING, 2022, 2022
  • [3] Multiple optimized ensemble learning for high-dimensional imbalanced credit scoring datasets
    Lenka, Sudhansu R.
    Bisoy, Sukant Kishoro
    Priyadarshini, Rojalina
    [J]. KNOWLEDGE AND INFORMATION SYSTEMS, 2024, 66 (09) : 5429 - 5457
  • [4] A novel deep ensemble model for imbalanced credit scoring in internet finance
    Xiao, Jin
    Zhong, Yu
    Jia, Yanlin
    Wang, Yadong
    Li, Ruoyi
    Jiang, Xiaoyi
    Wang, Shouyang
    [J]. INTERNATIONAL JOURNAL OF FORECASTING, 2024, 40 (01) : 348 - 372
  • [5] Credit scoring prediction leveraging interpretable ensemble learning
    Liu, Yang
    Huang, Fei
    Ma, Lili
    Zeng, Qingguo
    Shi, Jiale
    [J]. JOURNAL OF FORECASTING, 2024, 43 (02) : 286 - 308
  • [6] An Incremental Learning Ensemble Method for Imbalanced Credit Scoring
    Tian, Jin
    Liu, Xinye
    Li, Minqiang
    [J]. 2019 IEEE SYMPOSIUM SERIES ON COMPUTATIONAL INTELLIGENCE (IEEE SSCI 2019), 2019, : 754 - 759
  • [7] Tree-based heterogeneous cascade ensemble model for credit scoring
    Liu, Wanan
    Fan, Hong
    Xia, Meng
    [J]. INTERNATIONAL JOURNAL OF FORECASTING, 2023, 39 (04) : 1593 - 1614
  • [8] Decision Trees as Interpretable Bank Credit Scoring Models
    Szwabe, Andrzej
    Misiorek, Pawel
    [J]. BEYOND DATABASES, ARCHITECTURES AND STRUCTURES: FACING THE CHALLENGES OF DATA PROLIFERATION AND GROWING VARIETY, 2018, 928 : 207 - 219
  • [9] ASE: Anomaly scoring based ensemble learning for highly imbalanced datasets
    Liang, Xiayu
    Gao, Ying
    Xu, Shanrong
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2024, 238
  • [10] A heterogeneous ensemble credit scoring model based on adaptive classifier selection: An application on imbalanced data
    Zhang, Tong
    Chi, Guotai
    [J]. INTERNATIONAL JOURNAL OF FINANCE & ECONOMICS, 2021, 26 (03) : 4372 - 4385