A novel deep ensemble model for imbalanced credit scoring in internet finance

被引:3
|
作者
Xiao, Jin [1 ,2 ]
Zhong, Yu [1 ]
Jia, Yanlin [3 ]
Wang, Yadong [4 ]
Li, Ruoyi [5 ]
Jiang, Xiaoyi [6 ]
Wang, Shouyang [7 ]
机构
[1] Sichuan Univ, Business Sch, Chengdu 610065, Peoples R China
[2] Sichuan Univ, Inst Management Sci & Operat Res, Chengdu 610065, Peoples R China
[3] Southwest Petr Univ, Sch Sci, Chengdu 610500, Peoples R China
[4] Guangzhou Univ, Pubilc Adm Sch, Guangzhou 510006, Guangdong, Peoples R China
[5] Univ Chicago, Harris Sch Publ Policy, Chicago, IL 60637 USA
[6] Univ Munster, Fac Math & Comp Sci, D-48149 Munster, Germany
[7] ShanghaiTech Univ, Sch Entrepreneurship & Management, Shanghai 201210, Peoples R China
基金
中国国家自然科学基金;
关键词
Credit scoring; Deep ensemble; Class imbalance; VAE; Deep forest; STATISTICAL COMPARISONS; INSTANCE SELECTION; DATA SETS; CLASSIFIERS; SMOTE; RISK; NETWORKS;
D O I
10.1016/j.ijforecast.2023.03.004
中图分类号
F [经济];
学科分类号
02 ;
摘要
Most existing deep ensemble credit scoring models have considered deep neural net-works, for which the structures are difficult to design and the modeling results are difficult to interpret. Moreover, the methods of dealing with the class-imbalance problem in these studies are still based on traditional resampling methods. To fill these gaps, we combine a new over-sampling method, the variational autoencoder (VAE), and a deep ensemble classifier, the deep forest (DF), and propose a novel deep ensemble model for credit scoring in internet finance, VAE-DF. We train and test our model using a number of credit scoring datasets in internet finance and find that our model exhibits good performance and can realize a self-adapting depth. The results show that VAE-DF is an effective credit scoring tool, especially for highly class-imbalanced and non-linear datasets in internet finance, due to its strong ability to learn the complex distributions of these datasets.(c) 2023 International Institute of Forecasters. Published by Elsevier B.V. All rights reserved.
引用
收藏
页码:348 / 372
页数:25
相关论文
共 50 条
  • [1] An interpretable decision tree ensemble model for imbalanced credit scoring datasets
    My, Bui T. T.
    Ta, Bao Q.
    [J]. JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2023, 45 (06) : 10853 - 10864
  • [2] A Novel Multi-Stage Ensemble Model With a Hybrid Genetic Algorithm for Credit Scoring on Imbalanced Data
    Jin, Yilun
    Zhang, Wenyu
    Wu, Xin
    Liu, Yanan
    Hu, Zeqian
    [J]. IEEE ACCESS, 2021, 9 : 143593 - 143607
  • [3] An Incremental Learning Ensemble Method for Imbalanced Credit Scoring
    Tian, Jin
    Liu, Xinye
    Li, Minqiang
    [J]. 2019 IEEE SYMPOSIUM SERIES ON COMPUTATIONAL INTELLIGENCE (IEEE SSCI 2019), 2019, : 754 - 759
  • [4] A Novel Ensemble Learning Model Combined XGBoost With Deep Neural Network for Credit Scoring
    He, Xiaowei
    Li, Siqi
    He, Xin Tian
    Wang, Wenqiang
    Zhang, Xiang
    Wang, Bin
    [J]. JOURNAL OF INFORMATION TECHNOLOGY RESEARCH, 2022, 15 (01)
  • [5] A heterogeneous ensemble credit scoring model based on adaptive classifier selection: An application on imbalanced data
    Zhang, Tong
    Chi, Guotai
    [J]. INTERNATIONAL JOURNAL OF FINANCE & ECONOMICS, 2021, 26 (03) : 4372 - 4385
  • [6] A novel method for credit scoring based on feature transformation and ensemble model
    Li, Hongxiang
    Feng, Ao
    Lin, Bin
    Su, Houcheng
    Liu, Zixi
    Duan, Xuliang
    Pu, Haibo
    Wang, Yifei
    [J]. PEERJ COMPUTER SCIENCE, 2021,
  • [7] A Novel Method for Credit Scoring Based on Feature Transformation and Ensemble model
    Li, Hongxiang
    Feng, Ao
    Lin, Bin
    Su, Houcheng
    Liu, Zixi
    Duan, Xuliang
    Pu, Haibo
    Wang, Yifei
    [J]. PeerJ Computer Science, 2021, 7 : 1 - 18
  • [8] A novel heterogeneous ensemble credit scoring model based on bstacking approach
    Xia, Yufei
    Liu, Chuanzhe
    Da, Bowen
    Xie, Fangming
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2018, 93 : 182 - 199
  • [9] Empirical Analysis of Ensemble Learning for Imbalanced Credit Scoring Datasets: A Systematic Review
    Lenka, Sudhansu R.
    Bisoy, Sukant Kishoro
    Priyadarshini, Rojalina
    Sain, Mangal
    [J]. WIRELESS COMMUNICATIONS & MOBILE COMPUTING, 2022, 2022
  • [10] A novel hybrid credit scoring model based on ensemble feature selection and multilayer ensemble classification
    Tripathi, Diwakar
    Edla, Damodar Reddy
    Cheruku, Ramalingaswamy
    Kuppili, Venkatanareshbabu
    [J]. COMPUTATIONAL INTELLIGENCE, 2019, 35 (02) : 371 - 394