Class-imbalanced dynamic financial distress prediction based on Adaboost-SVM ensemble combined with SMOTE and time weighting

被引:202
|
作者
Sun, Jie [1 ]
Li, Hui [2 ]
Fujita, Hamido [3 ]
Fu, Binbin [4 ]
Ai, Wenguo [5 ]
机构
[1] Tianjin Univ Finance & Econ, Sch Accountancy, Tianjin, Peoples R China
[2] Nankai Univ, Coll Tourism & Serv Management, Tianjin, Peoples R China
[3] Iwate Prefectural Univ, Fac Software & Informat Sci, Takizawa, Iwate, Japan
[4] Zhejiang Normal Univ, Sch Econ & Management, Jinhua, Zhejiang, Peoples R China
[5] Harbin Inst Technol, Management Sch, Harbin, Heilongjiang, Peoples R China
基金
中国国家自然科学基金;
关键词
Dynamic financial distress prediction; Class imbalance; SMOTE; Time weighting; Adaboost; Support vector machine; CLASSIFICATION; ALGORITHMS; SELECTION; FAILURE; RATIOS; DISTANCE; MACHINE;
D O I
10.1016/j.inffus.2019.07.006
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper focuses on how to effectively construct dynamic financial distress prediction models based on class-imbalanced data streams. Two class-imbalanced dynamic financial distress prediction approaches are proposed based on the synthetic minority oversampling technique (SMOTE) combined with the Adaboost support vector machine ensemble integrated with time weighting (ADASVM-TW). One is the simple integration model of SMOTE with ADASVM-TW, which uses SMOTE to make each data batch class-balanced before ADASVM-TW is applied for dynamic financial distress prediction modeling. The other is the embedding integration model of SMOTE with ADASVM-TW, which embeds SMOTE into the iteration of ADASVM-TW and creatively designs a new sample weighting mechanism. Namely, in each round of iteration, it does not only resample the majority financial normal samples by time weighting, but also generates more synthetic minority samples around new and difficult minority samples and less synthetic minority samples around old and easy minority samples to make the training dataset class-balanced. The empirical experiments are carried out based on the financial data of totally 2628 Chinese listed companies. Since there is certain degree of randomness in the proposed models, 50 times of experiments were performed under the same computational environment, so that the experimental results of model performance can be statistically compared and tested. The results indicate that both the simple integration model and the embedding integration model can greatly improve the recognition ability for the minority financial distress samples, and the embedding integration model is even more preferred because it also significantly outperforms the simple integration model.
引用
收藏
页码:128 / 144
页数:17
相关论文
共 18 条
  • [1] Dynamic class-imbalanced financial distress prediction based on case-based reasoning integrated with time weighting and resampling
    Sun, Jie
    Sun, Mingyang
    Zhao, Mengru
    Du, Yingying
    [J]. JOURNAL OF CREDIT RISK, 2023, 19 (01): : 39 - 73
  • [2] Dynamic financial distress prediction based on class-imbalanced data batches
    Sun, Jie
    Liu, Xin
    Ai, Wenguo
    Tian, Qianyuan
    [J]. INTERNATIONAL JOURNAL OF FINANCIAL ENGINEERING, 2021, 8 (03)
  • [3] Dynamic financial distress prediction with concept drift based on time weighting combined with Adaboost support vector machine ensemble
    Sun, Jie
    Fujita, Hamido
    Chen, Peng
    Li, Hui
    [J]. KNOWLEDGE-BASED SYSTEMS, 2017, 120 : 4 - 14
  • [4] Class-imbalanced dynamic financial distress prediction based on random forest from the perspective of concept drift
    Sun, Jie
    Zhao, Mengru
    Lei, Cong
    [J]. RISK MANAGEMENT-AN INTERNATIONAL JOURNAL, 2024, 26 (04):
  • [5] Combining weighted SMOTE with ensemble learning for the class-imbalanced prediction of small business credit risk
    Abedin, Mohammad Zoynul
    Guotai, Chi
    Hajek, Petr
    Zhang, Tong
    [J]. COMPLEX & INTELLIGENT SYSTEMS, 2023, 9 (04) : 3559 - 3579
  • [6] Combining weighted SMOTE with ensemble learning for the class-imbalanced prediction of small business credit risk
    Mohammad Zoynul Abedin
    Chi Guotai
    Petr Hajek
    Tong Zhang
    [J]. Complex & Intelligent Systems, 2023, 9 : 3559 - 3579
  • [7] Class-imbalanced financial distress prediction with machine learning: Incorporating financial, management, textual, and social responsibility features into index system
    Song, Yinghua
    Jiang, Minzhe
    Li, Shixuan
    Zhao, Shengzhe
    [J]. JOURNAL OF FORECASTING, 2024, 43 (03) : 593 - 614
  • [8] Predicting Extreme Financial Risks on Imbalanced Dataset: A Combined Kernel FCM and Kernel SMOTE Based SVM Classifier
    Huang, Xun
    Zhang, Cheng-Zhao
    Yuan, Jia
    [J]. COMPUTATIONAL ECONOMICS, 2020, 56 (01) : 187 - 216
  • [9] Predicting Extreme Financial Risks on Imbalanced Dataset: A Combined Kernel FCM and Kernel SMOTE Based SVM Classifier
    Xun Huang
    Cheng-Zhao Zhang
    Jia Yuan
    [J]. Computational Economics, 2020, 56 : 187 - 216
  • [10] Class-imbalanced Dynamical Financial Distress Prediction Based on Synthetic Minority Oversampling Technique and Local Weighted Scheme Integrated with Support Vector Machine
    Sun Jie
    Fu Bin-bin
    Li Hui
    Ai Wen-guo
    [J]. 2018 25TH ANNUAL INTERNATIONAL CONFERENCE ON MANAGEMENT SCIENCE & ENGINEERING, 2018, : 227 - 233