The effect of feature selection on financial distress prediction

被引:138
|
作者
Liang, Deron [1 ]
Tsai, Chih-Fong [2 ]
Wu, Hsin-Ting [1 ]
机构
[1] Natl Cent Univ, Dept Comp Sci & Informat Engn, Tainan, Taiwan
[2] Natl Cent Univ, Dept Informat Management, Tainan, Taiwan
关键词
Financial distress prediction; Bankruptcy prediction; Credit scoring; Feature selection; Data mining; INTEGRATING FEATURE-SELECTION; PARTICLE SWARM OPTIMIZATION; SUPPORT VECTOR MACHINES; DISCRIMINANT-ANALYSIS; FAILURE PREDICTION; NEURAL-NETWORKS; CLASSIFICATION; ALGORITHMS; ENSEMBLE; BANKS;
D O I
10.1016/j.knosys.2014.10.010
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Financial distress prediction is always important for financial institutions in order for them to assess the financial health of enterprises and individuals. Bankruptcy prediction and credit scoring are two important issues in financial distress prediction where various statistical and machine learning techniques have been employed to develop financial prediction models. Since there are no generally agreed upon financial ratios as input features for model development, many studies consider feature selection as a pre-processing step in data mining before constructing the models. However, most works only focused on applying specific feature selection methods over either bankruptcy prediction or credit scoring problem domains. In this work, a comprehensive study is conducted to examine the effect of performing filter and wrapper based feature selection methods on financial distress prediction. In addition, the effect of feature selection on the prediction models obtained using various classification techniques is also investigated. In the experiments, two bankruptcy and two credit datasets are used. In addition, three filter and two wrapper based feature selection methods combined with six different prediction models are studied. Our experimental results show that there is no the best combination of the feature selection method and the classification technique over the four datasets. Moreover, depending on the chosen techniques, performing feature selection does not always improve the prediction performance. However, on average performing the genetic algorithm and logistic regression for feature selection can provide prediction improvements over the credit and bankruptcy datasets respectively. (C) 2014 Elsevier B.V. All rights reserved.
引用
收藏
页码:289 / 297
页数:9
相关论文
共 50 条
  • [1] Novel feature selection methods to financial distress prediction
    Lin, Fengyi
    Liang, Deron
    Yeh, Ching-Chiang
    Huang, Jui-Chieh
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2014, 41 (05) : 2472 - 2483
  • [2] Financial distress prediction based on ensemble feature selection and improved stacking algorithm
    Wu, Chong
    Chen, Xiaofang
    Jiang, Yongjie
    [J]. KYBERNETES, 2024,
  • [3] Combining feature selection, instance selection, and ensemble classification techniques for improved financial distress prediction
    Tsai, Chih-Fong
    Sue, Kuen-Liang
    Hu, Ya-Han
    Chiu, Andy
    [J]. JOURNAL OF BUSINESS RESEARCH, 2021, 130 : 200 - 209
  • [4] Missing value imputation and the effect of feature normalisation on financial distress prediction
    Sue, Kuen-Liang
    Tsai, Chih-Fong
    Tsau, Hau-Min
    [J]. JOURNAL OF EXPERIMENTAL & THEORETICAL ARTIFICIAL INTELLIGENCE, 2022,
  • [5] Financial distress prediction using a corrected feature selection measure and gradient boosted decision tree
    Qian, Hongyi
    Wang, Baohui
    Yuan, Minghe
    Gao, Songfeng
    Song, You
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2022, 190
  • [6] A comparative study of feature selection and feature extraction methods for financial distress identification
    Kuiziniene, Dovile
    Savickas, Paulius
    Kunickaite, Rimante
    Juozaitiene, Ruta
    Damasevicius, Robertas
    Maskeliunas, Rytis
    Krilavicius, Tomas
    [J]. PEERJ COMPUTER SCIENCE, 2024, 10
  • [7] IMPACT OF FEATURE SELECTION AND FEATURE TYPES ON FINANCIAL STOCK PRICE PREDICTION
    Hagenau, Michael
    Liebmann, Michael
    Neumann, Dirk
    [J]. KDIR 2011: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND INFORMATION RETRIEVAL, 2011, : 303 - 308
  • [8] CUS-heterogeneous ensemble-based financial distress prediction for imbalanced dataset with ensemble feature selection
    Du, Xudong
    Li, Wei
    Ruan, Sumei
    Li, Li
    [J]. APPLIED SOFT COMPUTING, 2020, 97
  • [9] On the utility of input selection and pruning for financial distress prediction models
    Becerra, VM
    Galvao, RKH
    Abou-Seada, M
    [J]. PROCEEDING OF THE 2002 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, VOLS 1-3, 2002, : 1328 - 1333
  • [10] Incorporating Multiple Textual Factors into Unbalanced Financial Distress Prediction: A Feature Selection Methods and Ensemble Classifiers Combined Approach
    Li, Shixuan
    Shi, Wenxuan
    [J]. INTERNATIONAL JOURNAL OF COMPUTATIONAL INTELLIGENCE SYSTEMS, 2023, 16 (01)