A comparative study of feature selection and feature extraction methods for financial distress identification

被引:0
|
作者
Kuizinienė D. [1 ]
Savickas P. [1 ]
Kunickaitė R. [1 ]
Juozaitienė R. [1 ]
Damaševičius R. [1 ]
Maskeliūnas R. [2 ]
Krilavičius T. [1 ]
机构
[1] Department of Applied Informatics, Vytautas Magnus University, Kaunas
[2] Silesian University of Technology, Gliwice
关键词
Bankruptcy prediction; Dimensionality reduction; Feature extraction; Feature selection; Financial distress; Insolvency; Machine learning;
D O I
10.7717/PEERJ-CS.1956
中图分类号
学科分类号
摘要
Financial distress identification remains an essential topic in the scientific literature due to its importance for society and the economy. The advancements in information technology and the escalating volume of stored data have led to the emergence of financial distress that transcends the realm of financial statements and its’ indicators (ratios). The feature space could be expanded by incorporating new perspectives on feature data categories such as macroeconomics, sectors, social, board, management, judicial incident, etc. However, the increased dimensionality results in sparse data and overfitted models. This study proposes a new approach for efficient financial distress classification assessment by combining dimensionality reduction and machine learning techniques. The proposed framework aims to identify a subset of features leading to the minimization of the loss function describing the financial distress in an enterprise. During the study, 15 dimensionality reduction techniques with different numbers of features and 17 machine-learning models were compared. Overall, 1,432 experiments were performed using Lithuanian enterprise data covering the period from 2015 to 2022. Results revealed that the artificial neural network (ANN) model with 30 ranked features identified using the Random Forest mean decreasing Gini (RF_MDG) feature selection technique provided the highest AUC score. Moreover, this study has introduced a novel approach for feature extraction, which could improve financial distress classification models. © (2024), Kuizinienė et al.
引用
收藏
相关论文
共 50 条
  • [1] A comparative study of feature selection and feature extraction methods for financial distress identification
    Kuiziniene, Dovile
    Savickas, Paulius
    Kunickaite, Rimante
    Juozaitiene, Ruta
    Damasevicius, Robertas
    Maskeliunas, Rytis
    Krilavicius, Tomas
    PEERJ COMPUTER SCIENCE, 2024, 10
  • [2] Novel feature selection methods to financial distress prediction
    Lin, Fengyi
    Liang, Deron
    Yeh, Ching-Chiang
    Huang, Jui-Chieh
    EXPERT SYSTEMS WITH APPLICATIONS, 2014, 41 (05) : 2472 - 2483
  • [3] A comparative study of bread wheat varieties identification on feature extraction, feature selection and machine learning algorithms
    Kilicarslan, Serhat
    Kilicarslan, Sabire
    EUROPEAN FOOD RESEARCH AND TECHNOLOGY, 2024, 250 (01) : 135 - 149
  • [4] A comparative study of bread wheat varieties identification on feature extraction, feature selection and machine learning algorithms
    Serhat Kılıçarslan
    Sabire Kılıçarslan
    European Food Research and Technology, 2024, 250 : 135 - 149
  • [5] Comparative Study of ECG Feature Extraction Methods
    Agrawal, Akanksha
    Gawali, Dhanashri H.
    2017 2ND IEEE INTERNATIONAL CONFERENCE ON RECENT TRENDS IN ELECTRONICS, INFORMATION & COMMUNICATION TECHNOLOGY (RTEICT), 2017, : 2021 - 2025
  • [6] Comparative Study of ECG Feature Extraction Methods
    Agrawal, Akanksha
    Gawali, Dhanashri H.
    2017 2ND IEEE INTERNATIONAL CONFERENCE ON RECENT TRENDS IN ELECTRONICS, INFORMATION & COMMUNICATION TECHNOLOGY (RTEICT), 2017, : 246 - 250
  • [7] The effect of feature selection on financial distress prediction
    Liang, Deron
    Tsai, Chih-Fong
    Wu, Hsin-Ting
    KNOWLEDGE-BASED SYSTEMS, 2015, 73 : 289 - 297
  • [8] Filter methods for feature selection -: A comparative study
    Sanchez-Marono, Noelia
    Alonso-Betanzos, Amparo
    Tombilla-Sanroman, Marfa
    INTELLIGENT DATA ENGINEERING AND AUTOMATED LEARNING - IDEAL 2007, 2007, 4881 : 178 - 187
  • [9] A Comparative Study of Redundant Feature Detection based Feature Selection Methods
    Zeng, Xue-Qiang
    Chen, Qian-Sheng
    2014 INTERNATIONAL CONFERENCE ON COMPUTER, INFORMATION AND TELECOMMUNICATION SYSTEMS (CITS), 2014,
  • [10] Analysis and Evaluation of Feature Selection and Feature Extraction Methods
    Nogales, Ruben E.
    Benalcazar, Marco E.
    INTERNATIONAL JOURNAL OF COMPUTATIONAL INTELLIGENCE SYSTEMS, 2023, 16 (01)