A comparative study of feature selection and feature extraction methods for financial distress identification

Cited by: 0
Authors
Kuizinienė D. [1]
Savickas P. [1]
Kunickaitė R. [1]
Juozaitienė R. [1]
Damaševičius R. [1]
Maskeliūnas R. [2]
Krilavičius T. [1]
Affiliations
[1] Department of Applied Informatics, Vytautas Magnus University, Kaunas
[2] Silesian University of Technology, Gliwice
Keywords
Bankruptcy prediction; Dimensionality reduction; Feature extraction; Feature selection; Financial distress; Insolvency; Machine learning
DOI
10.7717/PEERJ-CS.1956
Abstract
Financial distress identification remains an essential topic in the scientific literature due to its importance for society and the economy. Advances in information technology and the growing volume of stored data mean that financial distress identification now extends beyond financial statements and their indicators (ratios). The feature space can be expanded by incorporating additional feature categories such as macroeconomic, sector, social, board, management, and judicial incident data. However, the increased dimensionality results in sparse data and overfitted models. This study proposes a new approach for efficient financial distress classification by combining dimensionality reduction and machine learning techniques. The proposed framework aims to identify a subset of features that minimizes the loss function describing financial distress in an enterprise. During the study, 15 dimensionality reduction techniques with different numbers of features and 17 machine learning models were compared. Overall, 1,432 experiments were performed using Lithuanian enterprise data covering the period from 2015 to 2022. Results revealed that the artificial neural network (ANN) model with 30 ranked features identified using the Random Forest mean decreasing Gini (RF_MDG) feature selection technique provided the highest AUC score. Moreover, this study introduces a novel approach for feature extraction, which could improve financial distress classification models. © (2024), Kuizinienė et al.
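The ranking-then-scoring idea in the abstract can be illustrated with a minimal, standard-library Python sketch. It is not the authors' pipeline: instead of a Random Forest, it ranks each feature by the largest Gini impurity decrease of a single threshold split (a one-level simplification of the mean-decrease-Gini idea), and it computes AUC with the pairwise rank formulation. The synthetic data, the function names, and the choice of keeping 2 features (the study reports 30) are all assumptions made for illustration.

```python
import random

def gini(labels):
    """Gini impurity of a list of binary labels (0 or 1)."""
    n = len(labels)
    if n == 0:
        return 0.0
    p = sum(labels) / n
    return 2.0 * p * (1.0 - p)

def best_gini_decrease(values, labels):
    """Largest impurity decrease from one threshold split on a single feature."""
    pairs = sorted(zip(values, labels))
    n = len(pairs)
    parent = gini([y for _, y in pairs])
    best = 0.0
    for i in range(1, n):
        if pairs[i - 1][0] == pairs[i][0]:
            continue  # no threshold fits between equal values
        left = [y for _, y in pairs[:i]]
        right = [y for _, y in pairs[i:]]
        child = (len(left) * gini(left) + len(right) * gini(right)) / n
        best = max(best, parent - child)
    return best

def rank_features(X, y):
    """Feature indices sorted from highest to lowest Gini decrease."""
    scores = [best_gini_decrease([row[j] for row in X], y)
              for j in range(len(X[0]))]
    order = sorted(range(len(scores)), key=lambda j: scores[j], reverse=True)
    return order, scores

def auc(scores, labels):
    """AUC: chance that a random positive scores above a random negative."""
    pos = [s for s, l in zip(scores, labels) if l == 1]
    neg = [s for s, l in zip(scores, labels) if l == 0]
    wins = sum(1.0 if p > q else 0.5 if p == q else 0.0
               for p in pos for q in neg)
    return wins / (len(pos) * len(neg))

# Hypothetical synthetic data: 5 random features, label driven by feature 2.
random.seed(0)
X = [[random.random() for _ in range(5)] for _ in range(200)]
y = [1 if row[2] > 0.5 else 0 for row in X]

ranking, importances = rank_features(X, y)
top_k = ranking[:2]  # keep the 2 highest-ranked features (illustrative k)
print("ranking:", ranking)
print("AUC of top feature:", auc([row[ranking[0]] for row in X], y))
```

In a fuller experiment the selected columns would be passed to a classifier such as a neural network and the AUC would be measured on held-out data; here the top-ranked feature is scored directly only to keep the sketch short.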
Related papers (50 in total)
  • [21] A Comparative Study of Evolutionary Methods for Feature Selection in Sentiment Analysis
    Garg, Shikhar
    Verma, Sukriti
    IJCCI: PROCEEDINGS OF THE 11TH INTERNATIONAL JOINT CONFERENCE ON COMPUTATIONAL INTELLIGENCE, 2019, : 131 - 138
  • [22] Feature Extraction Methods in ECG: Comparative Analysis
    Neto, J. E.
    Suarez-Leon, A. A.
    Vazquez-Seisdedos, C. R.
    Lopez-Mora, N. A.
    Leite, J. C.
    Oliveira, R. C. L.
    5TH LATIN AMERICAN CONGRESS ON BIOMEDICAL ENGINEERING (CLAIB 2011): SUSTAINABLE TECHNOLOGIES FOR THE HEALTH OF ALL, PTS 1 AND 2, 2013, 33 (1-2) : 858 - 861
  • [23] EEG Feature Extraction and Selection Techniques for Epileptic Detection: A Comparative Study
    Hussein, Ramy
    Mohamed, Amr
    Shaban, Khaled
    Mohamed, Abduljalil A.
    2013 IEEE SYMPOSIUM ON COMPUTERS AND INFORMATICS (ISCI 2013), 2013,
  • [24] Feature Extraction Methods in Language Identification: A Survey
    Deepti Deshwal
    Pardeep Sangwan
    Divya Kumar
    Wireless Personal Communications, 2019, 107 : 2071 - 2103
  • [25] Feature Extraction Methods in Language Identification: A Survey
    Deshwal, Deepti
    Sangwan, Pardeep
    Kumar, Divya
    WIRELESS PERSONAL COMMUNICATIONS, 2019, 107 (04) : 2071 - 2103
  • [26] Feature Extraction and Feature Selection Methods in Classification of Brain MRI Images: A Review
    Poernama, Aqidatul Izza
    Soesanti, Indah
    Wahyunggoro, Oyas
    2019 INTERNATIONAL BIOMEDICAL INSTRUMENTATION AND TECHNOLOGY CONFERENCE (IBITEC), 2019, : 58 - 63
  • [27] Feature Extraction Methods in Quantitative Structure-Activity Relationship Modeling: A Comparative Study
    Alsenan, Shrooq A.
    Al-Turaiki, Isra M.
    Hafez, Alaaeldin M.
    IEEE ACCESS, 2020, 8 : 78737 - 78752
  • [28] Comparative study of biohashing technique using different feature extraction methods
    Saini, Nirmala
    Sinha, Aloka
    PHOTONICS 2010: TENTH INTERNATIONAL CONFERENCE ON FIBER OPTICS AND PHOTONICS, 2011, 8173
  • [29] A comparative study on deep feature selection methods for skin lesion classification
    Golnoori, Farzad
    Boroujeni, Farsad Zamani
    Monadjemi, Seyed Amirhassan
    IET IMAGE PROCESSING, 2024, 18 (04) : 996 - 1013
  • [30] Feature Selection on a Flare Forecasting Testbed: A Comparative Study of 24 Methods
    Yeolekar, Atharv
    Patel, Sagar
    Talla, Shreejaa
    Puthucode, Krishna Rukmini
    Ahmadzadeh, Azim
    Sadykov, Viacheslav M.
    Angryk, Rafal A.
    21ST IEEE INTERNATIONAL CONFERENCE ON DATA MINING WORKSHOPS ICDMW 2021, 2021, : 1067 - 1076