A quantitative benchmark of neural network feature selection methods for detecting nonlinear signals

Cited by: 1
Authors
Passemiers, Antoine [1]
Folco, Pietro [2]
Raimondi, Daniele [1,3]
Birolo, Giovanni [2]
Moreau, Yves [1]
Fariselli, Piero [2]
Affiliations
[1] Katholieke Univ Leuven, ESAT STADIUS, Leuven, Belgium
[2] Univ Torino, Dept Med Sci, Turin, Italy
[3] Univ Montpellier, Inst Genet Mol Montpellier, Montpellier, France
Source
SCIENTIFIC REPORTS | 2024 / Volume 14 / Issue 01
DOI
10.1038/s41598-024-82583-5
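Chinese Library Classification (CLC) number
O [Mathematical Sciences and Chemistry]; P [Astronomy and Earth Sciences]; Q [Biological Sciences]; N [General Natural Sciences]
Discipline classification code
07; 0710; 09
Abstract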
Classification and regression problems can be challenging when the relevant input features are diluted in noisy datasets, in particular when the sample size is limited. Traditional Feature Selection (FS) methods address this issue by relying on some assumptions such as the linear or additive relationship between features. Recently, a proliferation of Deep Learning (DL) models has emerged to tackle both FS and prediction at the same time, allowing non-linear modeling of the selected features. In this study, we systematically assess the performance of DL-based feature selection methods on synthetic datasets of varying complexity, and benchmark their efficacy in uncovering non-linear relationships between features. We also use the same settings to benchmark the reliability of gradient-based feature attribution techniques for Neural Networks (NNs), such as Saliency Maps (SM). A quantitative evaluation of the reliability of these approaches is currently missing. Our analysis indicates that even simple synthetic datasets can significantly challenge most of the DL-based FS and SM methods, while Random Forests, TreeShap, mRMR and LassoNet are the best performing FS methods. Our conclusion is that when quantifying the relevance of a few non-linearly entangled predictive features diluted in a large number of irrelevant noisy variables, DL-based FS and SM interpretation methods are still far from being reliable.
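To make the setting in the abstract concrete, the following is a minimal sketch, not the authors' benchmark: it assumes a hypothetical XOR-like synthetic dataset in which the label depends only on the sign of the product of two features, with all remaining features being pure noise. The function names, sample size, and feature count are illustrative assumptions. It compares a univariate linear score (absolute Pearson correlation of each feature with the label) against a simple pairwise interaction score.

```python
import random


def pearson(xs, ys):
    """Pearson correlation of two equal-length sequences (0.0 if either is constant)."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    vx = sum((x - mx) ** 2 for x in xs)
    vy = sum((y - my) ** 2 for y in ys)
    if vx == 0 or vy == 0:
        return 0.0
    return cov / (vx * vy) ** 0.5


def make_xor_dataset(n_samples=2000, n_features=20, seed=0):
    """Hypothetical synthetic data: features are i.i.d. uniform on [-1, 1];
    the label depends only on features 0 and 1, via the sign of their product."""
    rng = random.Random(seed)
    X = [[rng.uniform(-1.0, 1.0) for _ in range(n_features)]
         for _ in range(n_samples)]
    y = [1.0 if row[0] * row[1] > 0 else 0.0 for row in X]
    return X, y


def univariate_scores(X, y):
    """Absolute correlation of each single feature with the label."""
    return [abs(pearson([row[j] for row in X], y)) for j in range(len(X[0]))]


def pairwise_scores(X, y):
    """Absolute correlation of each pairwise feature product with the label."""
    n_features = len(X[0])
    scores = {}
    for i in range(n_features):
        for j in range(i + 1, n_features):
            product = [row[i] * row[j] for row in X]
            scores[(i, j)] = abs(pearson(product, y))
    return scores


X, y = make_xor_dataset()
uni = univariate_scores(X, y)
pair = pairwise_scores(X, y)
best_pair = max(pair, key=pair.get)

print("univariate scores of the two relevant features:", uni[0], uni[1])
print("largest univariate score over all features:", max(uni))
print("best-scoring feature pair:", best_pair, pair[best_pair])
```

In this toy setting, the two relevant features should be indistinguishable from noise under the univariate score, while the pairwise score should single out the pair (0, 1). The exhaustive pairwise search scales quadratically with the number of features and only covers second-order interactions, so it is an illustration of the problem rather than one of the FS methods compared in the study.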
Pages: 17
Related papers
50 in total
  • [1] OPTIMIZATION OF NEURAL NETWORK INPUTS BY FEATURE SELECTION METHODS
    Prochazka, Michal
    Oplatkova, Zuzana
    Holoska, Jiri
    Gerlich, Vladimir
    PROCEEDINGS - 25TH EUROPEAN CONFERENCE ON MODELLING AND SIMULATION, ECMS 2011, 2011, : 440 - 445
  • [2] Dermatology Diagnosis with Feature Selection Methods and Artificial Neural Network
    Abdul-Rahman, Shuzlina
    Norhan, Ahmad Khairil
    Yusoff, Marina
    Mohamed, Azlinah
    Mutalib, Sofianita
    2012 IEEE EMBS CONFERENCE ON BIOMEDICAL ENGINEERING AND SCIENCES (IECBES), 2012,
  • [3] Improving neural network methods for time domain fault analysis of nonlinear analog circuits by feature selection
    Ossowski, Marek
    INTERNATIONAL CONFERENCE ON SIGNALS AND ELECTRONIC SYSTEMS (ICSES '10): CONFERENCE PROCEEDINGS, 2010, : 301 - 304
  • [4] Design of nonlinear cellular neural network filters for detecting linear trajectory signals
    Muikaichi, M
    Kondo, K
    Hamada, N
    IEICE TRANSACTIONS ON FUNDAMENTALS OF ELECTRONICS COMMUNICATIONS AND COMPUTER SCIENCES, 1997, E80A (09) : 1655 - 1661
  • [5] Nonlinear Feature Selection Neural Network via Structured Sparse Regularization
    Wang, Rong
    Bian, Jintang
    Nie, Feiping
    Li, Xuelong
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, 34 (11) : 9493 - 9505
  • [6] Evaluation of feature selection methods based on artificial neural network weights
    da Costa, Nattane Luiza
    de Lima, Marcio Dias
    Barbosa, Rommel
    EXPERT SYSTEMS WITH APPLICATIONS, 2021, 168
  • [7] A review and benchmark of feature importance methods for neural networks
    Mandler, Hannes
    Weigand, Bernhard
    ACM COMPUTING SURVEYS, 2024, 56 (12)
  • [8] Feature extraction and selection of neural network
    Wu, CD
    Gao, F
    Ma, SH
    PROCEEDINGS OF THE 3RD WORLD CONGRESS ON INTELLIGENT CONTROL AND AUTOMATION, VOLS 1-5, 2000, : 1103 - 1106
  • [9] Analysis and improvements on feature selection methods based on artificial neural network weights
    da Costa, Nattane Luiza
    de Lima, Marcio Dias
    Barbosa, Rommel
    APPLIED SOFT COMPUTING, 2022, 127
  • [10] Feature Selection and Classification of Electroencephalographic Signals: An Artificial Neural Network and Genetic Algorithm Based Approach
    Erguzel, Turker Tekin
    Ozekes, Serhat
    Tan, Oguz
    Gultekin, Selahattin
    CLINICAL EEG AND NEUROSCIENCE, 2015, 46 (04) : 321 - 326