An experimental comparison of feature selection methods on two-class biomedical datasets

被引:57
|
作者
Drotar, P. [1 ]
Gazda, J. [2 ]
Smekal, Z. [1 ]
机构
[1] Brno Univ Technol, Dept Telecommun, Tech 12, Brno 61200, Czech Republic
[2] Tech Univ Kosice, Dept Comp & Informat, Kosice 0401, Slovakia
关键词
Feature selection; Stability; Classification performance; Univariate FS; Multivariate FS; MOLECULAR CLASSIFICATION; CLASS PREDICTION; CANCER; STABILITY; ALGORITHMS;
D O I
10.1016/j.compbiomed.2015.08.010
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Feature selection is a significant part of many machine learning applications dealing with small-sample and high-dimensional data. Choosing the most important features is an essential step for knowledge discovery in many areas of biomedical informatics. The increased popularity of feature selection methods and their frequent utilisation raise challenging new questions about the interpretability and stability of feature selection techniques. In this study, we compared the behaviour of ten state-of-the-art filter methods for feature selection in terms of their stability, similarity, and influence on prediction performance. All of the experiments were conducted on eight two-class datasets from biomedical areas. While entropy-based feature selection appears to be the most stable, the feature selection techniques yielding the highest prediction performance are minimum redundance maximum relevance method and feature selection based on Bhattacharyya distance. In general, univariate feature selection techniques perform similarly to or even better than more complex multivariate feature selection techniques with high-dimensional datasets. However, with more complex and smaller datasets multivariate methods slightly outperform univariate techniques. (C) 2015 Elsevier Ltd. All rights reserved.
引用
收藏
页码:1 / 10
页数:10
相关论文
共 50 条
  • [1] Feature Selection and Ensemble Learning Techniques in One-Class Classifiers: An Empirical Study of Two-Class Imbalanced Datasets
    Tsai, Chih-Fong
    Lin, Wei-Chao
    [J]. IEEE ACCESS, 2021, 9 : 13717 - 13726
  • [2] An Experimental Comparison of Feature-Selection and Classification Methods for Microarray Datasets
    Cilia, Nicole Dalia
    De Stefano, Claudio
    Fontanella, Francesco
    Raimondo, Stefano
    di Freca, Alessandra Scotto
    [J]. INFORMATION, 2019, 10 (03)
  • [3] Bi-objective feature selection for discriminant analysis in two-class classification
    Pacheco, Joaquin
    Casado, Silvia
    Angel-Bello, Francisco
    Alvarez, Ada
    [J]. KNOWLEDGE-BASED SYSTEMS, 2013, 44 : 57 - 64
  • [4] The impact of feature selection on one and two-class classification performance for plant microRNAs
    Khalifa, Waleed
    Yousef, Malik
    Demirci, Muserref Duygu Sacar
    Allmer, Jens
    [J]. PEERJ, 2016, 4
  • [5] EXPERIMENTAL COMPARISON OF TWO FEATURE SELECTION METHODS BASED ON GENERIC ALGORITHM
    Liu, Bo
    Zhai, Jun-Hai
    Liu, Hai-Bo
    [J]. PROCEEDINGS OF 2017 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS (ICMLC), VOL 1, 2017, : 241 - 245
  • [6] An Effective Metaheuristic for Bi-objective Feature Selection in Two-Class Classification Problem
    Lyubchenko, A. A.
    Pacheco, J. A.
    Casado, S.
    Nunez, L.
    [J]. XII INTERNATIONAL SCIENTIFIC AND TECHNICAL CONFERENCE APPLIED MECHANICS AND SYSTEMS DYNAMICS, 2019, 1210
  • [7] A New Feature Selection Algorithm for Two-Class Classification Problems and Application to Endometrial Cancer
    Ahsen, M. Eren
    Singh, Nitin K.
    Boren, Todd
    Vidyasagar, M.
    White, Michael A.
    [J]. 2012 IEEE 51ST ANNUAL CONFERENCE ON DECISION AND CONTROL (CDC), 2012, : 2976 - 2982
  • [8] Comparison of Feature Selection Methods in Text Classification on Highly Skewed Datasets
    Asim, Muhammad Nabeel
    Wasim, Muhammad
    Ali, Muhammad Sajid
    Rehman, Abdur
    [J]. 2017 FIRST INTERNATIONAL CONFERENCE ON LATEST TRENDS IN ELECTRICAL ENGINEERING AND COMPUTING TECHNOLOGIES (INTELLECT), 2017,
  • [9] Applications of Feature Selection Techniques on Large Biomedical Datasets
    Ewen, Nicolas
    Abdou, Tamer
    Bener, Ayse
    [J]. ADVANCES IN ARTIFICIAL INTELLIGENCE, 2019, 11489 : 543 - 548
  • [10] Two-Class with Oversampling Versus One-Class Classification for Microarray Datasets
    Perez-Sanchez, Beatriz
    Fontenla-Romero, Oscar
    Sanchez-Marono, Noelia
    [J]. ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2016, PT II, 2016, 9887 : 398 - 405