Improving Evolutionary Algorithm Performance for Feature Selection in High-Dimensional Data

Cited by: 0
Authors
Cilia, N. [1]
De Stefano, C. [1]
Fontanella, F. [1]
di Freca, A. Scotto [1]
Affiliations
[1] Univ Cassino & Lazio Meridionale, DIEI, Via G Di Biasio 43, I-03043 Cassino, FR, Italy
Keywords
Feature selection; High-dimensional data; Feature ranking; Evolutionary algorithms; Genetic algorithm; Classification; Optimization; Strategy
DOI
10.1007/978-3-319-77538-8_30
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
In classification and clustering problems, selecting a subset of discriminative features is challenging, especially when hundreds or thousands of features are involved. In this context, Evolutionary Computation (EC) techniques have attracted growing scientific interest in recent years because they can explore large search spaces without requiring any a priori knowledge or assumptions about the domain under consideration. Following this line of thought, we developed a novel strategy to improve the performance of EC-based algorithms for feature selection. The proposed strategy first ranks the whole set of available features according to a univariate evaluation function; an evolutionary algorithm then searches the space represented by the first M ranked features for feature subsets with high discriminative power. Comparative results demonstrated the effectiveness of the proposed approach in improving the performance obtainable with three effective and widely used EC-based algorithms for feature selection in high-dimensional data problems, namely Ant Colony Optimization (ACO), Particle Swarm Optimization (PSO) and Artificial Bee Colony (ABC).
Pages: 439-454
Page count: 16
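
The two-stage strategy summarized in the abstract (rank all features with a univariate function, then run an evolutionary search restricted to the first M ranked features) can be illustrated with a minimal sketch. Everything concrete here is an assumption made for illustration, not the authors' method: the abstract names neither the ranking function nor the fitness function, so a Fisher-like score for two classes and leave-one-out 1-nearest-neighbour accuracy are used as placeholders, and a plain genetic algorithm stands in for ACO, PSO and ABC. All function names and the toy data are hypothetical.

```python
import random
from statistics import mean, pvariance


def univariate_score(column, labels):
    """Assumed ranking function: squared class-mean gap over summed class variances."""
    a = [x for x, lab in zip(column, labels) if lab == 0]
    b = [x for x, lab in zip(column, labels) if lab == 1]
    return (mean(a) - mean(b)) ** 2 / (pvariance(a) + pvariance(b) + 1e-12)


def rank_features(X, y):
    """Return all feature indices sorted from best to worst univariate score."""
    n_features = len(X[0])
    scores = [univariate_score([row[j] for row in X], y) for j in range(n_features)]
    return sorted(range(n_features), key=lambda j: scores[j], reverse=True)


def subset_fitness(X, y, subset):
    """Assumed fitness: leave-one-out accuracy of 1-nearest-neighbour on the subset."""
    if not subset:
        return 0.0
    correct = 0
    for i, row in enumerate(X):
        nearest = min(
            (k for k in range(len(X)) if k != i),
            key=lambda k: sum((row[j] - X[k][j]) ** 2 for j in subset),
        )
        correct += y[nearest] == y[i]
    return correct / len(X)


def evolve(X, y, candidates, generations=20, pop_size=20, mutation_rate=0.1, seed=0):
    """Plain genetic algorithm over bit masks of the candidate features (needs >= 2)."""
    rng = random.Random(seed)
    m = len(candidates)

    def decode(mask):
        return [candidates[i] for i in range(m) if mask[i]]

    def fitness(mask):
        return subset_fitness(X, y, decode(mask))

    population = [[rng.random() < 0.5 for _ in range(m)] for _ in range(pop_size)]
    for _ in range(generations):
        # Truncation selection: the better half survives and breeds.
        parents = sorted(population, key=fitness, reverse=True)[: pop_size // 2]
        children = []
        while len(parents) + len(children) < pop_size:
            p, q = rng.sample(parents, 2)
            cut = rng.randrange(1, m)  # one-point crossover
            child = p[:cut] + q[cut:]
            children.append([(not g) if rng.random() < mutation_rate else g for g in child])
        population = parents + children
    best = max(population, key=fitness)
    return decode(best), fitness(best)


def make_toy_data(n_samples=40, n_features=30, seed=1):
    """Hypothetical data: features 0 and 1 carry the class signal, the rest are noise."""
    rng = random.Random(seed)
    X, y = [], []
    for i in range(n_samples):
        label = i % 2
        row = [rng.gauss(0, 1) for _ in range(n_features)]
        row[0] += 4 * label
        row[1] -= 4 * label
        X.append(row)
        y.append(label)
    return X, y


if __name__ == "__main__":
    X, y = make_toy_data()
    ranked = rank_features(X, y)
    M = 5  # size of the reduced search space (illustrative value)
    subset, accuracy = evolve(X, y, ranked[:M])
    print("top-M features:", ranked[:M])
    print("selected subset:", subset, "fitness:", accuracy)
```

The point of the sketch is the shape of the search space: the evolutionary stage encodes individuals as masks of length M rather than of length equal to the full feature count, which is where the reduction in search effort would come from.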