Feature selection for high-dimensional classification using a competitive swarm optimizer

被引:232
|
作者
Gu, Shenkai [1 ]
Cheng, Ran [1 ]
Jin, Yaochu [1 ,2 ]
机构
[1] Univ Surrey, Dept Comp Sci, Guildford GU2 7XH, Surrey, England
[2] Dalian Univ Technol, Sch Management Sci & Engn, Dalian 116023, Peoples R China
基金
英国工程与自然科学研究理事会; 中国国家自然科学基金;
关键词
Feature selection; High dimensionality; Large-scale optimization; Classification; Competitive swarm optimization; COMBINATORIAL; ALGORITHM;
D O I
10.1007/s00500-016-2385-6
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
When solving many machine learning problems such as classification, there exists a large number of input features. However, not all features are relevant for solving the problem, and sometimes, including irrelevant features may deteriorate the learning performance.Please check the edit made in the article title Therefore, it is essential to select the most relevant features, which is known as feature selection. Many feature selection algorithms have been developed, including evolutionary algorithms or particle swarm optimization (PSO) algorithms, to find a subset of the most important features for accomplishing a particular machine learning task. However, the traditional PSO does not perform well for large-scale optimization problems, which degrades the effectiveness of PSO for feature selection when the number of features dramatically increases. In this paper, we propose to use a very recent PSO variant, known as competitive swarm optimizer (CSO) that was dedicated to large-scale optimization, for solving high-dimensional feature selection problems. In addition, the CSO, which was originally developed for continuous optimization, is adapted to perform feature selection that can be considered as a combinatorial optimization problem. An archive technique is also introduced to reduce computational cost. Experiments on six benchmark datasets demonstrate that compared to the canonical PSO-based and a state-of-the-art PSO variant for feature selection, the proposed CSO-based feature selection algorithm not only selects a much smaller number of features, but result in better classification performance as well.
引用
收藏
页码:811 / 822
页数:12
相关论文
共 50 条
  • [11] Particle Swarm Optimisation for Feature Selection and Weighting in High-Dimensional Clustering
    O'Neill, Damien
    Lensen, Andrew
    Xue, Bing
    Zhang, Mengjie
    [J]. 2018 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION (CEC), 2018, : 173 - 180
  • [12] Feature selection based on hybridization of genetic algorithm and competitive swarm optimizer
    Ding, Ye
    Zhou, Kui
    Bi, Weihong
    [J]. SOFT COMPUTING, 2020, 24 (15) : 11663 - 11672
  • [13] Feature selection based on hybridization of genetic algorithm and competitive swarm optimizer
    Ye Ding
    Kui Zhou
    Weihong Bi
    [J]. Soft Computing, 2020, 24 : 11663 - 11672
  • [14] Feature Selection and Classification for High-Dimensional Incomplete Multimodal Data
    Deng, Wan-Yu
    Liu, Dan
    Dong, Ying-Ying
    [J]. MATHEMATICAL PROBLEMS IN ENGINEERING, 2018, 2018
  • [15] Feature selection, mutual information, and the classification of high-dimensional patterns
    Bonev, Boyan
    Escolano, Francisco
    Cazorla, Miguel
    [J]. PATTERN ANALYSIS AND APPLICATIONS, 2008, 11 (3-4) : 309 - 319
  • [16] Feature selection for histopathological image classification using levy flight salp swarm optimizer
    Rachapudi, Venubabu
    Lavanya Devi, G.
    [J]. Recent Patents on Computer Science, 2019, 12 (04): : 329 - 337
  • [17] Improved aquila optimizer with mRMR for feature selection of high-dimensional gene expression data
    Qin, Xiwen
    Zhang, Siqi
    Dong, Xiaogang
    Shi, Hongyu
    Yuan, Liping
    [J]. CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2024, 27 (09): : 13005 - 13027
  • [18] High-Dimensional Feature Selection for Automatic Classification of Coronary Stenosis Using an Evolutionary Algorithm
    Gil-Rios, Miguel-Angel
    Cruz-Aceves, Ivan
    Hernandez-Aguirre, Arturo
    Moya-Albor, Ernesto
    Brieva, Jorge
    Hernandez-Gonzalez, Martha-Alicia
    Solorio-Meza, Sergio-Eduardo
    [J]. DIAGNOSTICS, 2024, 14 (03)
  • [19] Extended particle swarm optimization for feature selection of high-dimensional biomedical data
    Al-Shammary, Dhiah
    Albukhnefis, Adil L.
    Alsaeedi, Ali Hakem
    Al-Asfoor, Muntasir
    [J]. CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2022, 34 (10):
  • [20] A PSO Based Hybrid Feature Selection Algorithm for High-Dimensional Classification
    Binh Tran
    Zhang, Mengjie
    Xue, Bing
    [J]. 2016 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION (CEC), 2016, : 3801 - 3808