Feature selection for high-dimensional classification using a competitive swarm optimizer

被引:230
|
作者
Gu, Shenkai [1 ]
Cheng, Ran [1 ]
Jin, Yaochu [1 ,2 ]
机构
[1] Univ Surrey, Dept Comp Sci, Guildford GU2 7XH, Surrey, England
[2] Dalian Univ Technol, Sch Management Sci & Engn, Dalian 116023, Peoples R China
基金
中国国家自然科学基金; 英国工程与自然科学研究理事会;
关键词
Feature selection; High dimensionality; Large-scale optimization; Classification; Competitive swarm optimization; COMBINATORIAL; ALGORITHM;
D O I
10.1007/s00500-016-2385-6
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
When solving many machine learning problems such as classification, there exists a large number of input features. However, not all features are relevant for solving the problem, and sometimes, including irrelevant features may deteriorate the learning performance.Please check the edit made in the article title Therefore, it is essential to select the most relevant features, which is known as feature selection. Many feature selection algorithms have been developed, including evolutionary algorithms or particle swarm optimization (PSO) algorithms, to find a subset of the most important features for accomplishing a particular machine learning task. However, the traditional PSO does not perform well for large-scale optimization problems, which degrades the effectiveness of PSO for feature selection when the number of features dramatically increases. In this paper, we propose to use a very recent PSO variant, known as competitive swarm optimizer (CSO) that was dedicated to large-scale optimization, for solving high-dimensional feature selection problems. In addition, the CSO, which was originally developed for continuous optimization, is adapted to perform feature selection that can be considered as a combinatorial optimization problem. An archive technique is also introduced to reduce computational cost. Experiments on six benchmark datasets demonstrate that compared to the canonical PSO-based and a state-of-the-art PSO variant for feature selection, the proposed CSO-based feature selection algorithm not only selects a much smaller number of features, but result in better classification performance as well.
引用
收藏
页码:811 / 822
页数:12
相关论文
共 50 条
  • [31] Multiobjective optimization algorithm with dynamic operator selection for feature selection in high-dimensional classification
    Wei, Wenhong
    Xuan, Manlin
    Li, Lingjie
    Lin, Qiuzhen
    Ming, Zhong
    Coello, Carlos A. Coello
    [J]. APPLIED SOFT COMPUTING, 2023, 143
  • [32] A Clustering-Guided Integer Brain Storm Optimizer for Feature Selection in High-Dimensional Data
    Jia Yun-Tao
    Zhang Wan-Qiu
    He Chun-Lin
    [J]. DISCRETE DYNAMICS IN NATURE AND SOCIETY, 2021, 2021
  • [33] High-dimensional microarray dataset classification using an improved adam optimizer (iAdam)
    Utkarsh Mahadeo Khaire
    R. Dhanalakshmi
    [J]. Journal of Ambient Intelligence and Humanized Computing, 2020, 11 : 5187 - 5204
  • [34] High-dimensional microarray dataset classification using an improved adam optimizer (iAdam)
    Khaire, Utkarsh Mahadeo
    Dhanalakshmi, R.
    [J]. JOURNAL OF AMBIENT INTELLIGENCE AND HUMANIZED COMPUTING, 2020, 11 (11) : 5187 - 5204
  • [35] A particle swarm optimization based multiobjective memetic algorithm for high-dimensional feature selection
    Juanjuan Luo
    Dongqing Zhou
    Lingling Jiang
    Huadong Ma
    [J]. Memetic Computing, 2022, 14 : 77 - 93
  • [36] A particle swarm optimization based multiobjective memetic algorithm for high-dimensional feature selection
    Luo, Juanjuan
    Zhou, Dongqing
    Jiang, Lingling
    Ma, Huadong
    [J]. MEMETIC COMPUTING, 2022, 14 (01) : 77 - 93
  • [37] MD5 Encryption Algorithm Enhanced Competitive Swarm Optimizer for Feature Selection
    Lin, Yanhong
    Huang, Shiguo
    Hong, Minglin
    Chen, Shiting
    Li, Xiaolin
    Lin, Dakun
    [J]. 2019 IEEE INTL CONF ON PARALLEL & DISTRIBUTED PROCESSING WITH APPLICATIONS, BIG DATA & CLOUD COMPUTING, SUSTAINABLE COMPUTING & COMMUNICATIONS, SOCIAL COMPUTING & NETWORKING (ISPA/BDCLOUD/SOCIALCOM/SUSTAINCOM 2019), 2019, : 1250 - 1254
  • [38] Binary competitive swarm optimizer Algorithm for Feature Selection in Identification of Chinese Fir Family
    Huang, Shiguo
    Hong, Minglin
    Lin, Yanhong
    Li, Xiaolin
    Lin, Dakun
    [J]. 2019 IEEE INTL CONF ON PARALLEL & DISTRIBUTED PROCESSING WITH APPLICATIONS, BIG DATA & CLOUD COMPUTING, SUSTAINABLE COMPUTING & COMMUNICATIONS, SOCIAL COMPUTING & NETWORKING (ISPA/BDCLOUD/SOCIALCOM/SUSTAINCOM 2019), 2019, : 1519 - 1523
  • [39] A Constrained Competitive Swarm Optimizer With an SVM-Based Surrogate Model for Feature Selection
    Nguyen, Bach Hoai
    Xue, Bing
    Zhang, Mengjie
    [J]. IEEE TRANSACTIONS ON EVOLUTIONARY COMPUTATION, 2024, 28 (01) : 2 - 16
  • [40] Particle swarm optimizer for variable weighting in clustering high-dimensional data
    Yanping Lu
    Shengrui Wang
    Shaozi Li
    Changle Zhou
    [J]. Machine Learning, 2011, 82 : 43 - 70