Feature selection for high-dimensional classification using a competitive swarm optimizer

被引:232
|
作者
Gu, Shenkai [1 ]
Cheng, Ran [1 ]
Jin, Yaochu [1 ,2 ]
机构
[1] Univ Surrey, Dept Comp Sci, Guildford GU2 7XH, Surrey, England
[2] Dalian Univ Technol, Sch Management Sci & Engn, Dalian 116023, Peoples R China
基金
英国工程与自然科学研究理事会; 中国国家自然科学基金;
关键词
Feature selection; High dimensionality; Large-scale optimization; Classification; Competitive swarm optimization; COMBINATORIAL; ALGORITHM;
D O I
10.1007/s00500-016-2385-6
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
When solving many machine learning problems such as classification, there exists a large number of input features. However, not all features are relevant for solving the problem, and sometimes, including irrelevant features may deteriorate the learning performance.Please check the edit made in the article title Therefore, it is essential to select the most relevant features, which is known as feature selection. Many feature selection algorithms have been developed, including evolutionary algorithms or particle swarm optimization (PSO) algorithms, to find a subset of the most important features for accomplishing a particular machine learning task. However, the traditional PSO does not perform well for large-scale optimization problems, which degrades the effectiveness of PSO for feature selection when the number of features dramatically increases. In this paper, we propose to use a very recent PSO variant, known as competitive swarm optimizer (CSO) that was dedicated to large-scale optimization, for solving high-dimensional feature selection problems. In addition, the CSO, which was originally developed for continuous optimization, is adapted to perform feature selection that can be considered as a combinatorial optimization problem. An archive technique is also introduced to reduce computational cost. Experiments on six benchmark datasets demonstrate that compared to the canonical PSO-based and a state-of-the-art PSO variant for feature selection, the proposed CSO-based feature selection algorithm not only selects a much smaller number of features, but result in better classification performance as well.
引用
收藏
页码:811 / 822
页数:12
相关论文
共 50 条
  • [1] Feature selection for high-dimensional classification using a competitive swarm optimizer
    Shenkai Gu
    Ran Cheng
    Yaochu Jin
    [J]. Soft Computing, 2018, 22 : 811 - 822
  • [2] An Asymmetric Chaotic Competitive Swarm Optimization Algorithm for Feature Selection in High-Dimensional Data
    Pichai, Supailin
    Sunat, Khamron
    Chiewchanwattana, Sirapat
    [J]. SYMMETRY-BASEL, 2020, 12 (11): : 1 - 13
  • [3] Boosted Spider Wasp Optimizer for High-dimensional Feature Selection
    Mohamed, Elfadil A.
    Braik, Malik Sh.
    Al-Betar, Mohammed Azmi
    Awadallah, Mohammed A.
    [J]. JOURNAL OF BIONIC ENGINEERING, 2024,
  • [4] Binary Competitive Swarm Optimizer Approaches for Feature Selection
    Too, Jingwei
    Abdullah, Abdul Rahim
    Saad, Norhashimah Mohd
    [J]. COMPUTATION, 2019, 7 (02)
  • [5] Evolutionary Multitasking for Feature Selection in High-Dimensional Classification via Particle Swarm Optimization
    Chen, Ke
    Xue, Bing
    Zhang, Mengjie
    Zhou, Fengyu
    [J]. IEEE TRANSACTIONS ON EVOLUTIONARY COMPUTATION, 2022, 26 (03) : 446 - 460
  • [6] Variable-Length Particle Swarm Optimization for Feature Selection on High-Dimensional Classification
    Tran, Binh
    Xue, Bing
    Zhang, Mengjie
    [J]. IEEE TRANSACTIONS ON EVOLUTIONARY COMPUTATION, 2019, 23 (03) : 473 - 487
  • [7] An Enhanced Evolutionary Based Feature Selection Approach Using Grey Wolf Optimizer for the Classification of High-dimensional Biological Data
    Thaher, Thaer
    Awad, Mohammed
    Aldasht, Mohammed
    Sheta, Alaa
    Turabieh, Hamza
    Chantar, Hamouda
    [J]. JOURNAL OF UNIVERSAL COMPUTER SCIENCE, 2022, 28 (05) : 499 - 539
  • [8] Efficient Multiclass Classification Using Feature Selection in High-Dimensional Datasets
    Kumar, Ankur
    Kaur, Avinash
    Singh, Parminder
    Driss, Maha
    Boulila, Wadii
    [J]. ELECTRONICS, 2023, 12 (10)
  • [9] Simultaneous Feature Selection and Classification for High-Dimensional Data
    Pai, Vriddhi
    Gupta, Subhash Chand
    [J]. PROCEEDINGS OF THE SECOND INTERNATIONAL CONFERENCE ON GREEN COMPUTING AND INTERNET OF THINGS (ICGCIOT 2018), 2018, : 153 - 158
  • [10] Binary dwarf mongoose optimizer for solving high-dimensional feature selection problems
    Akinola, Olatunji A.
    Agushaka, Jeffrey O.
    Ezugwu, Absalom E.
    [J]. PLOS ONE, 2022, 17 (10):