A hybrid feature selection algorithm combining information gain and grouping particle swarm optimization for cancer diagnosis

被引:0
|
作者
Yang, Fangyuan [1 ]
Xu, Zhaozhao [2 ]
Wang, Hong [1 ]
Sun, Lisha [1 ]
Zhai, Mengjiao [1 ]
Zhang, Juan [1 ]
机构
[1] Henan Polytech Univ, Affiliated Hosp 1, Dept Gynecol Oncol, Jiaozuo, Henan, Peoples R China
[2] Henan Polytech Univ, Sch Comp Sci & Technol, Jiaozuo, Henan, Peoples R China
来源
PLOS ONE | 2024年 / 19卷 / 03期
关键词
WRAPPER FEATURE-SELECTION; CLASSIFICATION; FILTER;
D O I
10.1371/journal.pone.0290332
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Background Cancer diagnosis based on machine learning has become a popular application direction. Support vector machine (SVM), as a classical machine learning algorithm, has been widely used in cancer diagnosis because of its advantages in high-dimensional and small sample data. However, due to the high-dimensional feature space and high feature redundancy of gene expression data, SVM faces the problem of poor classification effect when dealing with such data.Methods Based on this, this paper proposes a hybrid feature selection algorithm combining information gain and grouping particle swarm optimization (IG-GPSO). The algorithm firstly calculates the information gain values of the features and ranks them in descending order according to the value. Then, ranked features are grouped according to the information index, so that the features in the group are close, and the features outside the group are sparse. Finally, grouped features are searched using grouping PSO and evaluated according to in-group and out-group.Results Experimental results show that the average accuracy (ACC) of the SVM on the feature subset selected by the IG-GPSO is 98.50%, which is significantly better than the traditional feature selection algorithm. Compared with KNN, the classification effect of the feature subset selected by the IG-GPSO is still optimal. In addition, the results of multiple comparison tests show that the feature selection effect of the IG-GPSO is significantly better than that of traditional feature selection algorithms.Conclusion The feature subset selected by IG-GPSO not only has the best classification effect, but also has the least feature scale (FS). More importantly, the IG-GPSO significantly improves the ACC of SVM in cancer diagnostic.
引用
收藏
页数:17
相关论文
共 50 条
  • [31] An Entropy Driven Multiobjective Particle Swarm Optimization Algorithm for Feature Selection
    Luo, Juanjuan
    Zhou, Dongqing
    Jiang, Lingling
    Ma, Huadong
    2021 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION (CEC 2021), 2021, : 768 - 775
  • [32] Feature Selection Based on Hybridization of Genetic Algorithm and Particle Swarm Optimization
    Ghamisi, Pedram
    Benediktsson, Jon Atli
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2015, 12 (02) : 309 - 313
  • [33] Feature selection based on a hybrid simplified particle swarm optimization algorithm with maximum separation and minimum redundancy
    Sun, Liqin
    Yang, Youlong
    Liu, Yuanyuan
    Ning, Tong
    INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2023, 14 (03) : 789 - 816
  • [34] Feature selection based on a hybrid simplified particle swarm optimization algorithm with maximum separation and minimum redundancy
    Liqin Sun
    Youlong Yang
    Yuanyuan Liu
    Tong Ning
    International Journal of Machine Learning and Cybernetics, 2023, 14 : 789 - 816
  • [35] An improved hybrid chameleon swarm algorithm for feature selection in medical diagnosis
    Braik, Malik Shehadeh
    Hammouri, Abdelaziz I.
    Awadallah, Mohammed A.
    Al-Betar, Mohammed Azmi
    Khtatneh, Khalaf
    BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2023, 85
  • [36] An Improved Particle Swarm Optimization for Feature Selection
    Liu, Yuanning
    Wang, Gang
    Chen, Huiling
    Dong, Hao
    Zhu, Xiaodong
    Wang, Sujing
    JOURNAL OF BIONIC ENGINEERING, 2011, 8 (02) : 191 - 200
  • [37] An improved particle swarm optimization for feature selection
    Yuanning Liu
    Gang Wang
    Huiling Chen
    Hao Dong
    Xiaodong Zhu
    Sujing Wang
    Journal of Bionic Engineering, 2011, 8 : 191 - 200
  • [38] A Hybrid Feature Selection Method Based on Genetic Algorithm and Information Gain
    He, Fei
    Yang, Huamin
    Miao, Yu
    Louis, Rainbow
    PROCEEDINGS OF 2016 5TH INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND NETWORK TECHNOLOGY (ICCSNT), 2016, : 320 - 323
  • [39] An improved particle swarm optimization for feature selection
    Chen, Li-Fei
    Su, Chao-Ton
    Chen, Kun-Huang
    INTELLIGENT DATA ANALYSIS, 2012, 16 (02) : 167 - 182
  • [40] Multimodal particle swarm optimization for feature selection
    Hu, Xiao-Min
    Zhang, Shou-Rong
    Li, Min
    Deng, Jeremiah D.
    APPLIED SOFT COMPUTING, 2021, 113