A hybrid feature selection algorithm combining information gain and grouping particle swarm optimization for cancer diagnosis

被引:0
|
作者
Yang, Fangyuan [1 ]
Xu, Zhaozhao [2 ]
Wang, Hong [1 ]
Sun, Lisha [1 ]
Zhai, Mengjiao [1 ]
Zhang, Juan [1 ]
机构
[1] Henan Polytech Univ, Affiliated Hosp 1, Dept Gynecol Oncol, Jiaozuo, Henan, Peoples R China
[2] Henan Polytech Univ, Sch Comp Sci & Technol, Jiaozuo, Henan, Peoples R China
来源
PLOS ONE | 2024年 / 19卷 / 03期
关键词
WRAPPER FEATURE-SELECTION; CLASSIFICATION; FILTER;
D O I
10.1371/journal.pone.0290332
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Background Cancer diagnosis based on machine learning has become a popular application direction. Support vector machine (SVM), as a classical machine learning algorithm, has been widely used in cancer diagnosis because of its advantages in high-dimensional and small sample data. However, due to the high-dimensional feature space and high feature redundancy of gene expression data, SVM faces the problem of poor classification effect when dealing with such data.Methods Based on this, this paper proposes a hybrid feature selection algorithm combining information gain and grouping particle swarm optimization (IG-GPSO). The algorithm firstly calculates the information gain values of the features and ranks them in descending order according to the value. Then, ranked features are grouped according to the information index, so that the features in the group are close, and the features outside the group are sparse. Finally, grouped features are searched using grouping PSO and evaluated according to in-group and out-group.Results Experimental results show that the average accuracy (ACC) of the SVM on the feature subset selected by the IG-GPSO is 98.50%, which is significantly better than the traditional feature selection algorithm. Compared with KNN, the classification effect of the feature subset selected by the IG-GPSO is still optimal. In addition, the results of multiple comparison tests show that the feature selection effect of the IG-GPSO is significantly better than that of traditional feature selection algorithms.Conclusion The feature subset selected by IG-GPSO not only has the best classification effect, but also has the least feature scale (FS). More importantly, the IG-GPSO significantly improves the ACC of SVM in cancer diagnostic.
引用
收藏
页数:17
相关论文
共 50 条
  • [1] Information gain ratio-based subfeature grouping empowers particle swarm optimization for feature selection
    Gao, Jinrui
    Wang, Ziqian
    Jin, Ting
    Cheng, Jiujun
    Lei, Zhenyu
    Gao, Shangce
    KNOWLEDGE-BASED SYSTEMS, 2024, 286
  • [2] Hybrid particle swarm optimization algorithm for fault feature selection
    Taiyuan University of Technology, Taiyuan 030024, China
    不详
    Xitong Fangzhen Xuebao / Journal of System Simulation, 2008, 20 (15): : 4041 - 4044
  • [3] Hybrid Feature Selection Algorithm Combining Information Gain Ratio and Genetic Algorithm
    Xu Z.-Z.
    Shen D.-R.
    Nie T.-Z.
    Kou Y.
    Ruan Jian Xue Bao/Journal of Software, 2022, 33 (03): : 1128 - 1140
  • [4] An hybrid particle swarm optimization with crow search algorithm for feature selection
    Adamu, Abdulhameed
    Abdullahi, Mohammed
    Junaidu, Sahalu Balarabe
    Hassan, Ibrahim Hayatu
    MACHINE LEARNING WITH APPLICATIONS, 2021, 6
  • [5] Hybrid particle swarm optimization algorithm for text feature selection problems
    Nachaoui, Mourad
    Lakouam, Issam
    Hafidi, Imad
    NEURAL COMPUTING & APPLICATIONS, 2024, 36 (13): : 7471 - 7489
  • [6] Hybrid particle swarm optimization algorithm for text feature selection problems
    Mourad Nachaoui
    Issam Lakouam
    Imad Hafidi
    Neural Computing and Applications, 2024, 36 : 7471 - 7489
  • [7] An oscillatory particle swarm optimization feature selection algorithm for hybrid data based on mutual information entropy
    He, Jiali
    Qu, Liangdong
    Wang, Pei
    Li, Zhaowen
    APPLIED SOFT COMPUTING, 2024, 152
  • [8] Improving Breast Cancer Diagnosis Accuracy by Particle Swarm Optimization Feature Selection
    Kazerani, Reihane
    INTERNATIONAL JOURNAL OF COMPUTATIONAL INTELLIGENCE SYSTEMS, 2024, 17 (01)
  • [9] Improving Breast Cancer Diagnosis Accuracy by Particle Swarm Optimization Feature Selection
    Reihane Kazerani
    International Journal of Computational Intelligence Systems, 17
  • [10] Hybrid distributed feature selection using particle swarm optimization-mutual information
    Robindro K.
    Devi S.S.
    Clinton U.B.
    Takhellambam L.
    Singh Y.R.
    Hoque N.
    Data Sci. Manag., 1 (64-73): : 64 - 73