A hybrid feature selection algorithm combining information gain and grouping particle swarm optimization for cancer diagnosis

被引:0
|
作者
Yang, Fangyuan [1 ]
Xu, Zhaozhao [2 ]
Wang, Hong [1 ]
Sun, Lisha [1 ]
Zhai, Mengjiao [1 ]
Zhang, Juan [1 ]
机构
[1] Henan Polytech Univ, Affiliated Hosp 1, Dept Gynecol Oncol, Jiaozuo, Henan, Peoples R China
[2] Henan Polytech Univ, Sch Comp Sci & Technol, Jiaozuo, Henan, Peoples R China
来源
PLOS ONE | 2024年 / 19卷 / 03期
关键词
WRAPPER FEATURE-SELECTION; CLASSIFICATION; FILTER;
D O I
10.1371/journal.pone.0290332
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Background Cancer diagnosis based on machine learning has become a popular application direction. Support vector machine (SVM), as a classical machine learning algorithm, has been widely used in cancer diagnosis because of its advantages in high-dimensional and small sample data. However, due to the high-dimensional feature space and high feature redundancy of gene expression data, SVM faces the problem of poor classification effect when dealing with such data.Methods Based on this, this paper proposes a hybrid feature selection algorithm combining information gain and grouping particle swarm optimization (IG-GPSO). The algorithm firstly calculates the information gain values of the features and ranks them in descending order according to the value. Then, ranked features are grouped according to the information index, so that the features in the group are close, and the features outside the group are sparse. Finally, grouped features are searched using grouping PSO and evaluated according to in-group and out-group.Results Experimental results show that the average accuracy (ACC) of the SVM on the feature subset selected by the IG-GPSO is 98.50%, which is significantly better than the traditional feature selection algorithm. Compared with KNN, the classification effect of the feature subset selected by the IG-GPSO is still optimal. In addition, the results of multiple comparison tests show that the feature selection effect of the IG-GPSO is significantly better than that of traditional feature selection algorithms.Conclusion The feature subset selected by IG-GPSO not only has the best classification effect, but also has the least feature scale (FS). More importantly, the IG-GPSO significantly improves the ACC of SVM in cancer diagnostic.
引用
收藏
页数:17
相关论文
共 50 条
  • [41] A Survey on Particle Swarm Optimization in Feature Selection
    Kothari, Vipul
    Anuradha, J.
    Shah, Shreyak
    Mittal, Prerit
    GLOBAL TRENDS IN INFORMATION SYSTEMS AND SOFTWARE APPLICATIONS, PT 2, 2012, 270 : 192 - 201
  • [42] Gene selection using hybrid particle swarm optimization and genetic algorithm
    Shutao Li
    Xixian Wu
    Mingkui Tan
    Soft Computing, 2008, 12 : 1039 - 1048
  • [43] A Hybrid Particle Swarm Optimization Algorithm for Service Selection Problem in the Cloud
    Yang, Wanchun
    Zhang, Chenxi
    INTERNATIONAL JOURNAL OF GRID AND DISTRIBUTED COMPUTING, 2014, 7 (04): : 1 - 10
  • [44] Gene selection using hybrid particle swarm optimization and genetic algorithm
    Li, Shutao
    Wu, Xixian
    Tan, Mingkui
    SOFT COMPUTING, 2008, 12 (11) : 1039 - 1048
  • [45] A Hybrid Particle Swarm Optimization Algorithm
    Qi Changxing
    Bi Yiming
    Han Huihua
    Li Yong
    PROCEEDINGS OF 2017 3RD IEEE INTERNATIONAL CONFERENCE ON COMPUTER AND COMMUNICATIONS (ICCC), 2017, : 2187 - 2190
  • [46] On a hybrid particle swarm optimization algorithm
    Singh, Sharandeep
    Singh, Narinder
    Singh, S. B.
    INTERNATIONAL JOURNAL OF ADVANCED AND APPLIED SCIENCES, 2016, 3 (12): : 96 - 105
  • [47] Particle Swarm Optimization Feature Selection for Breast Cancer Recurrence Prediction
    Sakri, Sapiah Binti
    Rashid, Nuraini Binti Abdul
    Zain, Zuhaira Muhammad
    IEEE ACCESS, 2018, 6 : 29637 - 29647
  • [48] Hybrid Method of Information Gain and Particle Swarm Optimization for Selection of Features of SVM-Based Sentiment Analysis
    Kurniawati, Ika
    Pardede, Hilman F.
    2018 INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY SYSTEMS AND INNOVATION (ICITSI), 2018, : 1 - 5
  • [49] Hybrid feature selection and weighting method based on binary particle swarm optimization
    Severo, Diogo S.
    Verissimo, Everson
    Cavalcanti, George D. C.
    Ren, Tsang Ing
    2013 IEEE 25TH INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI), 2013, : 433 - 438
  • [50] Hybrid particle swarm optimization with spiral-shaped mechanism for feature selection
    Chen, Ke
    Zhou, Feng-Yu
    Yuan, Xian-Feng
    EXPERT SYSTEMS WITH APPLICATIONS, 2019, 128 : 140 - 156