Serial filter-wrapper feature selection method with elite guided mutation strategy on cancer gene expression data

被引:0
|
作者
Song, Yu-Wei [1 ]
Wang, Jie-Sheng [1 ]
Qi, Yu-Liang [1 ]
Wang, Yu-Cai [1 ]
Song, Hao-Ming [1 ]
Shang-Guan, Yi-Peng [1 ]
机构
[1] School of Electronic and Information Engineering, University of Science and Technology Liaoning, Liaoning, Anshan, China
关键词
Feature selection; Cancer gene expression; Equilibrium optimizer; Parallel filter methods; Elite guided mutation strategies; Serial hybrid frameworks;
D O I
10.1007/s10462-024-11029-1
中图分类号
学科分类号
摘要
Nowadays, many researchers utilize cancer gene expression data to solve the problem of cancer subtype diagnosis, but cancer gene expression data are often high-dimensional, multi-sample, and multi-classified, so a hybrid serial filter-wrapper feature selection (FS) method based on elite guided mutation strategy for cancer gene expression data is proposed. It is divided into a preliminary screening phase and a combined modeling phase. In the preliminary screening stage, the threshold values of seven filter methods are determined by the leave-one cross-validation method, and the features selected by these seven filter methods are combined to form two subsets by using the thoughts of ‘‘And’’ and ‘‘Or’’ in the logical operation. The union subset of two subsets is used in the equilibrium optimizer (EO) in the subsequent combination model stage as the reserved subset in the preliminary screening stage. The resulting hybrid framework is connected by a parallel filter method designed in the first stage with an improved EO in the second stage, which is named as SFEMEO. In order to prove the effectiveness and generalization of the proposed SFEMEO, it is compared with other 9 basic algorithms on 10 UCI data sets. It is found that the classification accuracy of the SFEMEO is improved by 0.56% ~ 20.19%, and the optimal fitness is also greatly improved. After comparing SFEMEO with other nine intelligent optimization algorithms on ten cancer gene expression data sets, it can be found that compared with most algorithms, the accuracy rate is improved by 3.73% ~ 18.13%, and the optimal fitness is relatively superior. At the same time, Wilcoxon rank sum test was used to evaluate the results of intelligent optimization algorithms such as SFEMEO, which proved the effectiveness of the proposed hybrid framework and its superiority in solving the FS problem of high-dimensional cancer gene expression data. © The Author(s) 2025.
引用
收藏
相关论文
共 50 条
  • [1] A hybrid filter-wrapper gene selection method for cancer classification
    Alomari, Osama Ahmad
    Khader, Ahamad Tajudin
    Al-Betar, Mohammed Azmi
    Alyasseri, Zaid Abdi Alkareem
    2018 2ND INTERNATIONAL CONFERENCE ON BIOSIGNAL ANALYSIS, PROCESSING AND SYSTEMS (ICBAPS 2018), 2018, : 113 - 118
  • [2] A Novel Filter-Wrapper Based Feature Selection Approach for Cancer Data Classification
    Mufassirin, M. M. Mohamed
    Ragel, Roshan G.
    2018 IEEE 9TH INTERNATIONAL CONFERENCE ON INFORMATION AND AUTOMATION FOR SUSTAINABILITY (ICIAFS' 2018), 2018,
  • [3] Hybrid Filter-Wrapper Feature Selection Method for Sentiment Classification
    Ansari, Gunjan
    Ahmad, Tanvir
    Doja, Mohammad Najmud
    ARABIAN JOURNAL FOR SCIENCE AND ENGINEERING, 2019, 44 (11) : 9191 - 9208
  • [4] Feature subset selection Filter-Wrapper based on low quality data
    Cadenas, Jose M.
    Carmen Garrido, M.
    Martinez, Raquel
    EXPERT SYSTEMS WITH APPLICATIONS, 2013, 40 (16) : 6241 - 6252
  • [5] A hybrid filter-wrapper feature selection method for DDoS detection in cloud computing
    Belouch, Mustapha
    Elhadaj, Salah
    Idhammad, Mohamed
    INTELLIGENT DATA ANALYSIS, 2018, 22 (06) : 1209 - 1226
  • [6] A Hybrid Filter/Wrapper Approach of Feature Selection for Gene Expression Data
    Ke, Chao-Hsuan
    Yang, Cheng-Hong
    Chuang, Li-Yeh
    Yang, Cheng-San
    2008 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN AND CYBERNETICS (SMC), VOLS 1-6, 2008, : 2663 - +
  • [7] A new hybrid filter-wrapper feature selection method for clustering based on ranking
    Solorio-Fernandez, Saul
    Ariel Carrasco-Ochoa, J.
    Fco. Martinez-Trinidad, Jose
    NEUROCOMPUTING, 2016, 214 : 866 - 880
  • [8] Application of a GA/Bayesian filter-wrapper feature selection method to classification of clinical depression from speech data
    Torres, Juan
    Saad, Ashraf
    Moore, Elliot
    SOFT COMPUTING IN INDUSTRIAL APPLICATIONS: RECENT AND EMERGING METHODS AND TECHNIQUES, 2007, 39 : 115 - +
  • [9] Improved swarm-optimization-based filter-wrapper gene selection from microarray data for gene expression tumor classification
    Lin Ke
    Min Li
    Lei Wang
    Shaobo Deng
    Jun Ye
    Xiang Yu
    Pattern Analysis and Applications, 2023, 26 : 455 - 472
  • [10] Improved swarm-optimization-based filter-wrapper gene selection from microarray data for gene expression tumor classification
    Ke, Lin
    Li, Min
    Wang, Lei
    Deng, Shaobo
    Ye, Jun
    Yu, Xiang
    PATTERN ANALYSIS AND APPLICATIONS, 2023, 26 (02) : 455 - 472