Serial filter-wrapper feature selection method with elite guided mutation strategy on cancer gene expression data

Authors
Song, Yu-Wei [1]
Wang, Jie-Sheng [1]
Qi, Yu-Liang [1]
Wang, Yu-Cai [1]
Song, Hao-Ming [1]
Shang-Guan, Yi-Peng [1]
Affiliations
[1] School of Electronic and Information Engineering, University of Science and Technology Liaoning, Anshan, Liaoning, China
Keywords
Feature selection; Cancer gene expression; Equilibrium optimizer; Parallel filter methods; Elite guided mutation strategies; Serial hybrid frameworks
DOI
10.1007/s10462-024-11029-1
Abstract
Many researchers now use cancer gene expression data to diagnose cancer subtypes, but such data are often high-dimensional, multi-sample, and multi-class. To address this, a hybrid serial filter-wrapper feature selection (FS) method based on an elite guided mutation strategy is proposed for cancer gene expression data. The method is divided into a preliminary screening stage and a combined modeling stage. In the preliminary screening stage, the thresholds of seven filter methods are determined by leave-one-out cross-validation, and the features selected by these seven filter methods are combined into two subsets using the logical "And" and "Or" operations. The union of these two subsets is the reserved subset of the preliminary screening stage and is passed to the equilibrium optimizer (EO) in the subsequent combined modeling stage. The resulting hybrid framework, which connects the parallel filter method designed in the first stage with an improved EO in the second stage, is named SFEMEO. To demonstrate the effectiveness and generalization of the proposed SFEMEO, it is compared with nine other basic algorithms on 10 UCI data sets: its classification accuracy is higher by 0.56% to 20.19%, and its optimal fitness is also greatly improved. When SFEMEO is compared with nine other intelligent optimization algorithms on ten cancer gene expression data sets, its accuracy is 3.73% to 18.13% higher than that of most algorithms, and its optimal fitness is relatively superior. In addition, the Wilcoxon rank sum test was used to evaluate the results of SFEMEO and the other intelligent optimization algorithms, which supported the effectiveness of the proposed hybrid framework and its superiority in solving the FS problem for high-dimensional cancer gene expression data. © The Author(s) 2025.
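The "And"/"Or" combination described in the abstract can be illustrated with a minimal Python sketch. This is not the authors' implementation: the function name `combine_filter_subsets`, the filter names, and the feature indices are all hypothetical, and only three filters are shown instead of seven. It assumes each filter has already been thresholded and returns the set of feature indices it keeps.

```python
from typing import Dict, Set


def combine_filter_subsets(selected: Dict[str, Set[int]]) -> Dict[str, Set[int]]:
    """Combine per-filter feature index sets with logical AND and OR.

    `selected` maps a (hypothetical) filter name to the indices it kept.
    """
    subsets = list(selected.values())
    if not subsets:
        raise ValueError("at least one filter result is required")
    and_subset = set.intersection(*subsets)  # kept by every filter
    or_subset = set.union(*subsets)          # kept by at least one filter
    reserved = and_subset | or_subset        # union of the two subsets
    return {"and": and_subset, "or": or_subset, "reserved": reserved}


# Made-up output of three filters (illustration only; the abstract uses seven).
filters = {
    "filter_a": {0, 2, 5, 7},
    "filter_b": {2, 3, 5},
    "filter_c": {2, 5, 9},
}
result = combine_filter_subsets(filters)
print(sorted(result["and"]))       # [2, 5]
print(sorted(result["reserved"]))  # [0, 2, 3, 5, 7, 9]
```

Under this literal reading the "And" subset is always contained in the "Or" subset, so the reserved subset coincides with the "Or" subset. The paper's actual construction of the two subsets may differ from this assumption.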