Serial filter-wrapper feature selection method with elite guided mutation strategy on cancer gene expression data

被引:0
|
作者
Song, Yu-Wei [1 ]
Wang, Jie-Sheng [1 ]
Qi, Yu-Liang [1 ]
Wang, Yu-Cai [1 ]
Song, Hao-Ming [1 ]
Shang-Guan, Yi-Peng [1 ]
机构
[1] School of Electronic and Information Engineering, University of Science and Technology Liaoning, Liaoning, Anshan, China
关键词
Feature selection; Cancer gene expression; Equilibrium optimizer; Parallel filter methods; Elite guided mutation strategies; Serial hybrid frameworks;
D O I
10.1007/s10462-024-11029-1
中图分类号
学科分类号
摘要
Nowadays, many researchers utilize cancer gene expression data to solve the problem of cancer subtype diagnosis, but cancer gene expression data are often high-dimensional, multi-sample, and multi-classified, so a hybrid serial filter-wrapper feature selection (FS) method based on elite guided mutation strategy for cancer gene expression data is proposed. It is divided into a preliminary screening phase and a combined modeling phase. In the preliminary screening stage, the threshold values of seven filter methods are determined by the leave-one cross-validation method, and the features selected by these seven filter methods are combined to form two subsets by using the thoughts of ‘‘And’’ and ‘‘Or’’ in the logical operation. The union subset of two subsets is used in the equilibrium optimizer (EO) in the subsequent combination model stage as the reserved subset in the preliminary screening stage. The resulting hybrid framework is connected by a parallel filter method designed in the first stage with an improved EO in the second stage, which is named as SFEMEO. In order to prove the effectiveness and generalization of the proposed SFEMEO, it is compared with other 9 basic algorithms on 10 UCI data sets. It is found that the classification accuracy of the SFEMEO is improved by 0.56% ~ 20.19%, and the optimal fitness is also greatly improved. After comparing SFEMEO with other nine intelligent optimization algorithms on ten cancer gene expression data sets, it can be found that compared with most algorithms, the accuracy rate is improved by 3.73% ~ 18.13%, and the optimal fitness is relatively superior. At the same time, Wilcoxon rank sum test was used to evaluate the results of intelligent optimization algorithms such as SFEMEO, which proved the effectiveness of the proposed hybrid framework and its superiority in solving the FS problem of high-dimensional cancer gene expression data. © The Author(s) 2025.
引用
收藏
相关论文
共 50 条
  • [21] RETRACTED ARTICLE: A wrapper based feature selection in bone marrow plasma cell gene expression data
    T. Ragunthar
    S. Selvakumar
    Cluster Computing, 2019, 22 : 13785 - 13796
  • [22] Retraction Note: A wrapper based feature selection in bone marrow plasma cell gene expression data
    T. Ragunthar
    S. Selvakumar
    Cluster Computing, 2023, 26 : 139 - 139
  • [23] Feature Selection of Gene Expression Data for Cancer Classification: A Review
    Singh, Rabindra Kumar
    Sivabalakrishnan, M.
    BIG DATA, CLOUD AND COMPUTING CHALLENGES, 2015, 50 : 52 - 57
  • [24] ANFIS-Based Wrapper Model Gene Selection for Cancer Classification on Microarray Gene Expression Data
    Mahmoudi, Sina
    Lahijan, Biyuk Sadeghi
    Kanan, Hamidreza Rashidy
    2013 13TH IRANIAN CONFERENCE ON FUZZY SYSTEMS (IFSC), 2013,
  • [25] Feature Selection for Breast Cancer Classification by Integrating Somatic Mutation and Gene Expression
    Jiang, Qin
    Jin, Min
    FRONTIERS IN GENETICS, 2021, 12
  • [26] A novel filter feature selection method for paired microarray expression data analysis
    Cao, Zhongbo
    Wang, Yan
    Sun, Ying
    Du, Wei
    Liang, Yanchun
    INTERNATIONAL JOURNAL OF DATA MINING AND BIOINFORMATICS, 2015, 12 (04) : 363 - 386
  • [27] A Filter Feature Selection Method Based LLRFC and Redundancy Analysis for Tumor Classification Using Gene Expression Data
    Li, Jiangeng
    Li, Xiaodan
    Zhang, Wei
    PROCEEDINGS OF THE 2016 12TH WORLD CONGRESS ON INTELLIGENT CONTROL AND AUTOMATION (WCICA), 2016, : 2861 - 2867
  • [28] RETRACTED: A wrapper based feature selection in bone marrow plasma cell gene expression data (Retracted Article)
    Ragunthar, T.
    Selvakumar, S.
    CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2019, 22 (Suppl 6): : 13785 - 13796
  • [29] Null space based feature selection method for gene expression data
    Alok Sharma
    Seiya Imoto
    Satoru Miyano
    Vandana Sharma
    International Journal of Machine Learning and Cybernetics, 2012, 3 : 269 - 276
  • [30] A Two-Stage Feature Selection Method for Gene Expression Data
    Chuang, Li-Yeh
    Ke, Chao-Hsuan
    Chang, Hsueh-Wei
    Yang, Cheng-Hong
    OMICS-A JOURNAL OF INTEGRATIVE BIOLOGY, 2009, 13 (02) : 127 - 137