Multi-strategy fusion binary SHO guided by Pearson correlation coefficient for feature selection with cancer gene expression data

被引:0
|
作者
Wang, Yu-Cai [1 ]
Song, Hao-Ming [1 ]
Wang, Jie-Sheng [1 ]
Ma, Xin-Ru [1 ]
Song, Yu-Wei [1 ]
Qi, Yu-Liang [1 ]
机构
[1] Univ Sci & Technol Liaoning, Sch Elect & Informat Engn, Anshan, Liaoning, Peoples R China
关键词
Feature selection; Cancer gene expression; Sea-horse optimizer; Multi-strategy fusion; Pearson correlation coefficient; WRAPPER; ALGORITHMS;
D O I
10.1016/j.eij.2025.100639
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Cancer gene expression data is extensively utilized to address the challenges of cancer subtype diagnosis. However, this data is often characterized by high-dimensional, multi-text and multi-classification, which requires an effective feature selection (FS) method. A multi-strategy fusion binary sea-horse optimizer guided by Pearson correlation coefficient was proposed for FS with cancer gene expression data. For the multi-strategy fusion, the rest strategy is introduced in the sea-horse motor behavior stage. Subsequently, a search strategy based on symbiotic organisms of sea horses is designed for the predation stage. Finally, the elementary function dynamic weight strategy is proposed. Multi-strategy fusion enables the sea-horse optimizer (SHO) to perform dynamic exploitation and exploration in the early stage of iteration, expand the search scope initially, and narrow the search scope in the middle and later stages of the algorithm, so as to avoid the algorithm falling into the local optimal and increase the possibility of the algorithm jumping out of the local optimal, and avoid the blind search caused by elite influence. In the FS part, Pearson correlation coefficient guided strategy is proposed firstly to add or delete features. Then eight binary algorithms are derived from S-type and V-type transfer functions. The simulation experiment was divided into four parts. Firstly, the CEC-2022 test functions were used to test the performance of the multi-strategy fusion SHO, from which the best variant TanASSHO was selected, and then compared with other nine swarm intelligent optimization algorithms. Performance tests of various algorithm variants on 18 UCI datasets show that V1PTASSHO is the most effective binary version. Finally, V1PTASSHO was compared with other nine swarm intelligent optimization algorithms on 18 cancer gene expression datasets. The results demonstrate that V1PTASSHO effectively reduces feature subsets, improve classification accuracy and obtain lower fitness value. Friedman test and Wilcoxon rank sum test were used for statistical analysis to verify the effectiveness of the proposed algorithm.
引用
收藏
页数:34
相关论文
共 50 条
  • [1] GOG-MBSHO: multi-strategy fusion binary sea-horse optimizer with Gaussian transfer function for feature selection of cancer gene expression data
    Wang, Yu-Cai
    Song, Hao-Ming
    Wang, Jie-Sheng
    Song, Yu-Wei
    Qi, Yu-Liang
    Ma, Xin-Ru
    ARTIFICIAL INTELLIGENCE REVIEW, 2024, 57 (12)
  • [2] Multi-strategy ensemble binary hunger games search for feature selection
    Ma, Benedict Jun
    Liu, Shuai
    Heidari, Ali Asghar
    KNOWLEDGE-BASED SYSTEMS, 2022, 248
  • [3] Multi-Strategy Improved Binary Secretarial Bird Optimization Algorithm for Feature Selection
    Chen, Fuqiang
    Ye, Shitong
    Wang, Jianfeng
    Luo, Jia
    MATHEMATICS, 2025, 13 (04)
  • [4] Multi-strategy fusion novel binary equalization optimizer with dynamic transfer function for high-dimensional feature selection
    Song, Hao-Ming
    Wang, Jie-Sheng
    Hou, Jia-Ning
    Wang, Yu-Cai
    Song, Yu-Wei
    Qi, Yu-Liang
    JOURNAL OF SUPERCOMPUTING, 2025, 81 (04):
  • [5] Serial filter-wrapper feature selection method with elite guided mutation strategy on cancer gene expression data
    Song, Yu-Wei
    Wang, Jie-Sheng
    Qi, Yu-Liang
    Wang, Yu-Cai
    Song, Hao-Ming
    Shang-Guan, Yi-Peng
    ARTIFICIAL INTELLIGENCE REVIEW, 2025, 58 (04)
  • [6] Multi-Objective Semi-Supervised Feature Selection and Model Selection Based on Pearson's Correlation Coefficient
    Coelho, Frederico
    Braga, Antonio Padua
    Verleysen, Michel
    PROGRESS IN PATTERN RECOGNITION, IMAGE ANALYSIS, COMPUTER VISION, AND APPLICATIONS, 2010, 6419 : 509 - +
  • [7] Feature Selection and Classification in gene expression cancer data
    Pavithra, D.
    Lakshmanan, B.
    2017 INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE IN DATA SCIENCE (ICCIDS), 2017,
  • [8] Improved binary PSO for feature selection using gene expression data
    Chuang, Li-Yeh
    Chang, Hsueh-Wei
    Tu, Chung-Jui
    Yang, Cheng-Hong
    COMPUTATIONAL BIOLOGY AND CHEMISTRY, 2008, 32 (01) : 29 - 38
  • [9] Binary Political Optimizer for Feature Selection Using Gene Expression Data
    Manita, Ghaith
    Korbaa, Ouajdi
    COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE, 2020, 2020
  • [10] Parrot optimization algorithm for improved multi-strategy fusion for feature optimization of data in medical and industrial field
    Huang, Gaoxia
    Wei, Jianan
    Yuan, Yage
    Huang, Haisong
    Chen, Hualin
    SWARM AND EVOLUTIONARY COMPUTATION, 2025, 95