A differential evolution based feature combination selection algorithm for high-dimensional data

被引:37
|
作者
Guan, Boxin [1 ,2 ]
Zhao, Yuhai [1 ,2 ]
Yin, Ying [1 ,2 ]
Li, Yuan [3 ]
机构
[1] Northeastern Univ, Sch Comp Sci & Engn, Shenyang 110819, Peoples R China
[2] Northeastern Univ, Key Lab Med Image Comp, Minist Educ, Shenyang 110819, Peoples R China
[3] North China Univ Technol, Sch Informat Sci & Technol, Beijing 100144, Peoples R China
关键词
Binary space partitioning; Differential evolution; Feature combination; High-dimensional data; DETECTING EPISTATIC INTERACTIONS; GENOME-WIDE ASSOCIATION; OPTIMIZATION ALGORITHM;
D O I
10.1016/j.ins.2020.08.081
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Feature combination selection is used in object classification to select complementary features that can produce a powerful combination. One active area of selecting feature combinations is genome-wide association studies (GWAS). However, selecting feature combinations from high-dimensional GWAS data faces a serious issue of high computational complexity. In this paper, a fast evolutionary optimization method named search history-guided differential evolution (HGDE) is proposed to deal with the problem. This method applies the search history memorized in a binary space partitioning tree to enhance its power for selecting feature combinations. We perform a comparative study on the proposed HGDE algorithm and other state-of-the-art algorithms using synthetic datasets, and later employ the HGDE algorithm in experiments on a real age-related macular degeneration dataset. The experimental results show that this proposed algorithm has superior performance in the selection of feature combinations. Moreover, the results provide a reference for studying the functional mechanisms of age-related macular degeneration. (C) 2020 Elsevier Inc. All rights reserved.
引用
收藏
页码:870 / 886
页数:17
相关论文
共 50 条
  • [1] Differential Privacy High-Dimensional Data Publishing Based on Feature Selection and Clustering
    Chu, Zhiguang
    He, Jingsha
    Zhang, Xiaolei
    Zhang, Xing
    Zhu, Nafei
    [J]. ELECTRONICS, 2023, 12 (09)
  • [2] A density-based clustering algorithm for high-dimensional data with feature selection
    Qi Xianting
    Wang Pan
    [J]. 2016 2ND INTERNATIONAL CONFERENCE ON INDUSTRIAL INFORMATICS - COMPUTING TECHNOLOGY, INTELLIGENT TECHNOLOGY, INDUSTRIAL INFORMATION INTEGRATION (ICIICII), 2016, : 114 - 118
  • [3] Feature selection for high-dimensional data
    Destrero A.
    Mosci S.
    De Mol C.
    Verri A.
    Odone F.
    [J]. Computational Management Science, 2009, 6 (1) : 25 - 40
  • [4] Feature selection for high-dimensional data
    Bolón-Canedo V.
    Sánchez-Maroño N.
    Alonso-Betanzos A.
    [J]. Progress in Artificial Intelligence, 2016, 5 (2) : 65 - 75
  • [5] Feature selection algorithm based on optimized genetic algorithm and the application in high-dimensional data processing
    Feng, Guilian
    [J]. PLOS ONE, 2024, 19 (05):
  • [6] A Fast Clustering-Based Feature Subset Selection Algorithm for High-Dimensional Data
    Song, Qinbao
    Ni, Jingjie
    Wang, Guangtao
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2013, 25 (01) : 1 - 14
  • [7] Improved neighborhood space based feature selection algorithm for high-dimensional mixed data
    Zhang, Teng-Fei
    Zhang, Yu-Di
    Ma, Fu-Min
    [J]. Kongzhi yu Juece/Control and Decision, 2024, 39 (03): : 929 - 938
  • [8] Feature selection based on dynamic crow search algorithm for high-dimensional data classification
    Jiang, He
    Yang, Ye
    Wan, Qiuying
    Dong, Yao
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2024, 250
  • [9] Improving Evolutionary Algorithm Performance for Feature Selection in High-Dimensional Data
    Cilia, N.
    De Stefano, C.
    Fontanella, F.
    di Freca, A. Scotto
    [J]. APPLICATIONS OF EVOLUTIONARY COMPUTATION, EVOAPPLICATIONS 2018, 2018, 10784 : 439 - 454
  • [10] Feature selection based on geometric distance for high-dimensional data
    Lee, J. -H.
    Oh, S. -Y.
    [J]. ELECTRONICS LETTERS, 2016, 52 (06) : 473 - 474