SMEM: A Subspace Merging Based Evolutionary Method for High-Dimensional Feature Selection

被引:0
|
作者
Li, Kaixuan [1 ]
Jiang, Shibo [2 ]
Zhang, Rui [3 ]
Qiu, Jianfeng [2 ]
Zhang, Lei [3 ]
Yang, Lixia [4 ]
Cheng, Fan [2 ]
机构
[1] Anhui Univ, Informat Mat & Intelligent Sensing Lab Anhui Prov, Hefei 230601, Peoples R China
[2] Anhui Univ, Inst Informat Mat & Intelligent Sensing Lab Anhui, Sch Artificial Intelligence, Hefei 230601, Peoples R China
[3] Anhui Univ, Sch Comp Sci & Technol, Hefei 230601, Peoples R China
[4] Anhui Univ, Inst Informat Mat & Intelligent Sensing Lab Anhui, Sch Elect & Informat Engn, Hefei 230601, Peoples R China
基金
中国国家自然科学基金;
关键词
Feature extraction; Search problems; Merging; Optimization; Sorting; History; Fans; High-dimensional feature selection; multi-objective evolutionary optimization; subspace division; pairwise subspace merging; BINARY DIFFERENTIAL EVOLUTION; PARTICLE SWARM OPTIMIZATION; ARTIFICIAL BEE COLONY; GENETIC ALGORITHM; CLASSIFICATION;
D O I
10.1109/TETCI.2024.3451695
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In the past decade, evolutionary algorithms (EAs) have shown their promising performance in solving the problem of feature selection. Despite that, it is still quite challenging to design the EAs for high-dimensional feature selection (HDFS), since the increasing number of features causes the search space of EAs grows exponentially, which is known as the "curse of dimensionality". To tackle the issue, in this paper, a Subspace Merging based Evolutionary Method, termed SMEM is suggested. In SMEM, to avoid directly optimizing the large search space of HDFS, the original feature space of HDFS is firstly divided into several independent low-dimensional subspaces. In each subspace, a subpopulation is evolved to obtain the latent good feature subsets quickly. Then, to avoid some features being missed, these low-dimensional subspaces merge in pairs, and the further search is carried on the merged subspaces. During the evolving of each merged subspace, the good feature subsets obtained from previous subspace pair are fully utilized. The above subspace merging procedure repeats, and the performance of SMEM is improved gradually, until in the end, all the subspaces are merged into one final space. At that time, the final space is also the original feature space in HDFS, which ensures all the features in the data is considered. Experimental results on different high-dimensional datasets demonstrate the effectiveness and the efficiency of the proposed SMEM, when compared with the state-of-the-arts.
引用
下载
收藏
页数:16
相关论文
共 50 条
  • [41] Feature selection for high-dimensional temporal data
    Tsagris, Michail
    Lagani, Vincenzo
    Tsamardinos, Ioannis
    BMC BIOINFORMATICS, 2018, 19
  • [42] Feature Selection with High-Dimensional Imbalanced Data
    Van Hulse, Jason
    Khoshgoftaar, Taghi M.
    Napolitano, Amri
    Wald, Randall
    2009 IEEE INTERNATIONAL CONFERENCE ON DATA MINING WORKSHOPS (ICDMW 2009), 2009, : 507 - 514
  • [43] Feature selection for high-dimensional temporal data
    Michail Tsagris
    Vincenzo Lagani
    Ioannis Tsamardinos
    BMC Bioinformatics, 19
  • [44] High-dimensional feature selection for genomic datasets
    Afshar, Majid
    Usefi, Hamid
    KNOWLEDGE-BASED SYSTEMS, 2020, 206
  • [45] FEATURE SELECTION FOR HIGH-DIMENSIONAL DATA ANALYSIS
    Verleysen, Michel
    ECTA 2011/FCTA 2011: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON EVOLUTIONARY COMPUTATION THEORY AND APPLICATIONS AND INTERNATIONAL CONFERENCE ON FUZZY COMPUTATION THEORY AND APPLICATIONS, 2011,
  • [46] A Parallel Coordinates Plot Method Based on Unsupervised Feature Selection for High-Dimensional Data Visualization
    Lou, Jiaqi
    Dong, Ke
    Wang, Maosen
    IWCMC 2021: 2021 17TH INTERNATIONAL WIRELESS COMMUNICATIONS & MOBILE COMPUTING CONFERENCE (IWCMC), 2021, : 532 - 536
  • [47] A Group Feature Ranking and Selection Method Based on Dimension Reduction Technique in High-Dimensional Data
    Zubair, Iqbal Muhammad
    Kim, Byunghoon
    IEEE ACCESS, 2022, 10 : 125136 - 125147
  • [48] Set-based integer-coded fuzzy granular evolutionary algorithms for high-dimensional feature selection
    Saadatmand, H.
    Akbarzadeh-T, M. -R
    APPLIED SOFT COMPUTING, 2023, 142
  • [49] Many-Objective Jaccard-based Evolutionary Feature Selection for High-Dimensional Imbalanced Data Classification
    Saadatmand H.
    Akbarzadeh-T M.
    IEEE Transactions on Pattern Analysis and Machine Intelligence, 2024, 46 (12) : 1 - 16
  • [50] A PSO Based Hybrid Feature Selection Algorithm for High-Dimensional Classification
    Binh Tran
    Zhang, Mengjie
    Xue, Bing
    2016 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION (CEC), 2016, : 3801 - 3808