Multiobjective optimization algorithm with dynamic operator selection for feature selection in high-dimensional classification

被引:8
|
作者
Wei, Wenhong [1 ]
Xuan, Manlin [2 ]
Li, Lingjie [2 ]
Lin, Qiuzhen [2 ]
Ming, Zhong [2 ]
Coello, Carlos A. Coello [3 ,4 ]
机构
[1] Dongguan Univ Technol, Sch Comp Sci & Technol, Dongguan, Peoples R China
[2] Shenzhen Univ, Coll Comp Sci & Software Engn, Shenzhen, Peoples R China
[3] CINVESTAV IPN, Dept Comp Sci, Mexico City 07360, Mexico
[4] Tecnol Monterrey, Sch Engn & Sci, Monterrey, Mexico
关键词
Feature selection; Multiobjective optimization; Evolutionary algorithm; High-dimensional classification; Resource allocation; BINARY DIFFERENTIAL EVOLUTION; GENETIC ALGORITHM; PSO;
D O I
10.1016/j.asoc.2023.110360
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Feature selection (FS) is an important technique in data preprocessing that aims to reduce the number of features for training while maintaining a high accuracy for classification. In recent studies, FS has been extended to optimize multiple objectives simultaneously in classification. To better solve this problem, this paper proposes a new multiobjective optimization algorithm with dynamic operator selection for feature selection in high-dimensional classification, called FS-DOS. First, two complementary search operators with different characteristics are designed, where the first operator is a quick search (QS) operator aiming to accelerate the convergence speed, and the other operator is a modified binary differential evolution (BDE) operator that can prevent the algorithm from falling into a local optimum. In addition, a dynamic selection strategy based on the idea of resource allocation is also designed to dynamically select the most suitable operator for each solution according to its corresponding performance improvement on aggregated objective values. The simulation results on fifteen different real-world high-dimensional FS datasets show that FS-DOS can obtain a feature subset with higher quality than several state-of-the-art FS algorithms. Importantly, in terms of error rate, FS DOS wins 55 out of 75 comparisons. In terms of dimensionality reduction, the number of features selected by FS-DOS is between one hundredth and one thousandth of the original dataset.& COPY; 2023 Elsevier B.V. All rights reserved.
引用
收藏
页数:15
相关论文
共 50 条
  • [41] Genetic programming for feature construction and selection in classification on high-dimensional data
    Binh Tran
    Bing Xue
    Mengjie Zhang
    [J]. Memetic Computing, 2016, 8 : 3 - 15
  • [42] Benchmark for filter methods for feature selection in high-dimensional classification data
    Bommert, Andrea
    Sun, Xudong
    Bischl, Bernd
    Rahnenfuehrer, Joerg
    Lang, Michel
    [J]. COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2020, 143
  • [43] An iterative SVM approach to feature selection and classification in high-dimensional datasets
    Liu, Dehua
    Qian, Hui
    Dai, Guang
    Zhang, Zhihua
    [J]. PATTERN RECOGNITION, 2013, 46 (09) : 2531 - 2537
  • [44] Stability of feature selection in classification issues for high-dimensional correlated data
    Perthame, Emeline
    Friguet, Chloe
    Causeur, David
    [J]. STATISTICS AND COMPUTING, 2016, 26 (04) : 783 - 796
  • [45] Genetic programming for feature construction and selection in classification on high-dimensional data
    Binh Tran
    Xue, Bing
    Zhang, Mengjie
    [J]. MEMETIC COMPUTING, 2016, 8 (01) : 3 - 15
  • [46] An Evolutionary Multitasking Algorithm With Multiple Filtering for High-Dimensional Feature Selection
    Li, Lingjie
    Xuan, Manlin
    Lin, Qiuzhen
    Jiang, Min
    Ming, Zhong
    Tan, Kay Chen
    [J]. IEEE TRANSACTIONS ON EVOLUTIONARY COMPUTATION, 2023, 27 (04) : 802 - 816
  • [47] Improving Evolutionary Algorithm Performance for Feature Selection in High-Dimensional Data
    Cilia, N.
    De Stefano, C.
    Fontanella, F.
    di Freca, A. Scotto
    [J]. APPLICATIONS OF EVOLUTIONARY COMPUTATION, EVOAPPLICATIONS 2018, 2018, 10784 : 439 - 454
  • [48] A two-stage clonal selection algorithm for local feature selection on high-dimensional data
    Wang, Yi
    Tian, Hao
    Li, Tao
    Liu, Xiaojie
    [J]. INFORMATION SCIENCES, 2024, 677
  • [49] FEATURE SELECTION FOR HIGH-DIMENSIONAL DATA ANALYSIS
    Verleysen, Michel
    [J]. NCTA 2011: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON NEURAL COMPUTATION THEORY AND APPLICATIONS, 2011, : IS23 - IS25
  • [50] Feature selection for high-dimensional data in astronomy
    Zheng, Hongwen
    Zhang, Yanxia
    [J]. ADVANCES IN SPACE RESEARCH, 2008, 41 (12) : 1960 - 1964