An Adaptive Cooperative Coevolutionary Algorithm for Parallel Feature Selection in High-Dimensional Datasets

Cited by: 1
Authors
Firouznia, Marjan [1]
Trunfio, Giuseppe A. [2]
Affiliations
[1] Amirkabir Univ Technol, Elect Engn Dept, Tehran, Iran
[2] Univ Sassari, Dept Biomed Sci, Sassari, Italy
Keywords
Feature selection; Differential Evolution; Cooperative Coevolution; Parallel Computing; GENETIC ALGORITHM; OPTIMIZATION;
DOI
10.1109/PDP55904.2022.00040
Chinese Library Classification (CLC) number
TP3 [Computing technology, computer technology];
Discipline classification code
0812;
Abstract
Nowadays, it is common in many disciplines and application fields to collect large volumes of data characterized by a high number of features. Such datasets form the basis of modern applications of supervised Machine Learning, where the goal is to create a classifier for newly presented data. However, it is well known that the presence of irrelevant features in the dataset can lead to a harder learning phase and, above all, can produce suboptimal classifiers. For this reason, the ability to select an appropriate subset of the available features is becoming increasingly important. Traditionally, optimization metaheuristics have been used with success in the task of feature selection. However, many of the approaches presented in the literature are not applicable to datasets with thousands of features, since common optimization algorithms often suffer from poor scalability with respect to the size of the search space. In this paper, the problem of feature subset optimization is successfully addressed by a cooperative coevolutionary algorithm based on Differential Evolution. In the proposed algorithm, parallelized for multi-threaded execution on shared-memory architectures, a suitable strategy for reducing the dimensionality of the search space and adapting the population size during the optimization results in a significant performance improvement. A numerical investigation on some high-dimensional datasets shows that, in most cases, the proposed approach can achieve smaller feature subsets and higher classification performance than other state-of-the-art methods.
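The abstract names the ingredients (cooperative coevolution, Differential Evolution, feature subsets) but not the details, so the following is only a generic, sequential sketch of how those pieces usually fit together; it is not the authors' algorithm. It omits the paper's search-space reduction, adaptive population size, and multi-threading. The function names, the 0.5 decoding threshold, the DE/rand/1/bin variant, and the toy `fitness` (a stand-in for classifier accuracy that rewards a hypothetical set of relevant features and penalizes subset size) are all assumptions made for illustration.

```python
import random

def fitness(mask, relevant, penalty=0.1):
    """Toy stand-in for classifier accuracy (assumption): reward
    selected relevant features, penalize the size of the subset."""
    hits = sum(1 for i, m in enumerate(mask) if m and i in relevant)
    return hits - penalty * sum(mask)

def decode(vec):
    """Threshold a real-valued DE vector into a binary feature mask."""
    return [1 if v > 0.5 else 0 for v in vec]

def cc_de_feature_selection(n_features, relevant, n_groups=4, pop_size=10,
                            cycles=30, f=0.5, cr=0.9, seed=0):
    rng = random.Random(seed)
    # Random decomposition: split the feature indices into disjoint groups.
    idx = list(range(n_features))
    rng.shuffle(idx)
    groups = [idx[g::n_groups] for g in range(n_groups)]
    # One subpopulation per group; each evolves only its own coordinates.
    pops = [[[rng.random() for _ in grp] for _ in range(pop_size)]
            for grp in groups]
    # Context vector: the shared full solution, seeded from each group's
    # first individual.
    context = [0.0] * n_features
    for grp, pop in zip(groups, pops):
        for j, i in enumerate(grp):
            context[i] = pop[0][j]

    def evaluate(grp, sub):
        # Plug the sub-vector into the context to obtain a full solution.
        full = context[:]
        for j, i in enumerate(grp):
            full[i] = sub[j]
        return fitness(decode(full), relevant)

    history = []
    for _ in range(cycles):
        for grp, pop in zip(groups, pops):
            scores = [evaluate(grp, ind) for ind in pop]
            for k in range(pop_size):
                # DE/rand/1/bin: mutate three distinct other individuals,
                # then apply binomial crossover with the current one.
                others = [p for q, p in enumerate(pop) if q != k]
                a, b, c = rng.sample(others, 3)
                jrand = rng.randrange(len(grp))
                trial = [
                    min(1.0, max(0.0, a[j] + f * (b[j] - c[j])))
                    if (rng.random() < cr or j == jrand) else pop[k][j]
                    for j in range(len(grp))
                ]
                s = evaluate(grp, trial)
                if s >= scores[k]:  # greedy one-to-one selection
                    pop[k], scores[k] = trial, s
            # Write the group's best individual back into the context.
            best = max(range(pop_size), key=lambda q: scores[q])
            for j, i in enumerate(grp):
                context[i] = pop[best][j]
        history.append(fitness(decode(context), relevant))
    return decode(context), history
```

A call such as `cc_de_feature_selection(40, {0, 1, 2, 3, 4})` returns the selected-feature mask and the per-cycle fitness of the context vector. In a real setting the toy `fitness` would be replaced by a cross-validated classifier score, and the per-group loops are the natural unit to distribute across threads.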
Pages: 211-218
Number of pages: 8
Related papers
  • [1] Adaptive cooperative coevolutionary differential evolution for parallel feature selection in high-dimensional datasets
    Marjan Firouznia
    Pietro Ruiu
    Giuseppe A. Trunfio
    [J]. The Journal of Supercomputing, 2023, 79 : 15215 - 15244
  • [2] Adaptive cooperative coevolutionary differential evolution for parallel feature selection in high-dimensional datasets
    Firouznia, Marjan
    Ruiu, Pietro
    Trunfio, Giuseppe A.
    [J]. JOURNAL OF SUPERCOMPUTING, 2023, 79 (14): : 15215 - 15244
  • [3] A differential evolution algorithm with cooperative coevolutionary selection operation for high-dimensional optimization
    Wang, Chao
    Gao, J. -H.
    [J]. OPTIMIZATION LETTERS, 2014, 8 (02) : 477 - 492
  • [4] A differential evolution algorithm with cooperative coevolutionary selection operation for high-dimensional optimization
    Chao Wang
    J.-H. Gao
    [J]. Optimization Letters, 2014, 8 : 477 - 492
  • [5] A Cooperative Coevolutionary Approach to Discretization-Based Feature Selection for High-Dimensional Data
    Zhou, Yu
    Kang, Junhao
    Zhang, Xiao
    [J]. ENTROPY, 2020, 22 (06)
  • [6] High-dimensional feature selection for genomic datasets
    Afshar, Majid
    Usefi, Hamid
    [J]. KNOWLEDGE-BASED SYSTEMS, 2020, 206
  • [7] Evolutionary binary feature selection using adaptive ebola optimization search algorithm for high-dimensional datasets
    Oyelade, Olaide N. N.
    Agushaka, Jeffrey O. O.
    Ezugwu, Absalom E. E.
    [J]. PLOS ONE, 2023, 18 (03):
  • [8] A Nested Genetic Algorithm for feature selection in high-dimensional cancer Microarray datasets
    Sayed, Sabah
    Nassef, Mohammad
    Badr, Amr
    Farag, Ibrahim
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2019, 121 : 233 - 243
  • [9] Improved PSO for Feature Selection on High-Dimensional Datasets
    Tran, Binh
    Xue, Bing
    Zhang, Mengjie
    [J]. SIMULATED EVOLUTION AND LEARNING (SEAL 2014), 2014, 8886 : 503 - 515
  • [10] Improved PSO for feature selection on high-dimensional datasets
    Tran, Binh (binh.tran@ecs.vuw.ac.nz)
    [J]. 1600, Springer Verlag (8886):