Bi-objective feature selection in high-dimensional datasets using improved binary chimp optimization algorithm

被引:0
|
作者
Al-qudah, Nour Elhuda A. [1 ]
Abed-alguni, Bilal H. [1 ]
Barhoush, Malek [2 ]
机构
[1] Yarmouk Univ, Dept Comp Sci, Irbid, Jordan
[2] Yarmouk Univ, Dept Informat Technol Cybersecur Program, Irbid, Jordan
关键词
Chimp optimization algorithm; Opposition-based learning; High-dimensional datasets; Feature selection; Levy flight; beta-Hill climbing algorithm; CUCKOO SEARCH; MODELS;
D O I
10.1007/s13042-024-02308-y
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The machine learning process in high-dimensional datasets is far more complicated than in low-dimensional datasets. In high-dimensional datasets, Feature Selection (FS) is necessary to decrease the complexity of learning. However, FS in high-dimensional datasets is a complex process that requires the combination of several search techniques. The Chimp Optimization Algorithm, known as ChOA, is a new meta-heuristic method inspired by the chimps' individual intellect and sexual incentive in cooperative hunting. It is basically employed in solving complex continuous optimization problems, while its binary version is frequently utilized in solving difficult binary optimization problems. Both versions of ChOA are subject to premature convergence and are incapable of effectively solving high-dimensional optimization problems. This paper proposes the Binary Improved ChOA Algorithm (BICHOA) for solving the bi-objective, high-dimensional FS problems (i.e., high-dimensional FS problems that aim to maximize the classifier's accuracy and minimize the number of selected features from a dataset). BICHOA improves the performance of ChOA using four new exploration and exploitation techniques. First, it employs the opposition-based learning approach to initially create a population of diverse binary feasible solutions. Second, it incorporates the L & eacute;vy mutation function in the main probabilistic update function of ChOA to boost its searching and exploring capabilities. Third, it uses an iterative exploration technique based on an exploratory local search method called the beta-hill climbing algorithm. Finally, it employs a new binary time-varying transfer function to calculate binary feasible solutions from the continuous feasible solutions generated by the update equations of the ChOA and beta-hill climbing algorithms. BICHOA's performance was assessed and compared against six machine learning classifiers, five integer programming methods, and nine efficient popular optimization algorithms using 25 real-world high-dimensional datasets from various domains. According to the overall experimental findings, BICHOA scored the highest accuracy, best objective value, and fewest selected features for each of the 25 real-world high-dimensional datasets. Besides, the reliability of the experimental findings was established using Friedman and Wilcoxon statistical tests.
引用
收藏
页码:6107 / 6148
页数:42
相关论文
共 50 条
  • [1] A Bi-Search Evolutionary Algorithm for High-Dimensional Bi-Objective Feature Selection
    Xu, Hang
    Xue, Bing
    Zhang, Mengjie
    IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTATIONAL INTELLIGENCE, 2024, 8 (05): : 3489 - 3502
  • [2] A binary individual search strategy-based bi-objective evolutionary algorithm for high-dimensional feature selection
    Li, Tao
    Zhan, Zhi-Hui
    Xu, Jiu-Cheng
    Yang, Qiang
    Ma, Yuan-Yuan
    INFORMATION SCIENCES, 2022, 610 : 651 - 673
  • [3] Evolutionary binary feature selection using adaptive ebola optimization search algorithm for high-dimensional datasets
    Oyelade, Olaide N. N.
    Agushaka, Jeffrey O. O.
    Ezugwu, Absalom E. E.
    PLOS ONE, 2023, 18 (03):
  • [4] Improved PSO for Feature Selection on High-Dimensional Datasets
    Tran, Binh
    Xue, Bing
    Zhang, Mengjie
    SIMULATED EVOLUTION AND LEARNING (SEAL 2014), 2014, 8886 : 503 - 515
  • [5] Improved PSO for feature selection on high-dimensional datasets
    Tran, Binh (binh.tran@ecs.vuw.ac.nz), 1600, Springer Verlag (8886):
  • [6] Feature selection in high-dimensional microarray cancer datasets using an improved equilibrium optimization approach
    Balakrishnan, Kulanthaivel
    Dhanalakshmi, Ramasamy
    CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2022, 34 (28):
  • [7] Hybrid binary Coral Reefs Optimization algorithm with Simulated Annealing for Feature Selection in high-dimensional biomedical datasets
    Yan, Chaokun
    Ma, Jingjing
    Luo, Huimin
    Patel, Ashutosh
    CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 2019, 184 : 102 - 111
  • [8] Ensemble feature selection using bi-objective genetic algorithm
    Das, Asit K.
    Das, Sunanda
    Ghosh, Arka
    KNOWLEDGE-BASED SYSTEMS, 2017, 123 : 116 - 127
  • [9] A Length-Adaptive Non-Dominated Sorting Genetic Algorithm for Bi-Objective High-Dimensional Feature Selection
    Gong, Yanlu
    Zhou, Junhai
    Wu, Quanwang
    Zhou, MengChu
    Wen, Junhao
    IEEE-CAA JOURNAL OF AUTOMATICA SINICA, 2023, 10 (09) : 1834 - 1844
  • [10] A Length-Adaptive Non-Dominated Sorting Genetic Algorithm for Bi-Objective High-Dimensional Feature Selection
    Yanlu Gong
    Junhai Zhou
    Quanwang Wu
    MengChu Zhou
    Junhao Wen
    IEEE/CAAJournalofAutomaticaSinica, 2023, 10 (09) : 1834 - 1844