Feature selection in high-dimensional microarray cancer datasets using an improved equilibrium optimization approach

被引:1
|
作者
Balakrishnan, Kulanthaivel [1 ]
Dhanalakshmi, Ramasamy [1 ]
机构
[1] Indian Inst Informat Technol Tiruchirappalli, Dept Comp Sci & Engn, Tiruchirappalli, Tamil Nadu, India
来源
关键词
equilibrium optimization; feature selection; high dimensional; random-opposition-based learning; ALGORITHM; SEARCH;
D O I
10.1002/cpe.7381
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Optimal feature selection of a high-dimensional micro-array datasets has gained a significant importance in medical applications for early detection and prevention of disease. Traditional Optimal feature selection percolates through a population-based meta-heuristic optimization technique, a Machine Learning classifier and traditional wrapper method for transforming the original feature set into a better feature set. These techniques require a number of iterations for the convergence of random solutions to the global optimum with high-dimensionality issues such as over-fitting, memory constraints, computational costs, and low accuracy. In this article, an efficient equilibrium optimization technique is proposed for an optimized feature selection that increases the diversity of the population in the search space through Random Opposition based learning and classify the best features using a 10-fold cross-validation-based wrapper method. The proposed method is tested with six standard micro-array datasets and compared with the conventional algorithms such as Marine Predators Algorithm, Harris Hawks Optimization, Whale Optimization Algorithm, and conventional Equilibrium Optimization. From the statistical results using the standard metrics, it is interpreted that the proposed method converges to the global minimum in a few iterations through optimized feature selection, fitness value and higher classification accuracy. This proves its efficacy in exploring and finding a better solution as compared to the counterpart algorithms. In addition to complexity analysis, these results indicate a global optimum solution, an effective representation of least amount of data-high dimensionality reduction and an avoidance of over-fitting problems. The source code is available at
引用
收藏
页数:18
相关论文
共 50 条
  • [1] A Nested Genetic Algorithm for feature selection in high-dimensional cancer Microarray datasets
    Sayed, Sabah
    Nassef, Mohammad
    Badr, Amr
    Farag, Ibrahim
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2019, 121 : 233 - 243
  • [2] Improved PSO for Feature Selection on High-Dimensional Datasets
    Tran, Binh
    Xue, Bing
    Zhang, Mengjie
    [J]. SIMULATED EVOLUTION AND LEARNING (SEAL 2014), 2014, 8886 : 503 - 515
  • [3] Feature selection and computational optimization in high-dimensional microarray cancer datasets via InfoGain-modified bat algorithm
    Hambali, Moshood A.
    Oladele, Tinuke O.
    Adewole, Kayode S.
    Sangaiah, Arun Kumar
    Gao, Wei
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2022, 81 (25) : 36505 - 36549
  • [4] Feature selection and computational optimization in high-dimensional microarray cancer datasets via InfoGain-modified bat algorithm
    Moshood A. Hambali
    Tinuke O. Oladele
    Kayode S. Adewole
    Arun Kumar Sangaiah
    Wei Gao
    [J]. Multimedia Tools and Applications, 2022, 81 : 36505 - 36549
  • [5] Bi-objective feature selection in high-dimensional datasets using improved binary chimp optimization algorithm
    Al-qudah, Nour Elhuda A.
    Abed-alguni, Bilal H.
    Barhoush, Malek
    [J]. INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2024,
  • [6] High-dimensional feature selection for genomic datasets
    Afshar, Majid
    Usefi, Hamid
    [J]. KNOWLEDGE-BASED SYSTEMS, 2020, 206
  • [7] Distributed feature selection: A hesitant fuzzy correlation concept for microarray high-dimensional datasets
    Ebrahimpour, Mohammad Kazem
    Eftekhari, Mahdi
    [J]. CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 2018, 173 : 51 - 64
  • [8] An iterative SVM approach to feature selection and classification in high-dimensional datasets
    Liu, Dehua
    Qian, Hui
    Dai, Guang
    Zhang, Zhihua
    [J]. PATTERN RECOGNITION, 2013, 46 (09) : 2531 - 2537
  • [9] Stable Feature Selection using Improved Whale Optimization Algorithm for Microarray Datasets
    Theng, Dipti
    Bhoyar, Kishor K.
    [J]. ADCAIJ-ADVANCES IN DISTRIBUTED COMPUTING AND ARTIFICIAL INTELLIGENCE JOURNAL, 2023, 12 (01):
  • [10] Efficient Multiclass Classification Using Feature Selection in High-Dimensional Datasets
    Kumar, Ankur
    Kaur, Avinash
    Singh, Parminder
    Driss, Maha
    Boulila, Wadii
    [J]. ELECTRONICS, 2023, 12 (10)