A hybrid algorithm for feature subset selection in high-dimensional datasets using FICA and IWSSr algorithm

被引:20
|
作者
Moradkhani, Mostafa [1 ]
Amiri, Ali [1 ]
Javaherian, Mohsen [2 ]
Safari, Hossein [2 ]
机构
[1] Univ Zanjan, Dept Comp Engn, Zanjan 4537138791, Iran
[2] Univ Zanjan, Dept Phys, Zanjan 4537138791, Iran
关键词
Feature subset selection; FICA; IWSSr algorithm; High dimensional classification problems; MINIMUM REDUNDANCY; BAYES;
D O I
10.1016/j.asoc.2015.03.049
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Feature subset selection is a substantial problem in the field of data classification tasks. The purpose of feature subset selection is a mechanism to find efficient subset retrieved from original datasets to increase both efficiency and accuracy rate and reduce the costs of data classification. Working on high-dimensional datasets with a very large number of predictive attributes while the number of instances is presented in a low volume needs to be employed techniques to select an optimal feature subset. In this paper, a hybrid method is proposed for efficient subset selection in high-dimensional datasets. The proposed algorithm runs filter-wrapper algorithms in two phases. The symmetrical uncertainty (SU) criterion is exploited to weight features in filter phase for discriminating the classes. In wrapper phase, both FICA (fuzzy imperialist competitive algorithm) and IWSSr (Incremental Wrapper Subset Selection with replacement) in weighted feature space are executed to find relevant attributes. The new scheme is successfully applied on 10 standard high-dimensional datasets, especially within the field of biosciences and medicine, where the number of features compared to the number of samples is large, inducing a severe curse of dimensionality problem. The comparison between the results of our method and other algorithms confirms that our method has the most accuracy rate and it is also able to achieve to the efficient compact subset. (C) 2015 Elsevier B.V. All rights reserved.
引用
收藏
页码:123 / 135
页数:13
相关论文
共 50 条
  • [1] A GRASP algorithm for fast hybrid (filter-wrapper) feature subset selection in high-dimensional datasets
    Bermejo, Pablo
    Gamez, Jose A.
    Puerta, Jose M.
    [J]. PATTERN RECOGNITION LETTERS, 2011, 32 (05) : 701 - 711
  • [2] A Adaptive Cooperative Coevolutionary Algorithm for Parallel Feature Selection in High-Dimensional Datasets
    Firouznia, Marjan
    Trunfio, Giuseppe A.
    [J]. 30TH EUROMICRO INTERNATIONAL CONFERENCE ON PARALLEL, DISTRIBUTED AND NETWORK-BASED PROCESSING (PDP 2022), 2022, : 211 - 218
  • [3] A Nested Genetic Algorithm for feature selection in high-dimensional cancer Microarray datasets
    Sayed, Sabah
    Nassef, Mohammad
    Badr, Amr
    Farag, Ibrahim
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2019, 121 : 233 - 243
  • [4] A PSO Based Hybrid Feature Selection Algorithm for High-Dimensional Classification
    Binh Tran
    Zhang, Mengjie
    Xue, Bing
    [J]. 2016 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION (CEC), 2016, : 3801 - 3808
  • [5] Hybrid binary Coral Reefs Optimization algorithm with Simulated Annealing for Feature Selection in high-dimensional biomedical datasets
    Yan, Chaokun
    Ma, Jingjing
    Luo, Huimin
    Patel, Ashutosh
    [J]. CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 2019, 184 : 102 - 111
  • [6] A Fast Clustering-Based Feature Subset Selection Algorithm for High-Dimensional Data
    Song, Qinbao
    Ni, Jingjie
    Wang, Guangtao
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2013, 25 (01) : 1 - 14
  • [7] Evolutionary binary feature selection using adaptive ebola optimization search algorithm for high-dimensional datasets
    Oyelade, Olaide N. N.
    Agushaka, Jeffrey O. O.
    Ezugwu, Absalom E. E.
    [J]. PLOS ONE, 2023, 18 (03):
  • [8] A fast dual-module hybrid high-dimensional feature selection algorithm
    Yang, Geying
    He, Junjiang
    Lan, Xiaolong
    Li, Tao
    Fang, Wenbo
    [J]. INFORMATION SCIENCES, 2024, 681
  • [9] An Efficient Hybrid Feature Selection Method Using the Artificial Immune Algorithm for High-Dimensional Data
    Zhu, Yongbin
    Li, Tao
    Li, Wenshan
    [J]. COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE, 2022, 2022
  • [10] FACO: A Novel Hybrid Feature Selection Algorithm for High-Dimensional Data Classification
    Popoola, Gideon
    Oyeniran, Kayode
    [J]. SOUTHEASTCON 2024, 2024, : 61 - 68