A hybrid algorithm for feature subset selection in high-dimensional datasets using FICA and IWSSr algorithm

被引:20
|
作者
Moradkhani, Mostafa [1 ]
Amiri, Ali [1 ]
Javaherian, Mohsen [2 ]
Safari, Hossein [2 ]
机构
[1] Univ Zanjan, Dept Comp Engn, Zanjan 4537138791, Iran
[2] Univ Zanjan, Dept Phys, Zanjan 4537138791, Iran
关键词
Feature subset selection; FICA; IWSSr algorithm; High dimensional classification problems; MINIMUM REDUNDANCY; BAYES;
D O I
10.1016/j.asoc.2015.03.049
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Feature subset selection is a substantial problem in the field of data classification tasks. The purpose of feature subset selection is a mechanism to find efficient subset retrieved from original datasets to increase both efficiency and accuracy rate and reduce the costs of data classification. Working on high-dimensional datasets with a very large number of predictive attributes while the number of instances is presented in a low volume needs to be employed techniques to select an optimal feature subset. In this paper, a hybrid method is proposed for efficient subset selection in high-dimensional datasets. The proposed algorithm runs filter-wrapper algorithms in two phases. The symmetrical uncertainty (SU) criterion is exploited to weight features in filter phase for discriminating the classes. In wrapper phase, both FICA (fuzzy imperialist competitive algorithm) and IWSSr (Incremental Wrapper Subset Selection with replacement) in weighted feature space are executed to find relevant attributes. The new scheme is successfully applied on 10 standard high-dimensional datasets, especially within the field of biosciences and medicine, where the number of features compared to the number of samples is large, inducing a severe curse of dimensionality problem. The comparison between the results of our method and other algorithms confirms that our method has the most accuracy rate and it is also able to achieve to the efficient compact subset. (C) 2015 Elsevier B.V. All rights reserved.
引用
收藏
页码:123 / 135
页数:13
相关论文
共 50 条
  • [21] Monte Carlo Tree Search-Based Recursive Algorithm for Feature Selection in High-Dimensional Datasets
    Chaudhry, Muhammad Umar
    Yasir, Muhammad
    Asghar, Muhammad Nabeel
    Lee, Jee-Hyong
    [J]. ENTROPY, 2020, 22 (10) : 1 - 15
  • [22] Efficient Multiclass Classification Using Feature Selection in High-Dimensional Datasets
    Kumar, Ankur
    Kaur, Avinash
    Singh, Parminder
    Driss, Maha
    Boulila, Wadii
    [J]. ELECTRONICS, 2023, 12 (10)
  • [23] A Solution to the High-Dimensional Classification Problem Using an Improved Hybrid Feature Selection Algorithm Guided by Interaction Information
    Nakariyakul, Songyot
    [J]. IEEE ACCESS, 2020, 8 : 145909 - 145917
  • [24] Improved PSO for Feature Selection on High-Dimensional Datasets
    Tran, Binh
    Xue, Bing
    Zhang, Mengjie
    [J]. SIMULATED EVOLUTION AND LEARNING (SEAL 2014), 2014, 8886 : 503 - 515
  • [25] Hybrid binary COOT algorithm with simulated annealing for feature selection in high-dimensional microarray data
    Elnaz Pashaei
    Elham Pashaei
    [J]. Neural Computing and Applications, 2023, 35 : 353 - 374
  • [26] Hybrid binary COOT algorithm with simulated annealing for feature selection in high-dimensional microarray data
    Pashaei, Elnaz
    Pashaei, Elham
    [J]. NEURAL COMPUTING & APPLICATIONS, 2023, 35 (01): : 353 - 374
  • [27] Feature subset selection using a genetic algorithm
    Yang, JH
    Honavar, V
    [J]. IEEE INTELLIGENT SYSTEMS & THEIR APPLICATIONS, 1998, 13 (02): : 44 - 49
  • [28] High-Dimensional Feature Selection for Automatic Classification of Coronary Stenosis Using an Evolutionary Algorithm
    Gil-Rios, Miguel-Angel
    Cruz-Aceves, Ivan
    Hernandez-Aguirre, Arturo
    Moya-Albor, Ernesto
    Brieva, Jorge
    Hernandez-Gonzalez, Martha-Alicia
    Solorio-Meza, Sergio-Eduardo
    [J]. DIAGNOSTICS, 2024, 14 (03)
  • [29] An Improved Feature Subset Selection Algorithm for High-Dimensional Data and its Application on Hot rolling Process Data
    Wang, Yihan
    Liu, Bin
    Li, Weigang
    [J]. PROCEEDINGS OF THE 36TH CHINESE CONTROL CONFERENCE (CCC 2017), 2017, : 9714 - 9719
  • [30] A hybrid approach using rough set theory and hypergraph for feature selection on high-dimensional medical datasets
    M. R. Gauthama Raman
    Somu Nivethitha
    Krithivasan Kannan
    V. S. Shankar Sriram
    [J]. Soft Computing, 2019, 23 : 12655 - 12672