Interaction-based clustering algorithm for feature selection: a multivariate filter approach

被引:0
|
作者
Ahmad Esfandiari
Hamid Khaloozadeh
Faezeh Farivar
机构
[1] Science and Research Branch,Department of Computer Engineering
[2] Islamic Azad University,Department of Systems and Control, Faculty of Electrical Engineering
[3] K. N. Toosi University of Technology,Department of Mechatronics Engineering, Science and Research Branch
[4] Islamic Azad University,undefined
关键词
Conditional Mutual Information; Mutual Information; Symmetric Uncertainty; Feature selection; Feature clustering; Akaike Information Criterion;
D O I
暂无
中图分类号
学科分类号
摘要
In pattern recognition and data mining, feature selection is a preprocessing step during which the dimensions of data are reduced by removing redundant, irrelevant, and noisy features for a machine learning task. Identifying the most informative features in a suitable computational time is one of the most important challenges in the existing feature selection methods. This paper introduces a multivariate filter feature selection method based on feature clustering technique called interaction-based feature clustering (IFC), which is very cost-effective in terms of computational cost while achieving high classification accuracy. In the proposed method, first, the features are ranked based on the symmetric uncertainty criterion, and then, the clustering of the features is performed by calculating their interactive weight as a similarity measure. To evaluate the performance, the results of the IFC algorithm are compared with six well-known multivariate filter methods on sixteen benchmark datasets using three classifiers of SVM, NB and kNN. In addition, for further evaluation, a comparison is made using the Akaike Information Criterion (AIC) and Pareto front curves. Experimental results prove that the IFC algorithm is often more efficient than the comparable methods in terms of classification accuracy and computational time and can be considered as a suitable method in the preprocessing step.
引用
收藏
页码:1769 / 1782
页数:13
相关论文
共 50 条
  • [1] Interaction-based clustering algorithm for feature selection: a multivariate filter approach
    Esfandiari, Ahmad
    Khaloozadeh, Hamid
    Farivar, Faezeh
    [J]. INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2023, 14 (05) : 1769 - 1782
  • [2] Interaction-based Clustering of Multivariate Time Series
    Plant, Claudia
    Wohlschlaeger, Afra M.
    Zherdin, Andrew
    [J]. 2009 9TH IEEE INTERNATIONAL CONFERENCE ON DATA MINING, 2009, : 914 - 919
  • [3] Dynamic interaction-based feature selection algorithm for maximal relevance minimal redundancy
    Kexin Yin
    Aifeng Xie
    Junren Zhai
    Jianqi Zhu
    [J]. Applied Intelligence, 2023, 53 : 8910 - 8926
  • [4] Dynamic interaction-based feature selection algorithm for maximal relevance minimal redundancy
    Yin, Kexin
    Xie, Aifeng
    Zhai, Junren
    Zhu, Jianqi
    [J]. APPLIED INTELLIGENCE, 2023, 53 (08) : 8910 - 8926
  • [5] A novel feature selection approach based on clustering algorithm
    Moslehi, Fateme
    Haeri, Abdorrahman
    [J]. JOURNAL OF STATISTICAL COMPUTATION AND SIMULATION, 2021, 91 (03) : 581 - 604
  • [6] Interaction-based feature selection using Factorial Design
    Tang, Xiaochuan
    Dai, Yuanshun
    Sun, Peng
    Meng, Sa
    [J]. NEUROCOMPUTING, 2018, 281 : 47 - 54
  • [7] A Robust Approach for Multivariate Binary Vectors Clustering and Feature Selection
    Al Mashrgy, Mohamed
    Bouguila, Nizar
    Daoudi, Khalid
    [J]. NEURAL INFORMATION PROCESSING, PT II, 2011, 7063 : 125 - +
  • [8] A fuzzy clustering based algorithm for feature selection
    Sun, HJ
    Wang, SR
    Mei, Z
    [J]. 2002 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-4, PROCEEDINGS, 2002, : 1993 - 1998
  • [9] A Clustering Based Genetic Algorithm for Feature Selection
    Rostami, Mehrdad
    Moradi, Parham
    [J]. 2014 6TH CONFERENCE ON INFORMATION AND KNOWLEDGE TECHNOLOGY (IKT), 2014, : 112 - 116
  • [10] A feature selection Bayesian approach for a clustering genetic algorithm
    Hruschka, ER
    Hruschka, ER
    Ebecken, NFF
    [J]. DATA MINING IV, 2004, 7 : 181 - 192