A Supervised Filter Feature Selection method for mixed data based on Spectral Feature Selection and Information-theory redundancy analysis

Cited by: 24
Authors
Solorio-Fernandez, Saul [1]
Fco Martinez-Trinidad, Jose [1]
Ariel Carrasco-Ochoa, J. [1]
Affiliation
[1] Inst Nacl Astrofis Opt & Electr, Comp Sci Dept, Luis Enrique Erro 1, Puebla 72840, Mexico
Keywords
Supervised feature selection; Mixed data; Filter feature subset selection; Redundancy analysis; Efficient feature selection; Mutual information; Algorithm; Relevance
DOI
10.1016/j.patrec.2020.07.039
Chinese Library Classification number
TP18 [Artificial Intelligence Theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Spectral analysis and information theory are two powerful and successful frameworks for feature selection in supervised classification problems. However, most of the methods developed under these frameworks handle exclusively numerical or exclusively non-numerical data. In this paper, we propose a supervised filter feature selection method that combines Spectral Feature Selection with information-theory-based redundancy analysis to select relevant and non-redundant features in supervised mixed datasets, i.e., datasets whose objects are described simultaneously by both numerical and non-numerical features. To demonstrate the effectiveness of the proposed method, we conducted several experiments on 40 public real-world datasets. Additionally, we compare our method against relevant state-of-the-art supervised filter methods for numerical, non-numerical, and mixed data. In this comparison, our method generally obtains better results than the other evaluated filter feature selection methods. (C) 2020 Elsevier B.V. All rights reserved.
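
The abstract does not spell out the algorithm, so the following is only a minimal sketch of the general relevance-plus-redundancy filter idea on mixed data, not the authors' method. It makes several assumptions: the spectral relevance score is replaced by a plain symmetric-uncertainty score against the class labels, numerical features are made comparable to categorical ones by equal-width binning, and the names `select_features`, `discretize`, `bins`, and `threshold` are hypothetical.

```python
from collections import Counter
from math import log2


def entropy(values):
    """Shannon entropy (in bits) of a sequence of discrete values."""
    n = len(values)
    return -sum((c / n) * log2(c / n) for c in Counter(values).values())


def mutual_information(xs, ys):
    """I(X;Y) = H(X) + H(Y) - H(X,Y) for two discrete sequences."""
    return entropy(xs) + entropy(ys) - entropy(list(zip(xs, ys)))


def symmetric_uncertainty(xs, ys):
    """Mutual information normalised by the mean entropy, roughly in [0, 1]."""
    hx, hy = entropy(xs), entropy(ys)
    if hx + hy == 0:
        return 0.0
    return 2.0 * mutual_information(xs, ys) / (hx + hy)


def discretize(values, bins=3):
    """Equal-width binning (an assumption, not taken from the paper)."""
    lo, hi = min(values), max(values)
    if hi == lo:
        return [0] * len(values)
    width = (hi - lo) / bins
    return [min(int((v - lo) / width), bins - 1) for v in values]


def select_features(columns, labels, numeric, threshold=0.9):
    """Rank features by relevance to the labels, then skip redundant ones.

    columns: dict mapping feature name -> list of values
    numeric: set of feature names that hold numerical values
    """
    disc = {
        name: discretize(col) if name in numeric else list(col)
        for name, col in columns.items()
    }
    # Relevance stand-in: symmetric uncertainty with the class labels.
    ranked = sorted(
        disc,
        key=lambda f: symmetric_uncertainty(disc[f], labels),
        reverse=True,
    )
    kept = []
    for f in ranked:
        # Keep a feature only if it is not too similar to any kept feature.
        if all(symmetric_uncertainty(disc[f], disc[g]) < threshold for g in kept):
            kept.append(f)
    return kept


if __name__ == "__main__":
    columns = {
        "height": [1.0, 1.1, 5.0, 5.2, 9.0, 9.1],
        "height_cm": [100, 110, 500, 520, 900, 910],  # rescaled copy of "height"
        "colour": ["r", "g", "r", "g", "r", "g"],
    }
    labels = [0, 0, 1, 1, 2, 2]
    print(select_features(columns, labels, numeric={"height", "height_cm"}))
```

In this toy dataset the rescaled copy `height_cm` is discarded as redundant with `height`, while the categorical `colour` feature is retained because it shares almost no information with the features already kept.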
Pages: 321-328
Number of pages: 8