A Supervised Filter Feature Selection method for mixed data based on Spectral Feature Selection and Information-theory redundancy analysis

被引:24
|
作者
Solorio-Fernandez, Saul [1 ]
Fco Martinez-Trinidad, Jose [1 ]
Ariel Carrasco-Ochoa, J. [1 ]
机构
[1] Inst Nacl Astrofis Opt & Electr, Comp Sci Dept, Luis Enrique Erro 1, Puebla 72840, Mexico
关键词
Supervised feature selection; Mixed data; Filter feature subset selection; Redundancy analysis; EFFICIENT FEATURE-SELECTION; MUTUAL INFORMATION; ALGORITHM; RELEVANCE;
D O I
10.1016/j.patrec.2020.07.039
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Spectral analysis and Information-theory are two powerful and successful frameworks for feature selection in supervised classification problems. However, most of the methods developed under these frameworks have been introduced for handling exclusively numerical or non- numerical data. In this paper, we propose a supervised filter feature selection method that combines Spectral Feature Selection and Information-theory based redundancy analysis for selecting relevant and non-redundant features in supervised mixed datasets; i.e., datasets where the objects are described simultaneously by both, numerical and non-numerical features. To demonstrate the effectiveness of our proposed supervised filter feature selection method, we conducted several experiments on 40 public real-world datasets. Additionally, we compare our method against relevant state-of-the-art supervised filter methods for numerical, nonnumerical, and mixed data. From this comparison, our method, in general, obtains better results than the results obtained by the other evaluated filter feature selection methods. (C) 2020 Elsevier B.V. All rights reserved.
引用
收藏
页码:321 / 328
页数:8
相关论文
共 50 条
  • [41] An efficient semi-supervised representatives feature selection algorithm based on information theory
    Wang, Yintong
    Wang, Jiandong
    Liao, Hao
    Chen, Haiyan
    [J]. PATTERN RECOGNITION, 2017, 61 : 511 - 523
  • [42] Semi-supervised feature selection with minimal redundancy based on local adaptive
    Wu, Xinping
    Chen, Hongmei
    Li, Tianrui
    Wan, Jihong
    [J]. APPLIED INTELLIGENCE, 2021, 51 (11) : 8542 - 8563
  • [43] Feature Selection Method Based on Weighted Mutual Information for Imbalanced Data
    Li, Kewen
    Yu, Mingxiao
    Liu, Lu
    Li, Timing
    Zhai, Jiannan
    [J]. INTERNATIONAL JOURNAL OF SOFTWARE ENGINEERING AND KNOWLEDGE ENGINEERING, 2018, 28 (08) : 1177 - 1194
  • [44] A filter feature selection method for clustering
    Jouve, PE
    Nicoloyannis, N
    [J]. FOUNDATIONS OF INTELLIGENT SYSTEMS, PROCEEDINGS, 2005, 3488 : 583 - 593
  • [45] Information gain-based semi-supervised feature selection for hybrid data
    Shu, Wenhao
    Yan, Zhenchao
    Yu, Jianhui
    Qian, Wenbin
    [J]. APPLIED INTELLIGENCE, 2023, 53 (06) : 7310 - 7325
  • [46] Information gain-based semi-supervised feature selection for hybrid data
    Wenhao Shu
    Zhenchao Yan
    Jianhui Yu
    Wenbin Qian
    [J]. Applied Intelligence, 2023, 53 : 7310 - 7325
  • [47] Feature selection based on mutual information and redundancy-synergy coefficient
    Yang S.
    Gu J.
    [J]. Journal of Zhejiang University-SCIENCE A, 2004, 5 (11): : 1382 - 1391
  • [48] Unsupervised Feature Selection: Minimize Information Redundancy of Features
    Yen, Chun-Chao
    Chen, Liang-Chieh
    Lin, Shou-De
    [J]. INTERNATIONAL CONFERENCE ON TECHNOLOGIES AND APPLICATIONS OF ARTIFICIAL INTELLIGENCE (TAAI 2010), 2010, : 247 - 254
  • [49] Feature selection based on mutual information and redundancy-synergy coefficient
    杨胜
    顾钧
    [J]. Journal of Zhejiang University-Science A(Applied Physics & Engineering), 2004, (11) : 71 - 80
  • [50] A feature selection algorithm based on redundancy analysis and interaction weight
    Gu, Xiangyuan
    Guo, Jichang
    Li, Chongyi
    Xiao, Lijun
    [J]. APPLIED INTELLIGENCE, 2021, 51 (04) : 2672 - 2686