Practical Challenges and Recommendations of Filter Methods for Feature Selection

被引:7
|
作者
Rajab, Mohammed [1 ]
Wang, Dennis [1 ,2 ,3 ]
机构
[1] Univ Sheffield, Dept Comp Sci, Sheffield, S Yorkshire, England
[2] Sheffield Inst Translat Neurosci, Sheffield, S Yorkshire, England
[3] NIHR Sheffield Biomed Res Ctr, Sheffield, S Yorkshire, England
关键词
Feature selection; filter methods; machine learning; data imbalance; ranking methods; MUTUAL INFORMATION; FEATURE RANKING; RELEVANCE;
D O I
10.1142/S0219649220400195
中图分类号
G25 [图书馆学、图书馆事业]; G35 [情报学、情报工作];
学科分类号
1205 ; 120501 ;
摘要
Feature selection, the process of identifying relevant features to be incorporated into a proposed model, is one of the significant steps of the learning process. It removes noise from the data to increase the learning performance while reducing the computational complexity. The literature review indicated that most previous studies had focused on improving the overall classifier performance or reducing costs associated with training time during building of the classifiers. However, in this era of big data, there is an urgent need to deal with more complex issues that makes feature selection, especially using filter-based methods, more challenging; this in terms of dimensionality, data structures, data format, domain experts' availability, data sparsity, and result discrepancies, among others. Filter methods identify the informative features of a given dataset to establish various predictive models using mathematical models. This paper takes a new route in an attempt to pinpoint recent practical challenges associated with filter methods and discusses potential areas of development to yield better performance. Several practical recommendations, based on recent studies, are made to overcome the identified challenges and make the feature selection process simpler and more efficient.
引用
下载
收藏
页数:15
相关论文
共 50 条
  • [1] Feature Selection: Filter Methods Performance Challenges
    Cherrington, Marianne
    Thabtah, Fadi
    Lu, Joan
    Xu, Qiang
    2019 INTERNATIONAL CONFERENCE ON COMPUTER AND INFORMATION SCIENCES (ICCIS), 2019, : 252 - 255
  • [2] Filter and Embedded Feature Selection Methods to Meet Big Data Visualization Challenges
    ElDahshan, Kamal A.
    AlHabshy, AbdAllah A.
    Mohammed, Luay Thamer
    CMC-COMPUTERS MATERIALS & CONTINUA, 2023, 74 (01): : 817 - 839
  • [3] Filter methods for feature selection -: A comparative study
    Sanchez-Marono, Noelia
    Alonso-Betanzos, Amparo
    Tombilla-Sanroman, Marfa
    INTELLIGENT DATA ENGINEERING AND AUTOMATED LEARNING - IDEAL 2007, 2007, 4881 : 178 - 187
  • [4] A systematic evaluation of filter Unsupervised Feature Selection methods
    Solorio-Fernandez, Saul
    Carrasco-Ochoa, J. Ariel
    Martinez-Trinidad, Jose Fco
    EXPERT SYSTEMS WITH APPLICATIONS, 2020, 162 (162)
  • [5] A New Way for Combining Filter Feature Selection Methods
    Bouaguel, Waad
    Limam, Mohamed
    PROCEEDINGS OF 3RD INTERNATIONAL CONFERENCE ON ADVANCED COMPUTING, NETWORKING AND INFORMATICS (ICACNI 2015), VOL 1, 2016, 43 : 411 - 419
  • [6] Improved filter feature selection methods by samples localization
    Sun, Ying
    Wang, Yan
    Chen, Liang
    Du, Wei
    Cao, Zhongbo
    Zhou, You
    Journal of Bionanoscience, 2013, 7 (04): : 371 - 377
  • [7] Filter feature selection methods for text classification: a review
    Hong Ming
    Wang Heyong
    Multimedia Tools and Applications, 2024, 83 : 2053 - 2091
  • [8] Filter feature selection methods for text classification: a review
    Ming, Hong
    Heyong, Wang
    MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 83 (1) : 2053 - 2091
  • [9] Heuristic filter feature selection methods for medical datasets
    Alirezanejad, Mehdi
    Enayatifar, Rasul
    Motameni, Homayun
    Nematzadeh, Hossein
    GENOMICS, 2020, 112 (02) : 1173 - 1181
  • [10] An evaluation of filter and wrapper methods for feature selection in categorical clustering
    Talavera, L
    ADVANCES IN INTELLIGENT DATA ANALYSIS VI, PROCEEDINGS, 2005, 3646 : 440 - 451