A tutorial-based survey on feature selection: Recent advancements on feature selection

被引:21
|
作者
Moslemi, Amir [1 ]
机构
[1] Sunnybrook Hlth Sci Ctr, Imaging Res & Phys Sci, Toronto, ON M4N 3M5, Canada
关键词
Feature selection; Matrix factorization; Sparse representation learning; Information theory; Evolutionary computation; Reinforcement learning; UNSUPERVISED FEATURE-SELECTION; SUPERVISED FEATURE-SELECTION; NONNEGATIVE MATRIX FACTORIZATION; PARTICLE SWARM OPTIMIZATION; EFFICIENT FEATURE-SELECTION; SPARSE FEATURE-SELECTION; LABEL FEATURE-SELECTION; HESITANT FUZZY-SETS; GENETIC ALGORITHM; MUTUAL INFORMATION;
D O I
10.1016/j.engappai.2023.107136
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Curse of dimensionality is known as big challenges in data mining, pattern recognition, computer vison and machine learning in recent years. Feature selection and feature extraction are two main approaches to circumvent this challenge. The main objective in feature selection is to remove the redundant features and preserve the relevant features in order to improve the learning algorithm performance. This survey provides a comprehensive overview of state-of-art feature selection techniques including mathematical formulas and fundamental algorithm to facilitate understanding. This survey encompasses different approaches of feature selection which can be categorized to five domains including: A) subspace learning which involves matrix factorization and matrix projection, B) sparse representation learning which includes compressed sensing and dictionary learning, C) information theory which covers multi-label neighborhood entropy, symmetrical un-certainty, Monte Carlo and Markov blanket, D) evolutionary computational algorithms including Genetic algo-rithm (GA), particle swarm optimization (PSO), Ant colony (AC) and Grey wolf optimization (GWO), and E) reinforcement learning techniques. This survey can be helpful for researchers to acquire deep understanding of feature selection techniques and choose a proper feature selection technique. Moreover, researcher can choose one of the A, B, C, D and E domains to become deep in this field for future study. A potential avenue for future research could involve exploring methods to reduce computational complexity while simultaneously maintaining performance efficiency. This would involve investigating ways to achieve a more efficient balance between computational resources and overall performance. For matrix-based techniques, the main limitation of these techniques lies in the need to tune the coefficients of the regularization terms, as this process can be challenging and time-consuming. For evolutionary computational techniques, getting stuck in local minimum and finding an appropriate objective function are two main limitations.
引用
收藏
页数:28
相关论文
共 50 条
  • [21] Feature Selection Based on Semantics
    Chua, Stephanie
    Kulathuramaiyer, Narayanan
    INNOVATIONS AND ADVANCED TECHNIQUES IN SYSTEMS, COMPUTING SCIENCES AND SOFTWARE ENGINEERING, 2008, : 471 - 476
  • [22] Feature selection based on bootstrapping
    Diaz-Diaz, Norberto
    Aguilar-Ruiz, Jesus S.
    Nepomuceno, Juan A.
    Garcia, Jorge
    2005 ICSC CONGRESS ON COMPUTATIONAL INTELLIGENCE METHODS AND APPLICATIONS (CIMA 2005), 2005, : 217 - 222
  • [23] Feature selection based on similarity
    Lazzerini, B
    Marcelloni, F
    ELECTRONICS LETTERS, 2002, 38 (03) : 121 - 122
  • [24] Consistency based feature selection
    Dash, M
    Liu, H
    Motoda, H
    KNOWLEDGE DISCOVERY AND DATA MINING, PROCEEDINGS: CURRENT ISSUES AND NEW APPLICATIONS, 2000, 1805 : 98 - 109
  • [25] Feature selection based on QFT
    Zhou, Rigui
    Jiang, Nan
    Yang, Shuqun
    Ding, Qiulin
    ICNC 2007: THIRD INTERNATIONAL CONFERENCE ON NATURAL COMPUTATION, VOL 1, PROCEEDINGS, 2007, : 159 - +
  • [26] A Survey on Causal Feature Selection Based on Markov Boundary Discovery
    Wu X.
    Jiang B.
    Lü S.
    Wang X.
    Chen Q.
    Chen H.
    Moshi Shibie yu Rengong Zhineng/Pattern Recognition and Artificial Intelligence, 2022, 35 (05): : 422 - 438
  • [27] PCA Indexing based Feature Learning and Feature Selection
    Ibrahim, Marwa Farouk Ibrahim
    Al-Jumaily, Adel Ali
    2016 8TH CAIRO INTERNATIONAL BIOMEDICAL ENGINEERING CONFERENCE (CIBEC), 2016, : 68 - 71
  • [28] Online Streaming Feature Selection Based on Feature Interaction
    Lv, Yan
    Lin, Yaojin
    Chen, Xiangyan
    Wang, Dongxing
    Wang, Chenxi
    11TH IEEE INTERNATIONAL CONFERENCE ON KNOWLEDGE GRAPH (ICKG 2020), 2020, : 49 - 57
  • [29] Feature guide: A statistically based feature selection scheme
    Jane, Y
    Dillon, T
    Pissaloux, E
    2001 INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, VOL II, PROCEEDINGS, 2001, : 717 - 720
  • [30] FEATURE SELECTION BASED ON COMPLEMENTARITY OF FEATURE CLASSIFICATION CAPABILITY
    Gao, Fei
    Yu, Tian
    Wei, Yang
    Jin, Han
    Wei, Jin-Mao
    PROCEEDINGS OF 2013 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS (ICMLC), VOLS 1-4, 2013, : 130 - 135