Feature Selection in High Dimensional Data: A Review

被引:1
|
作者
Silaich, Sarita [1 ]
Gupta, Suneet [2 ]
机构
[1] Govt Polytech Coll Jhunjhunu, Dept Comp Sci & Engn, Jhunjhunu, India
[2] Mody Univ Laxmangarh, CSE Dept, Sikar, India
来源
THIRD CONGRESS ON INTELLIGENT SYSTEMS, CIS 2022, VOL 1 | 2023年 / 608卷
关键词
Feature selection; Filter; Wrapper; Embedded; High dimensional data; Machine learning;
D O I
10.1007/978-981-19-9225-4_51
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
By choosing an ideal subset of the total features, feature selection in machine learning is essential to reducing the quantity of the data and increasing classifier performance. Nowadays, the size of data is increasing exponentially in fields like text classification, microarray data, bioinformatics, gene expression, information retrieval, etc. In high dimensional or big data, the learning model's predictions are not accurate because of noisy or irrelevant features, so there is a challenge to reduce the data dimensionality. This paper introduces the concepts of feature relevance, relevant feature selection, and evaluation criteria. An overview and comparison of existing feature selection methods for various application domains are also done.
引用
收藏
页码:703 / 717
页数:15
相关论文
共 50 条
  • [31] A hybrid feature selection scheme for high-dimensional data
    Ganjei, Mohammad Ahmadi
    Boostani, Reza
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2022, 113
  • [32] On the scalability of feature selection methods on high-dimensional data
    V. Bolón-Canedo
    D. Rego-Fernández
    D. Peteiro-Barral
    A. Alonso-Betanzos
    B. Guijarro-Berdiñas
    N. Sánchez-Maroño
    Knowledge and Information Systems, 2018, 56 : 395 - 442
  • [33] Simultaneous Feature and Model Selection for High-Dimensional Data
    Perolini, Alessandro
    Guerif, Sebastien
    2011 23RD IEEE INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI 2011), 2011, : 47 - 50
  • [34] Dynamic Feature Selection for Clustering High Dimensional Data Streams
    Fahy, Conor
    Yang, Shengxiang
    IEEE ACCESS, 2019, 7 : 127128 - 127140
  • [35] Analysis of high dimensional data using feature selection models
    Mahajan, Shubham
    Pandit, Amit Kant
    INTERNATIONAL JOURNAL OF NANOTECHNOLOGY, 2023, 20 (1-4) : 116 - 128
  • [36] Unsupervised Feature Selection for Efficient Exploration of High Dimensional Data
    Chakrabarti, Arnab
    Das, Abhijeet
    Cochez, Michael
    Quix, Christoph
    ADVANCES IN DATABASES AND INFORMATION SYSTEMS, ADBIS 2021, 2021, 12843 : 183 - 197
  • [37] Hybrid Feature Selection for High-Dimensional Manufacturing Data
    Sun, Yajuan
    Yu, Jianlin
    Li, Xiang
    Wu, Ji Yan
    Lu, Wen Feng
    2021 26TH IEEE INTERNATIONAL CONFERENCE ON EMERGING TECHNOLOGIES AND FACTORY AUTOMATION (ETFA), 2021,
  • [38] Feature Selection for High-Dimensional Data: The Issue of Stability
    Pes, Barbara
    2017 IEEE 26TH INTERNATIONAL CONFERENCE ON ENABLING TECHNOLOGIES - INFRASTRUCTURE FOR COLLABORATIVE ENTERPRISES (WETICE), 2017, : 170 - 175
  • [39] A hybrid feature selection method for high-dimensional data
    Taheri, Nooshin
    Nezamabadi-pour, Hossein
    2014 4TH INTERNATIONAL CONFERENCE ON COMPUTER AND KNOWLEDGE ENGINEERING (ICCKE), 2014, : 141 - 145
  • [40] Information Theoretic Feature Selection for High Dimensional Metagenomic Data
    Ditzler, Gregory
    Rosen, Gail
    Polikar, Robi
    2012 IEEE INTERNATIONAL WORKSHOP ON GENOMIC SIGNAL PROCESSING AND STATISTICS (GENSIPS), 2012, : 143 - 146