Feature Selection in High Dimensional Data: A Review

Cited by: 1
Authors
Silaich, Sarita [1]
Gupta, Suneet [2]
Affiliations
[1] Govt Polytech Coll Jhunjhunu, Dept Comp Sci & Engn, Jhunjhunu, India
[2] Mody Univ Laxmangarh, CSE Dept, Sikar, India
Source
THIRD CONGRESS ON INTELLIGENT SYSTEMS, CIS 2022, VOL 1 | 2023 / Vol. 608
Keywords
Feature selection; Filter; Wrapper; Embedded; High dimensional data; Machine learning
DOI
10.1007/978-981-19-9225-4_51
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification code
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Feature selection, which chooses a suitable subset of the available features, is essential in machine learning for reducing data volume and improving classifier performance. Data sizes are growing rapidly in fields such as text classification, microarray analysis, bioinformatics, gene expression, and information retrieval. In high-dimensional or big data, noisy or irrelevant features degrade the accuracy of a learning model's predictions, so reducing data dimensionality is a challenge. This paper introduces the concepts of feature relevance, relevant feature selection, and evaluation criteria. It also gives an overview and comparison of existing feature selection methods for various application domains.
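
As a rough illustration of the "Filter" category named in the keywords, the sketch below scores each feature independently of any classifier and keeps the top-scoring ones. It is not taken from the paper: the scoring rule (absolute Pearson correlation with the label), the function names `pearson` and `select_top_k`, and the toy data are all assumptions made for this example.

```python
from math import sqrt


def pearson(xs, ys):
    """Pearson correlation of two equal-length sequences; 0.0 if either is constant."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        return 0.0  # a constant column carries no information about the label
    return cov / sqrt(var_x * var_y)


def select_top_k(rows, labels, k):
    """Hypothetical filter-style selector: rank columns by |correlation| with the label."""
    n_features = len(rows[0])
    scores = []
    for j in range(n_features):
        column = [row[j] for row in rows]
        scores.append((abs(pearson(column, labels)), j))
    # Highest score first; ties broken by column index for determinism.
    scores.sort(key=lambda s: (-s[0], s[1]))
    return sorted(j for _, j in scores[:k])


# Toy data (made up): feature 0 tracks the label, feature 1 is constant,
# feature 2 is unrelated to the label.
rows = [
    [0.1, 5.0, 0.7],
    [0.2, 5.0, 0.1],
    [0.9, 5.0, 0.6],
    [1.0, 5.0, 0.2],
]
labels = [0, 0, 1, 1]

selected = select_top_k(rows, labels, k=1)
print(selected)
```

Wrapper and embedded methods, the other two categories in the keywords, differ in that they involve a learning model in the selection step. A univariate filter like this one is cheap but cannot detect features that are only useful in combination.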
Pages: 703-717
Page count: 15
Related papers
50 in total
  • [41] A Hybrid Scheme for Feature Selection of High Dimensional Educational Data
    Ali, Usman
    Arif, Khawaja Sarmad
    Qamar, Usman
    2019 INTERNATIONAL CONFERENCE ON COMMUNICATION TECHNOLOGIES (COMTECH), 2019, : 71 - 75
  • [42] Feature Selection on High Dimensional Data using Wrapper Based Subset Selection
    Manikandan, G.
    Susi, E.
    Abirami, S.
    2017 SECOND INTERNATIONAL CONFERENCE ON RECENT TRENDS AND CHALLENGES IN COMPUTATIONAL MODELS (ICRTCCM), 2017, : 320 - 325
  • [43] Combining feature selection and feature construction to improve concept learning for high dimensional data
    Hanczar, B
    ABSTRACTION, REFORMULATION AND APPROXIMATION, PROCEEDINGS, 2005, 3607 : 261 - 273
  • [44] A Light Causal Feature Selection Approach to High-Dimensional Data
    Ling, Zhaolong
    Li, Ying
    Zhang, Yiwen
    Yu, Kui
    Zhou, Peng
    Li, Bo
    Wu, Xindong
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2023, 35 (08) : 7639 - 7650
  • [45] Feature Selection Strategies for Classifying High Dimensional Astronomical Data Sets
    Donalek, Ciro
    Djorgovski, S. G.
    Mahabal, Ashish A.
    Graham, Matthew J.
    Drake, Andrew J.
    Fuchs, Thomas J.
    Turmon, Michael J.
    Kumar, Arun A.
    Philip, N. Sajeeth
    Yang, Michael Ting-Chang
    Longo, Giuseppe
    2013 IEEE INTERNATIONAL CONFERENCE ON BIG DATA, 2013,
  • [46] Single Sequence Fast Feature Selection for High-Dimensional Data
    Boldt, Francisco de Assis
    Rauber, Thomas W.
    Varejao, Flavio M.
    2015 IEEE 27TH INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI 2015), 2015, : 697 - 704
  • [47] Filter Feature Selection Performance Comparison in High-dimensional Data
    Huertas, Carlos
    Juarez-Ramirez, Reyes
    2014 17TH INTERNATIONAL CONFERENCE ON INFORMATION FUSION (FUSION), 2014,
  • [48] Feature selection based on geometric distance for high-dimensional data
    Lee, J. -H.
    Oh, S. -Y.
    ELECTRONICS LETTERS, 2016, 52 (06) : 473 - 474
  • [49] Diagonal Discriminant Analysis With Feature Selection for High-Dimensional Data
    Romanes, Sarah E.
    Ormerod, John T.
    Yang, Jean Y. H.
    JOURNAL OF COMPUTATIONAL AND GRAPHICAL STATISTICS, 2020, 29 (01) : 114 - 127
  • [50] Distributed Feature Selection using Vertical Partitioning for High Dimensional Data
    Prasad, Bakshi Rohit
    Bendale, Unmesh Kishor
    Agarwal, Sonali
    2016 INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTING, COMMUNICATIONS AND INFORMATICS (ICACCI), 2016, : 807 - 813