A Comprehensive Review of Feature Selection and Feature Selection Stability in Machine Learning

被引:10
|
作者
Buyukkececi, Mustafa [1 ]
Okur, Mehmet Cudi [2 ]
机构
[1] Univerlist, Izmir, Turkiye
[2] Yasar Univ, Fac Engn, Dept Software Engn, Izmir, Turkiye
来源
GAZI UNIVERSITY JOURNAL OF SCIENCE | 2023年 / 36卷 / 04期
关键词
Feature selection; Dimensionality reduction; Types of feature selection; Feature selection stability; Stability measures; MICROARRAY; ALGORITHMS; BIAS;
D O I
10.35378/gujs.993763
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Feature selection is a dimension reduction technique used to select features that are relevant to machine learning tasks. Reducing the dataset size by eliminating redundant and irrelevant features plays a pivotal role in increasing the performance of machine learning algorithms, speeding up the learning process, and building simple models. The apparent need for feature selection has aroused considerable interest amongst researchers and has caused feature selection to find a wide range of application domains including text mining, pattern recognition, cybersecurity, bioinformatics, and big data. As a result, over the years, a substantial amount of literature has been published on feature selection and a wide variety of feature selection methods have been proposed. The quality of feature selection algorithms is measured not only by evaluating the quality of the models built using the features they select, or by the clustering tendencies of the features they select, but also by their stability. Therefore, this study focused on feature selection and feature selection stability. In the pages that follow, general concepts and methods of feature selection, feature selection stability, stability measures, and reasons and solutions for instability are discussed.
引用
收藏
页码:1506 / 1520
页数:15
相关论文
共 50 条
  • [41] On the Stability of Feature Selection Algorithms
    Nogueira, Sarah
    Sechidis, Konstantinos
    Brown, Gavin
    JOURNAL OF MACHINE LEARNING RESEARCH, 2018, 18
  • [42] Stability of feature selection algorithms
    Kalousis, A
    Prados, J
    Hilario, M
    FIFTH IEEE INTERNATIONAL CONFERENCE ON DATA MINING, PROCEEDINGS, 2005, : 218 - 225
  • [43] A Review of the Stability of Feature Selection Techniques for Bioinformatics Data
    Awada, Wael
    Khoshgoftaar, Taghi M.
    Dittman, David
    Wald, Randall
    Napolitano, Amri
    2012 IEEE 13TH INTERNATIONAL CONFERENCE ON INFORMATION REUSE AND INTEGRATION (IRI), 2012, : 356 - 363
  • [44] Nonlinear feature selection by relevance feature vector machine
    Cheng, Haibin
    Chen, Haifeng
    Jiang, Guofei
    Yoshihira, Kenji
    MACHINE LEARNING AND DATA MINING IN PATTERN RECOGNITION, PROCEEDINGS, 2007, 4571 : 144 - +
  • [45] Impacts of Feature Selection on Predicting Machine Failures by Machine Learning Algorithms
    Bezerra, Francisco Elanio
    de Oliveira Neto, Geraldo Cardoso
    Cervi, Gabriel Magalhaes
    Mazetto, Rafaella Francesconi
    de Faria, Aline Mariane
    Vido, Marcos
    Lima, Gustavo Araujo
    de Araujo, Sidnei Alves
    Sampaio, Mauro
    Amorim, Marlene
    APPLIED SCIENCES-BASEL, 2024, 14 (08):
  • [46] Solar Flare Prediction Using Advanced Feature Extraction, Machine Learning, and Feature Selection
    Omar W. Ahmed
    Rami Qahwaji
    Tufan Colak
    Paul A. Higgins
    Peter T. Gallagher
    D. Shaun Bloomfield
    Solar Physics, 2013, 283 : 157 - 175
  • [47] Machine learning-based intrusion detection: feature selection versus feature extraction
    Ngo, Vu-Duc
    Vuong, Tuan-Cuong
    Van Luong, Thien
    Tran, Hung
    CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2024, 27 (03): : 2365 - 2379
  • [48] A Survey of Feature Selection for Vulnerability Prediction Using Feature-based Machine Learning
    Li, ZhanJun
    Shao, Yan
    ICMLC 2019: 2019 11TH INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND COMPUTING, 2019, : 30 - 36
  • [49] Solar Flare Prediction Using Advanced Feature Extraction, Machine Learning, and Feature Selection
    Ahmed, Omar W.
    Qahwaji, Rami
    Colak, Tufan
    Higgins, Paul A.
    Gallagher, Peter T.
    Bloomfield, D. Shaun
    SOLAR PHYSICS, 2013, 283 (01) : 157 - 175
  • [50] A Literature Review of Feature Selection Techniques and Applications Review of feature selection in data mining
    Visalakshi, S.
    Radha, V.
    2014 IEEE INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND COMPUTING RESEARCH (IEEE ICCIC), 2014, : 966 - 971