A general framework for boosting feature subset selection algorithms

Cited by: 14
Authors
Perez-Rodriguez, Javier [1]
de Haro-Garcia, Aida [1]
Romero del Castillo, Juan A. [1]
Garcia-Pedrajas, Nicolas [1]
Affiliations
[1] Univ Cordoba, Campus Rabanales, Cordoba 14011, Spain
Keywords
Feature selection; Boosting; Classifier ensembles; Classification; Ensembles
DOI
10.1016/j.inffus.2018.03.003
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
Feature selection is one of the most important tasks in many machine learning and data mining problems. Due to the increasing size of the problems, removing useless, erroneous or noisy features is frequently an initial step that is performed before other data mining algorithms are applied. The aim is to reproduce, or even improve, the performance of the data mining algorithm when all the features are used. Furthermore, the selection of the most relevant features may offer the expert valuable information about the problem to be solved. Over the past few decades, many different feature selection algorithms have been proposed, each with its own strengths and weaknesses. However, as in the case of classification, it is unlikely that a single feature selection algorithm would be able to achieve good results across many different datasets and application fields. Furthermore, when we are dealing with thousands of features, the most powerful feature selection methods are frequently too time consuming to be applied. In classification, one of the most successful ways of consistently improving the performance of a single weak learner is to construct ensembles using boosting methods. In this paper, we propose a general framework for feature selection boosting in the same way boosting is applied to classification. The proposed approach opens a new field of research in which to apply the many techniques developed for boosting classifiers. Using 120 datasets, the experiments reported show a clear improvement in several state-of-the-art feature selection algorithms using the proposed methodology.
Pages: 147-175
Number of pages: 29