On Supervised Feature Selection from High Dimensional Feature Spaces

被引:12
|
作者
Yang, Yijing [1 ]
Wang, Wei [1 ]
Fu, Hongyu [1 ]
Kuo, C. -C. Jay [1 ]
机构
[1] Univ Southern Calif, Los Angeles, CA 90007 USA
关键词
Machine learning; classification; regression; supervised feature selection;
D O I
10.1561/116.00000016
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
The application of machine learning to image and video data often yields a high dimensional feature space. Effective feature selection techniques identify a discriminant feature subspace that lowers computational and modeling costs with little performance degradation. A novel supervised feature selection methodology is proposed for machine learning decisions in this work. The resulting tests are called the discriminant feature test (DFT) and the relevant feature test (RFT) for the classification and regression problems, respectively. The DFT and RFT procedures are described in detail. Furthermore, we compare the effectiveness of DFT and RFT with several classic feature selection methods. To this end, we use deep features obtained by LeNet-5 for MNIST and Fashion-MNIST datasets as illustrative examples. Other datasets with handcrafted and gene expressions features are also included for performance evaluation. It is shown by experimental results that DFT and RFT can select a lower dimensional feature subspace distinctly and robustly while maintaining high decision performance.
引用
收藏
页数:23
相关论文
共 50 条
  • [1] GENEmops: Supervised feature selection from high dimensional biomedical dataset
    Agarwalla, Prativa
    Mukhopadhyay, Sumitra
    [J]. APPLIED SOFT COMPUTING, 2022, 123
  • [2] Unsupervised Feature Selection in High Dimensional Spaces and Uncertainty
    Villar, Jose R.
    Suarez, Maria R.
    Sedano, Javier
    Mateos, Felipe
    [J]. HYBRID ARTIFICIAL INTELLIGENCE SYSTEMS, 2009, 5572 : 565 - +
  • [3] Tournament screening cum EBIC for feature selection with high-dimensional feature spaces
    CHEN ZeHua CHEN JiaHua Department of Statistics Applied ProbabilityNational University of Singapore Science Drive Singapore Department of StatisticsUniversity of British ColumbiaVancouverBCVT ZCanada
    [J]. Science in China(Series A:Mathematics), 2009, 52 (06) - 1341
  • [4] Tournament screening cum EBIC for feature selection with high-dimensional feature spaces
    ZeHua Chen
    JiaHua Chen
    [J]. Science in China Series A: Mathematics, 2009, 52 : 1327 - 1341
  • [5] Tournament screening cum EBIC for feature selection with high-dimensional feature spaces
    CHEN ZeHua1 & CHEN JiaHua2 1 Department of Statistics & Applied Probability
    [J]. Science China Mathematics, 2009, (06) : 1327 - 1341
  • [6] Design and optimization of an autonomous feature selection pipeline for high dimensional, heterogeneous feature spaces
    Schlegel, Bernhard
    Sick, Bernhard
    [J]. PROCEEDINGS OF 2016 IEEE SYMPOSIUM SERIES ON COMPUTATIONAL INTELLIGENCE (SSCI), 2016,
  • [7] Tournament screening cum EBIC for feature selection with high-dimensional feature spaces
    Chen Zehua
    Chen JiaHua
    [J]. SCIENCE IN CHINA SERIES A-MATHEMATICS, 2009, 52 (06): : 1327 - 1341
  • [8] Semi supervised relevance learning for feature selection on high dimensional data
    Ben Brahim, Afef
    Kalousis, Alexandros
    [J]. 2017 IEEE/ACS 14TH INTERNATIONAL CONFERENCE ON COMPUTER SYSTEMS AND APPLICATIONS (AICCSA), 2017, : 579 - 584
  • [9] Stability of feature selection algorithms: a study on high-dimensional spaces
    Kalousis, Alexandros
    Prados, Julien
    Hilario, Melanie
    [J]. KNOWLEDGE AND INFORMATION SYSTEMS, 2007, 12 (01) : 95 - 116
  • [10] SUBMODULAR FEATURE SELECTION FOR HIGH-DIMENSIONAL ACOUSTIC SCORE SPACES
    Liu, Yuzong
    Wei, Kai
    Kirchhoff, Katrin
    Song, Yisong
    Bilmes, Jeff
    [J]. 2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2013, : 7184 - 7188