RaSE: A Variable Screening Framework via Random Subspace Ensembles

Cited by: 5
Authors
Tian, Ye [1]
Feng, Yang [2]
Affiliations
[1] Columbia Univ, Dept Stat, New York, NY USA
[2] NYU, Sch Global Publ Hlth, Dept Biostat, New York, NY 10027 USA
Keywords
Ensemble learning; High-dimensional data; Random subspace method; Rank consistency; Sure screening property; Variable screening; Variable selection; KOLMOGOROV FILTER; GENE-EXPRESSION; SELECTION; REGRESSION; MODELS
DOI
10.1080/01621459.2021.1938084
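Chinese Library Classification (CLC) number
O21 [Probability Theory and Mathematical Statistics]; C8 [Statistics]
Discipline classification code
020208; 070103; 0714
Abstract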
Variable screening methods have been shown to be effective for dimension reduction in the ultra-high-dimensional setting. Most existing screening methods rank the predictors according to their individual contributions to the response. As a result, variables that are marginally independent of the response but jointly dependent with it can be missed. In this work, we propose a new framework for variable screening, random subspace ensemble (RaSE), which works by evaluating the quality of random subspaces that may cover multiple predictors. This screening framework can be combined with any subspace evaluation criterion, which leads to an array of screening methods. The framework can identify signals with no marginal effect or with high-order interaction effects. It is shown to enjoy the sure screening property and rank consistency. We also develop an iterative version of RaSE screening with theoretical support. Extensive simulation studies and real-data analysis show the effectiveness of the new screening framework.
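The abstract describes the mechanism only at a high level, so the following is a minimal Python sketch of one way a random-subspace screening could be organized; it is not the authors' implementation. Several things here are assumptions made for illustration: the function names `cell_majority_accuracy` and `random_subspace_screen`, the parameters `n_groups`, `n_candidates`, and `max_size`, the choice of criterion, the rule of keeping the best-scoring subspace in each group, and the use of selection frequency as the screening score.

```python
import random
from collections import Counter, defaultdict


def cell_majority_accuracy(X, y, subspace):
    """Hypothetical subspace criterion (an assumption, not from the paper):
    training accuracy of a majority-vote rule fitted separately on each
    distinct value pattern of the features in `subspace`."""
    cells = defaultdict(Counter)
    for row, label in zip(X, y):
        cells[tuple(row[j] for j in subspace)][label] += 1
    return sum(max(c.values()) for c in cells.values()) / len(y)


def random_subspace_screen(X, y, criterion, n_groups=100, n_candidates=60,
                           max_size=2, seed=0):
    """Score each feature by how often it appears in the best subspace
    found within a group of randomly drawn candidate subspaces."""
    rng = random.Random(seed)
    p = len(X[0])
    counts = Counter()
    for _ in range(n_groups):
        best_score, best_subspace = float("-inf"), ()
        for _ in range(n_candidates):
            # Draw a random subspace that may cover more than one predictor
            size = rng.randint(1, max_size)
            subspace = tuple(sorted(rng.sample(range(p), size)))
            score = criterion(X, y, subspace)
            if score > best_score:
                best_score, best_subspace = score, subspace
        counts.update(best_subspace)
    # Selection frequency of each feature across the groups
    return [counts[j] / n_groups for j in range(p)]


# Toy data (assumed for illustration): y = x0 XOR x1, so in the population
# neither x0 nor x1 is marginally associated with y, but jointly they determine it.
data_rng = random.Random(1)
X = [[data_rng.randint(0, 1) for _ in range(8)] for _ in range(200)]
y = [row[0] ^ row[1] for row in X]

scores = random_subspace_screen(X, y, cell_majority_accuracy)
ranking = sorted(range(len(scores)), key=lambda j: scores[j], reverse=True)
print("selection frequencies:", scores)
print("top two features:", ranking[:2])
```

In this toy setting, a criterion applied to one feature at a time should score x0 and x1 close to chance, whereas any subspace containing both is fitted perfectly, which is the kind of jointly dependent signal the abstract says marginal screening can miss. Because the criterion is passed in as a function, it can be swapped for another subspace evaluation rule without changing the screening loop.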
Pages: 457-468
Number of pages: 12