RaSE: A Variable Screening Framework via Random Subspace Ensembles

被引:5
|
作者
Tian, Ye [1 ]
Feng, Yang [2 ]
机构
[1] Columbia Univ, Dept Stat, New York, NY USA
[2] NYU, Sch Global Publ Hlth, Dept Biostat, New York, NY 10027 USA
关键词
Ensemble learning; High-dimensional data; Random subspace method; Rank consistency; Sure screening property; Variable screening; Variable selection; KOLMOGOROV FILTER; GENE-EXPRESSION; SELECTION; REGRESSION; MODELS;
D O I
10.1080/01621459.2021.1938084
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Variable screening methods have been shown to be effective in dimension reduction under the ultra-high dimensional setting. Most existing screening methods are designed to rank the predictors according to their individual contributions to the response. As a result, variables that are marginally independent but jointly dependent with the response could be missed. In this work, we propose a new framework for variable screening, random subspace ensemble (RaSE), which works by evaluating the quality of random subspaces that may cover multiple predictors. This new screening framework can be naturally combined with any subspace evaluation criterion, which leads to an array of screening methods. The framework is capable to identify signals with no marginal effect or with high-order interaction effects. It is shown to enjoy the sure screening property and rank consistency. We also develop an iterative version of RaSE screening with theoretical support. Extensive simulation studies and real-data analysis show the effectiveness of the new screening framework.
引用
收藏
页码:457 / 468
页数:12
相关论文
共 50 条
  • [41] Recognizing driving postures by combined features of contourlet transform and edge orientation histogram, and random subspace classifier ensembles
    Zhao, Chihang
    Zhang, Xiaozheng
    Zhang, Yunsheng
    Dang, Qian
    Zhang, Xiaoqin
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2014, 27 (04) : 2011 - 2021
  • [42] Variable Screening via Quantile Partial Correlation
    Ma, Shujie
    Li, Runze
    Tsai, Chih-Ling
    JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2017, 112 (518) : 650 - 663
  • [43] Exact density of states for finite Gaussian random matrix ensembles via supersymmetry
    Kalisch, F
    Braak, D
    JOURNAL OF PHYSICS A-MATHEMATICAL AND GENERAL, 2002, 35 (47): : 9957 - 9969
  • [44] A framework for measuring association of random vectors via collapsed random variables
    Hofert, Marius
    Oldford, Wayne
    Prasad, Avinash
    Zhu, Mu
    JOURNAL OF MULTIVARIATE ANALYSIS, 2019, 172 : 5 - 27
  • [45] A Model-Agnostic Randomized Learning Framework based on Random Hypothesis Subspace Sampling
    Cao, Yiting
    Lan, Chao
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 162, 2022,
  • [46] Machine learning with random subspace ensembles identifies antimicrobial resistance determinants from pan-genomes of three pathogens
    Hyun, Jason C.
    Kavvas, Erol S.
    Monk, Jonathan M.
    Palsson, Bernhard O.
    PLOS COMPUTATIONAL BIOLOGY, 2020, 16 (03)
  • [47] Classification of driver fatigue expressions by combined curvelet features and gabor features, and random subspace ensembles of support vector machines
    Zhao, Chihang
    Lian, Jie
    Dang, Qian
    Tong, Can
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2014, 26 (01) : 91 - 100
  • [48] Anti-cross validation technique for constructing and boosting random subspace neural network ensembles for hyperspectral image classification
    Eeti, Laxmi Narayana
    Buddhiraju, Krishna Mohan
    GEOCARTO INTERNATIONAL, 2021, 36 (06) : 676 - 697
  • [49] Motor Imagery Eeg Classification Using Random Subspace Ensemble Network With Variable Length Features
    Nazi Z.A.
    Hossain A.B.M.A.
    Islam M.M.
    International Journal Bioautomation, 2021, 25 (01) : 13 - 24
  • [50] Multilinear Subspace Analysis of State Ensembles in Gas-liquid Two Phase Flow via PWUD Signal
    Li, Zhao
    Zhang, Shumei
    Dong, Feng
    2023 IEEE INTERNATIONAL INSTRUMENTATION AND MEASUREMENT TECHNOLOGY CONFERENCE, I2MTC, 2023,