Principled sure independence screening for Cox models with ultra-high-dimensional covariates

被引:162
|
作者
Zhao, Sihai Dave [1 ]
Li, Yi [1 ]
机构
[1] Harvard Univ, Sch Publ Hlth, Dept Biostat, Boston, MA 02115 USA
关键词
Cox model; Multiple myeloma; Sure independence screening; Ultra-high-dimensional covariates; Variable selection; NONCONCAVE PENALIZED LIKELIHOOD; FALSE DISCOVERY RATE; VARIABLE SELECTION; GENE-EXPRESSION; MULTIPLE-MYELOMA; ADAPTIVE LASSO; REGRESSION; SHRINKAGE;
D O I
10.1016/j.jmva.2011.08.002
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
It is rather challenging for current variable selectors to handle situations where the number of covariates under consideration is ultra-high. Consider a motivating clinical trial of the drug bortezomib for the treatment of multiple myeloma, where overall survival and expression levels of 44760 probesets were measured for each of 80 patients with the goal of identifying genes that predict survival after treatment. This dataset defies analysis even with regularized regression. Some remedies have been proposed for the linear model and for generalized linear models, but there are few solutions in the survival setting and, to our knowledge, no theoretical support. Furthermore, existing strategies often involve tuning parameters that are difficult to interpret. In this paper, we propose and theoretically justify a principled method for reducing dimensionality in the analysis of censored data by selecting only the important covariates. Our procedure involves a tuning parameter that has a simple interpretation as the desired false positive rate of this selection. We present simulation results and apply the proposed procedure to analyze the aforementioned myeloma study. (C) 2011 Elsevier Inc. All rights reserved.
引用
下载
收藏
页码:397 / 411
页数:15
相关论文
共 50 条
  • [11] Robust sure independence screening for nonpolynomial dimensional generalized linear models
    Ghosh, Abhik
    Ponzi, Erica
    Sandanger, Torkjel
    Thoresen, Magne
    SCANDINAVIAN JOURNAL OF STATISTICS, 2023, 50 (03) : 1232 - 1262
  • [12] Forward regression for Cox models with high-dimensional covariates
    Hong, Hyokyoung G.
    Zheng, Qi
    Li, Yi
    JOURNAL OF MULTIVARIATE ANALYSIS, 2019, 173 : 268 - 290
  • [13] Quantile screening for ultra-high-dimensional heterogeneous data conditional on some variables
    Liu, Yi
    Chen, Xiaolin
    JOURNAL OF STATISTICAL COMPUTATION AND SIMULATION, 2018, 88 (02) : 329 - 342
  • [14] Feature screening for ultra-high-dimensional data via multiscale graph correlation
    Deng, Luojia
    Wu, Jinhai
    Zhang, Bin
    Zhang, Yue
    COMMUNICATIONS IN STATISTICS-THEORY AND METHODS, 2023, 53 (22) : 7942 - 7979
  • [15] ExSIS: Extended sure independence screening for ultrahigh-dimensional linear models
    Ahmed, Talal
    Bajwa, Waheed U.
    SIGNAL PROCESSING, 2019, 159 : 33 - 48
  • [16] Sparse model identification and learning for ultra-high-dimensional additive partially linear models
    Li, Xinyi
    Wang, Li
    Nettleton, Dan
    JOURNAL OF MULTIVARIATE ANALYSIS, 2019, 173 : 204 - 228
  • [17] Conditional screening for ultra-high dimensional covariates with survival outcomes
    Hong, Hyokyoung G.
    Kang, Jian
    Li, Yi
    LIFETIME DATA ANALYSIS, 2018, 24 (01) : 45 - 71
  • [18] SIS: An R Package for Sure Independence Screening in Ultrahigh-Dimensional Statistical Models
    Saldana, Diego Franco
    Feng, Yang
    JOURNAL OF STATISTICAL SOFTWARE, 2018, 83 (02): : 1 - 25
  • [19] Conditional screening for ultra-high dimensional covariates with survival outcomes
    Hyokyoung G. Hong
    Jian Kang
    Yi Li
    Lifetime Data Analysis, 2018, 24 : 45 - 71
  • [20] Sure independence screening for ultrahigh dimensional feature space
    Fan, Jianqing
    Lv, Jinchi
    JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 2008, 70 : 849 - 883