Principled sure independence screening for Cox models with ultra-high-dimensional covariates

Cited by: 162
Authors
Zhao, Sihai Dave [1]
Li, Yi [1]
Affiliations
[1] Harvard Univ, Sch Publ Hlth, Dept Biostat, Boston, MA 02115 USA
Keywords
Cox model; Multiple myeloma; Sure independence screening; Ultra-high-dimensional covariates; Variable selection; NONCONCAVE PENALIZED LIKELIHOOD; FALSE DISCOVERY RATE; VARIABLE SELECTION; GENE-EXPRESSION; MULTIPLE-MYELOMA; ADAPTIVE LASSO; REGRESSION; SHRINKAGE;
DOI
10.1016/j.jmva.2011.08.002
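Chinese Library Classification number
O21 [Probability theory and mathematical statistics]; C8 [Statistics];
Discipline classification code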
020208 ; 070103 ; 0714 ;
Abstract
It is rather challenging for current variable selectors to handle situations where the number of covariates under consideration is ultra-high. Consider a motivating clinical trial of the drug bortezomib for the treatment of multiple myeloma, where overall survival and expression levels of 44760 probesets were measured for each of 80 patients with the goal of identifying genes that predict survival after treatment. This dataset defies analysis even with regularized regression. Some remedies have been proposed for the linear model and for generalized linear models, but there are few solutions in the survival setting and, to our knowledge, no theoretical support. Furthermore, existing strategies often involve tuning parameters that are difficult to interpret. In this paper, we propose and theoretically justify a principled method for reducing dimensionality in the analysis of censored data by selecting only the important covariates. Our procedure involves a tuning parameter that has a simple interpretation as the desired false positive rate of this selection. We present simulation results and apply the proposed procedure to analyze the aforementioned myeloma study. (C) 2011 Elsevier Inc. All rights reserved.
Pages: 397-411
Number of pages: 15
Related papers
50 in total
  • [1] Nonparametric Independence Screening in Sparse Ultra-High-Dimensional Additive Models
    Fan, Jianqing
    Feng, Yang
    Song, Rui
    JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2011, 106 (494) : 544 - 557
  • [2] Nonparametric Independence Screening in Sparse Ultra-High-Dimensional Varying Coefficient Models
    Fan, Jianqing
    Ma, Yunbei
    Dai, Wei
    JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2014, 109 (507) : 1270 - 1284
  • [3] Nonparametric independence screening for ultra-high-dimensional longitudinal data under additive models
    Niu, Yong
    Zhang, Riquan
    Liu, Jicai
    Li, Huapeng
    JOURNAL OF NONPARAMETRIC STATISTICS, 2018, 30 (04) : 884 - 905
  • [4] A sure independence screening procedure for ultra-high dimensional partially linear additive models
    Kazemi, M.
    Shahsavani, D.
    Arashi, M.
    JOURNAL OF APPLIED STATISTICS, 2019, 46 (08) : 1385 - 1403
  • [5] SURE INDEPENDENCE SCREENING ADJUSTED FOR CONFOUNDING COVARIATES WITH ULTRAHIGH DIMENSIONAL DATA
    Wen, Canhong
    Pan, Wenliang
    Huang, Mian
    Wang, Xueqin
    STATISTICA SINICA, 2018, 28 (01) : 293 - 317
  • [6] Variable selection for ultra-high-dimensional logistic models
    Du, Pang
    Wu, Pan
    Liang, Hua
    PERSPECTIVES ON BIG DATA ANALYSIS: METHODOLOGIES AND APPLICATIONS, 2014, 622 : 141 - 158
  • [7] Sure independence screening in ultrahigh dimensional generalized additive models
    Yang, Guangren
    Yao, Weixin
    Xiang, Sijia
    JOURNAL OF STATISTICAL PLANNING AND INFERENCE, 2019, 199 : 126 - 135
  • [8] Additive partially linear models for ultra-high-dimensional regression
    Li, Xinyi
    Wang, Li
    Nettleton, Dan
    STAT, 2019, 8 (01):
  • [9] Conditional distance correlation sure independence screening for ultra-high dimensional survival data
    Lu, Shuiyun
    Chen, Xiaolin
    Wang, Hong
    COMMUNICATIONS IN STATISTICS-THEORY AND METHODS, 2021, 50 (08) : 1936 - 1953
  • [10] Group screening for ultra-high-dimensional feature under linear model
    Niu, Yong
    Zhang, Riquan
    Liu, Jicai
    Li, Huapeng
    STATISTICAL THEORY AND RELATED FIELDS, 2020, 4 (01) : 43 - 54