Principled sure independence screening for Cox models with ultra-high-dimensional covariates

被引:162
|
作者
Zhao, Sihai Dave [1 ]
Li, Yi [1 ]
机构
[1] Harvard Univ, Sch Publ Hlth, Dept Biostat, Boston, MA 02115 USA
关键词
Cox model; Multiple myeloma; Sure independence screening; Ultra-high-dimensional covariates; Variable selection; NONCONCAVE PENALIZED LIKELIHOOD; FALSE DISCOVERY RATE; VARIABLE SELECTION; GENE-EXPRESSION; MULTIPLE-MYELOMA; ADAPTIVE LASSO; REGRESSION; SHRINKAGE;
D O I
10.1016/j.jmva.2011.08.002
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
It is rather challenging for current variable selectors to handle situations where the number of covariates under consideration is ultra-high. Consider a motivating clinical trial of the drug bortezomib for the treatment of multiple myeloma, where overall survival and expression levels of 44760 probesets were measured for each of 80 patients with the goal of identifying genes that predict survival after treatment. This dataset defies analysis even with regularized regression. Some remedies have been proposed for the linear model and for generalized linear models, but there are few solutions in the survival setting and, to our knowledge, no theoretical support. Furthermore, existing strategies often involve tuning parameters that are difficult to interpret. In this paper, we propose and theoretically justify a principled method for reducing dimensionality in the analysis of censored data by selecting only the important covariates. Our procedure involves a tuning parameter that has a simple interpretation as the desired false positive rate of this selection. We present simulation results and apply the proposed procedure to analyze the aforementioned myeloma study. (C) 2011 Elsevier Inc. All rights reserved.
引用
下载
收藏
页码:397 / 411
页数:15
相关论文
共 50 条
  • [21] Robust Model Structure Recovery for Ultra-High-Dimensional Varying-Coefficient Models
    Yang, Jing
    Tian, Guo-Liang
    Lu, Xuewen
    Wang, Mingqiu
    COMMUNICATIONS IN MATHEMATICS AND STATISTICS, 2023,
  • [22] Forward variable selection for sparse ultra-high-dimensional generalized varying coefficient models
    Honda, Toshio
    Lin, Chien-Tong
    JAPANESE JOURNAL OF STATISTICS AND DATA SCIENCE, 2021, 4 (01) : 151 - 179
  • [23] Forward variable selection for sparse ultra-high-dimensional generalized varying coefficient models
    Toshio Honda
    Chien-Tong Lin
    Japanese Journal of Statistics and Data Science, 2021, 4 : 151 - 179
  • [24] VARIABLE SELECTION FOR SPARSE HIGH-DIMENSIONAL NONLINEAR REGRESSION MODELS BY COMBINING NONNEGATIVE GARROTE AND SURE INDEPENDENCE SCREENING
    Wu, Shuang
    Xue, Hongqi
    Wu, Yichao
    Wu, Hulin
    STATISTICA SINICA, 2014, 24 (03) : 1365 - 1387
  • [25] Sure independence screening for ultrahigh dimensional feature space Discussion
    Bickel, Peter
    Buehlmann, Peter
    Yao, Qiwei
    Samworth, Richard
    Hall, Peter
    Titterington, D. M.
    Xue, Jing-Hao
    Anagnostopoulos, C.
    Tasoullis, D. K.
    Zhang, Wenyang
    Xia, Yingcun
    Johnstone, Iain M.
    Richardson, Sylvia
    Bottolo, Leonardo
    Kent, John T.
    Adragni, Kofi
    Cook, R. Dennis
    Gather, Ursula
    Guddat, Charlotte
    Greenshtein, Eitan
    James, Gareth M.
    Radchenko, Peter
    Leng, Chenlei
    Wang, Hansheng
    Levina, Elizaveta
    Zhu, Ji
    Li, Runze
    Liu, Yufeng
    Longford, N. T.
    Luo, Weiqi
    Baxter, Paul D.
    Taylor, Charles C.
    Marron, J. S.
    Morris, Jeffrey S.
    Robert, Christian P.
    Yu, Keming
    Zhang, Cun-Hui
    Zhang, Hao Helen
    Zhou, Harrison H.
    Lin, Xihong
    Zou, Hui
    JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 2008, 70 : 883 - 911
  • [26] Nonparametric independence screening for ultra-high dimensional generalized varying coefficient models with longitudinal data
    Zhang, Shen
    Zhao, Peixin
    Li, Gaorong
    Xu, Wangli
    JOURNAL OF MULTIVARIATE ANALYSIS, 2019, 171 : 37 - 52
  • [27] Ultra-high-dimensional feature screening of binary categorical response data based on Jensen-Shannon divergence
    Jiang, Qingqing
    Deng, Guangming
    AIMS MATHEMATICS, 2024, 9 (02): : 2874 - 2907
  • [28] SURE INDEPENDENCE SCREENING IN GENERALIZED LINEAR MODELS WITH NP-DIMENSIONALITY
    Fan, Jianqing
    Song, Rui
    ANNALS OF STATISTICS, 2010, 38 (06): : 3567 - 3604
  • [29] Testing the statistical significance of an ultra-high-dimensional naive Bayes classifier
    An, Baiguo
    Wang, Hansheng
    Guo, Jianhua
    STATISTICS AND ITS INTERFACE, 2013, 6 (02) : 223 - 229
  • [30] Censored mean variance sure independence screening for ultrahigh dimensional survival data
    Zhong, Wei
    Wang, Jiping
    Chen, Xiaolin
    Computational Statistics and Data Analysis, 2021, 159