Principled sure independence screening for Cox models with ultra-high-dimensional covariates

被引:162
|
作者
Zhao, Sihai Dave [1 ]
Li, Yi [1 ]
机构
[1] Harvard Univ, Sch Publ Hlth, Dept Biostat, Boston, MA 02115 USA
关键词
Cox model; Multiple myeloma; Sure independence screening; Ultra-high-dimensional covariates; Variable selection; NONCONCAVE PENALIZED LIKELIHOOD; FALSE DISCOVERY RATE; VARIABLE SELECTION; GENE-EXPRESSION; MULTIPLE-MYELOMA; ADAPTIVE LASSO; REGRESSION; SHRINKAGE;
D O I
10.1016/j.jmva.2011.08.002
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
It is rather challenging for current variable selectors to handle situations where the number of covariates under consideration is ultra-high. Consider a motivating clinical trial of the drug bortezomib for the treatment of multiple myeloma, where overall survival and expression levels of 44760 probesets were measured for each of 80 patients with the goal of identifying genes that predict survival after treatment. This dataset defies analysis even with regularized regression. Some remedies have been proposed for the linear model and for generalized linear models, but there are few solutions in the survival setting and, to our knowledge, no theoretical support. Furthermore, existing strategies often involve tuning parameters that are difficult to interpret. In this paper, we propose and theoretically justify a principled method for reducing dimensionality in the analysis of censored data by selecting only the important covariates. Our procedure involves a tuning parameter that has a simple interpretation as the desired false positive rate of this selection. We present simulation results and apply the proposed procedure to analyze the aforementioned myeloma study. (C) 2011 Elsevier Inc. All rights reserved.
引用
下载
收藏
页码:397 / 411
页数:15
相关论文
共 50 条
  • [41] NONPARAMETRIC INDEPENDENCE SCREENING AND STRUCTURE IDENTIFICATION FOR ULTRA-HIGH DIMENSIONAL LONGITUDINAL DATA
    Cheng, Ming-Yen
    Honda, Toshio
    Li, Jialiang
    Peng, Heng
    ANNALS OF STATISTICS, 2014, 42 (05): : 1819 - 1849
  • [42] Feature Screening in High Dimensional Regression with Endogenous Covariates
    Hu, Qinqin
    Lin, Lu
    COMPUTATIONAL ECONOMICS, 2022, 60 (03) : 949 - 969
  • [43] Feature Screening in High Dimensional Regression with Endogenous Covariates
    Qinqin Hu
    Lu Lin
    Computational Economics, 2022, 60 : 949 - 969
  • [44] Fast Sparse Modeling Technology Opening Up the Future with Ultra-high-dimensional Data
    Ida Y.
    NTT Technical Review, 2023, 21 (06): : 12 - 16
  • [45] Uniform joint screening for ultra-high dimensional graphical models
    Zheng, Zemin
    Shi, Haiyu
    Li, Yang
    Yuan, Hui
    JOURNAL OF MULTIVARIATE ANALYSIS, 2020, 179
  • [46] Ultrahigh-Dimensional Multiclass Linear Discriminant Analysis by Pairwise Sure Independence Screening
    Pan, Rui
    Wang, Hansheng
    Li, Runze
    JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2016, 111 (513) : 169 - 179
  • [47] Sure Joint Screening for High Dimensional Cox's Proportional Hazards Model Under the Case-Cohort Design
    Liu, Yi
    Li, Gang
    JOURNAL OF COMPUTATIONAL BIOLOGY, 2023, 30 (06) : 663 - 677
  • [48] Joint feature screening for ultra-high-dimensional sparse additive hazards model by the sparsity-restricted pseudo-score estimator
    Chen, Xiaolin
    Liu, Yi
    Wang, Qihua
    ANNALS OF THE INSTITUTE OF STATISTICAL MATHEMATICS, 2019, 71 (05) : 1007 - 1031
  • [49] Joint feature screening for ultra-high-dimensional sparse additive hazards model by the sparsity-restricted pseudo-score estimator
    Xiaolin Chen
    Yi Liu
    Qihua Wang
    Annals of the Institute of Statistical Mathematics, 2019, 71 : 1007 - 1031
  • [50] Feature Screening for Nonparametric and Semiparametric Models with Ultrahigh-Dimensional Covariates
    Junying Zhang
    Riquan Zhang
    Jiajia Zhang
    Journal of Systems Science and Complexity, 2018, 31 : 1350 - 1361