Principled sure independence screening for Cox models with ultra-high-dimensional covariates

被引:162
|
作者
Zhao, Sihai Dave [1 ]
Li, Yi [1 ]
机构
[1] Harvard Univ, Sch Publ Hlth, Dept Biostat, Boston, MA 02115 USA
关键词
Cox model; Multiple myeloma; Sure independence screening; Ultra-high-dimensional covariates; Variable selection; NONCONCAVE PENALIZED LIKELIHOOD; FALSE DISCOVERY RATE; VARIABLE SELECTION; GENE-EXPRESSION; MULTIPLE-MYELOMA; ADAPTIVE LASSO; REGRESSION; SHRINKAGE;
D O I
10.1016/j.jmva.2011.08.002
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
It is rather challenging for current variable selectors to handle situations where the number of covariates under consideration is ultra-high. Consider a motivating clinical trial of the drug bortezomib for the treatment of multiple myeloma, where overall survival and expression levels of 44760 probesets were measured for each of 80 patients with the goal of identifying genes that predict survival after treatment. This dataset defies analysis even with regularized regression. Some remedies have been proposed for the linear model and for generalized linear models, but there are few solutions in the survival setting and, to our knowledge, no theoretical support. Furthermore, existing strategies often involve tuning parameters that are difficult to interpret. In this paper, we propose and theoretically justify a principled method for reducing dimensionality in the analysis of censored data by selecting only the important covariates. Our procedure involves a tuning parameter that has a simple interpretation as the desired false positive rate of this selection. We present simulation results and apply the proposed procedure to analyze the aforementioned myeloma study. (C) 2011 Elsevier Inc. All rights reserved.
引用
下载
收藏
页码:397 / 411
页数:15
相关论文
共 50 条
  • [31] Censored mean variance sure independence screening for ultrahigh dimensional survival data
    Zhong, Wei
    Wang, Jiping
    Chen, Xiaolin
    COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2021, 159
  • [32] Robust Sure Independence Screening for Ultrahigh Dimensional Non-normal Data
    Zhong, Wei
    ACTA MATHEMATICA SINICA-ENGLISH SERIES, 2014, 30 (11) : 1885 - 1896
  • [33] Robust Sure Independence Screening for Ultrahigh Dimensional Non-normal Data
    Wei ZHONG
    Acta Mathematica Sinica,English Series, 2014, 30 (11) : 1885 - 1896
  • [34] Feasibility of ultra-high-dimensional flow imaging for rapid pediatric cardiopulmonary MRI
    Joseph Y Cheng
    Tao Zhang
    John M Pauly
    Shreyas Vasanawala
    Journal of Cardiovascular Magnetic Resonance, 18 (Suppl 1)
  • [35] Ultra-high-dimensional multi-level optimisation strategies for electrical machines
    Liu, Chengcheng
    Zhang, Shiwei
    Zhang, Hongming
    Wang, Youhua
    Liu, Lin
    IET ELECTRIC POWER APPLICATIONS, 2024, : 1507 - 1517
  • [36] Robust sure independence screening for ultrahigh dimensional non-normal data
    Wei Zhong
    Acta Mathematica Sinica, English Series, 2014, 30 : 1885 - 1896
  • [37] SCAD-penalized regression in additive partially linear proportional hazards models with an ultra-high-dimensional linear part
    Lian, Heng
    Li, Jianbo
    Tang, Xingyu
    JOURNAL OF MULTIVARIATE ANALYSIS, 2014, 125 : 50 - 64
  • [38] Single-index composite quantile regression for ultra-high-dimensional data
    Rong Jiang
    Mengxian Sun
    TEST, 2022, 31 : 443 - 460
  • [39] Single-index composite quantile regression for ultra-high-dimensional data
    Jiang, Rong
    Sun, Mengxian
    TEST, 2022, 31 (02) : 443 - 460
  • [40] PASE: PostgreSQL Ultra-High-Dimensional Approximate Nearest Neighbor Search Extension
    Yang, Wen
    Li, Tao
    Fang, Gai
    Wei, Hong
    SIGMOD'20: PROCEEDINGS OF THE 2020 ACM SIGMOD INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA, 2020, : 2241 - 2253