Variable selection for high-dimensional genomic data with censored outcomes using group lasso prior

被引:7
|
作者
Lee, Kyu Ha [1 ,2 ]
Chakraborty, Sounak [3 ]
Sun, Jianguo [3 ]
机构
[1] Forsyth Inst, Epidemiol & Biostat Core, Cambridge, MA USA
[2] Harvard Sch Dent Med, Dept Oral Hlth Policy & Epidemiol, Boston, MA USA
[3] Univ Missouri, Dept Stat, Columbia, MO 65211 USA
基金
美国国家科学基金会;
关键词
Accelerated failure time model; Bayesian lasso; Gibbs sampler; Group lasso; Penalized regression; FAILURE TIME MODEL; MICROARRAY DATA; SURVIVAL ANALYSIS; HAZARD RATIOS; ELASTIC NET; COX MODEL; REGRESSION; PREDICTION; SHRINKAGE;
D O I
10.1016/j.csda.2017.02.014
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
The variable selection problem is discussed in the context of high-dimensional failure time data arising from the accelerated failure time model. A data augmentation approach is employed in order to deal with censored survival times and to facilitate prior-posterior conjugacy. To identify a set of grouped relevant covariates, a shrinkage prior distribution is specified for regression coefficients mimicking the effect of group lasso penalty. It is noted that unlike the corresponding frequentist method, a Bayesian penalized regression approach cannot shrink the estimates of coefficients to exact zeros in general. Towards resolving the issue, a two-stage thresholding method that exploits the scaled neighbor-hood criterion and the Bayesian information criterion is devised. Simulation studies are performed to assess the robustness and performance of the proposed method in terms of variable selection accuracy and predictive power. The method is successfully applied to a set of microarray data on the individuals diagnosed with diffuse large B-cell lymphoma. In addition, an R package called psbcGroup, which can be downloaded freely from CRAN, is developed for the implementation of the methods. (C) 2017 Elsevier B.V. All rights reserved.
引用
收藏
页码:1 / 13
页数:13
相关论文
共 50 条
  • [41] Variable selection for longitudinal data with high-dimensional covariates and dropouts
    Zheng, Xueying
    Fu, Bo
    Zhang, Jiajia
    Qin, Guoyou
    JOURNAL OF STATISTICAL COMPUTATION AND SIMULATION, 2018, 88 (04) : 712 - 725
  • [42] Stochastic variational variable selection for high-dimensional microbiome data
    Tung Dang
    Kie Kumaishi
    Erika Usui
    Shungo Kobori
    Takumi Sato
    Yusuke Toda
    Yuji Yamasaki
    Hisashi Tsujimoto
    Yasunori Ichihashi
    Hiroyoshi Iwata
    Microbiome, 10
  • [43] Robust network-based regularization and variable selection for high-dimensional genomic data in cancer prognosis
    Ren, Jie
    Du, Yinhao
    Li, Shaoyu
    Ma, Shuangge
    Jiang, Yu
    Wu, Cen
    GENETIC EPIDEMIOLOGY, 2019, 43 (03) : 276 - 291
  • [44] Group Lasso Estimation of High-dimensional Covariance Matrices
    Bigot, Jeremie
    Biscay, Rolando J.
    Loubes, Jean-Michel
    Muniz-Alvarez, Lilian
    JOURNAL OF MACHINE LEARNING RESEARCH, 2011, 12 : 3187 - 3225
  • [45] A note on consistency of Bayesian high-dimensional variable selection under a default prior
    Hua, Min
    Goh, Gyuhyeong
    STAT, 2020, 9 (01):
  • [46] Censored linear model in high dimensionsPenalised linear regression on high-dimensional data with left-censored response variable
    Patric Müller
    Sara van de Geer
    TEST, 2016, 25 : 75 - 92
  • [47] Lasso regularization for left-censored Gaussian outcome and high-dimensional predictors
    Soret, Perrine
    Avalos, Marta
    Wittkop, Linda
    Commenges, Daniel
    Thiebaut, Rodolphe
    BMC MEDICAL RESEARCH METHODOLOGY, 2018, 18
  • [48] Lasso regularization for left-censored Gaussian outcome and high-dimensional predictors
    Perrine Soret
    Marta Avalos
    Linda Wittkop
    Daniel Commenges
    Rodolphe Thiébaut
    BMC Medical Research Methodology, 18
  • [49] Convergence and sparsity of Lasso and group Lasso in high-dimensional generalized linear models
    Lichun Wang
    Yuan You
    Heng Lian
    Statistical Papers, 2015, 56 : 819 - 828
  • [50] Convergence and sparsity of Lasso and group Lasso in high-dimensional generalized linear models
    Wang, Lichun
    You, Yuan
    Lian, Heng
    STATISTICAL PAPERS, 2015, 56 (03) : 819 - 828