Bayesian Variable Selection for Survival Regression in Genetics

被引:2
|
作者
Tachmazidou, Ioanna [1 ]
Johnson, Michael R. [2 ]
De Iorio, Maria [3 ]
机构
[1] Inst Publ Hlth, Biostat Unit, MRC, Cambridge CB2 0SR, England
[2] Univ London Imperial Coll Sci Technol & Med, Div Expt Med, London, England
[3] Univ London Imperial Coll Sci Technol & Med, Dept Epidemiol & Biostat, London, England
基金
英国惠康基金;
关键词
Cox proportional hazards model; genome-wide association study; Lasso; maximum a posteriori estimate; penalization; prospective cohort design; resequencing; shrinkage; single nucleotide polymorphism;
D O I
10.1002/gepi.20530
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
Variable selection in regression with very big numbers of variables is challenging both in terms of model specification and computation. We focus on genetic studies in the field of survival, and we present a Bayesian-inspired penalized maximum likelihood approach appropriate for high-dimensional problems. In particular, we employ a simple, efficient algorithm that seeks maximum a posteriori (MAP) estimates of regression coefficients. The latter are assigned a Laplace prior with a sharp mode at zero, and non-zero posterior mode estimates correspond to significant single nucleotide polymorphisms (SNPs). Using the Laplace prior reflects a prior belief that only a small proportion of the SNPs significantly influence the response. The method is fast and can handle datasets arising from imputation or resequencing. We demonstrate the localization performance, power and false-positive rates of our method in large simulation studies of dense-SNP datasets and sequence data, and we compare the performance of our method to the univariate Cox regression and to a recently proposed stochastic search approach. In general, we find that our approach improves localization and power slightly, while the biggest advantage is in false-positive counts and computing times. We also apply our method to a real prospective study, and we observe potential association between candidate ABC transporter genes and epilepsy treatment outcomes. Genet. Epidemiol. 34:689-701, 2010. (C) 2010 Wiley-Liss, Inc.
引用
收藏
页码:689 / 701
页数:13
相关论文
共 50 条
  • [41] Bayesian variable selection for the Cox regression model with missing covariates
    Joseph G. Ibrahim
    Ming-Hui Chen
    Sungduk Kim
    [J]. Lifetime Data Analysis, 2008, 14 : 496 - 520
  • [42] A novel Bayesian approach for variable selection in linear regression models
    Posch, Konstantin
    Arbeiter, Maximilian
    Pilz, Juergen
    [J]. COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2020, 144
  • [43] Bayesian variable selection for the Cox regression model with missing covariates
    Ibrahim, Joseph G.
    Chen, Ming-Hui
    Kim, Sungduk
    [J]. LIFETIME DATA ANALYSIS, 2008, 14 (04) : 496 - 520
  • [44] Prior elicitation, variable selection and Bayesian computation for logistic regression models
    Chen, MH
    Ibrahim, JG
    Yiannoutsos, C
    [J]. JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 1999, 61 : 223 - 242
  • [45] Bayesian sparse multiple regression for simultaneous rank reduction and variable selection
    Chakraborty, Antik
    Bhattacharya, Anirban
    Mallick, Bani K.
    [J]. BIOMETRIKA, 2020, 107 (01) : 205 - 221
  • [46] Bayesian Variable Selection in Linear Regression in One Pass for Large Datasets
    Ordonez, Carlos
    Garcia-Alvarado, Carlos
    Baladandayuthapani, Veerabhadaran
    [J]. ACM TRANSACTIONS ON KNOWLEDGE DISCOVERY FROM DATA, 2014, 9 (01)
  • [47] Bayesian latent factor regression for multivariate functional data with variable selection
    Noh, Heesang
    Choi, Taeryon
    Park, Jinsu
    Chung, Yeonseung
    [J]. JOURNAL OF THE KOREAN STATISTICAL SOCIETY, 2020, 49 (03) : 901 - 923
  • [48] Bayesian latent factor regression for multivariate functional data with variable selection
    Heesang Noh
    Taeryon Choi
    Jinsu Park
    Yeonseung Chung
    [J]. Journal of the Korean Statistical Society, 2020, 49 : 901 - 923
  • [49] On the consistency of Bayesian variable selection for high dimensional binary regression and classification
    Jiang, Wenxin
    [J]. NEURAL COMPUTATION, 2006, 18 (11) : 2762 - 2776
  • [50] A novel Bayesian method for variable selection and estimation in binary quantile regression
    Dao, Mai
    Wang, Min
    Ghosh, Souparno
    [J]. STATISTICAL ANALYSIS AND DATA MINING, 2022, 15 (06) : 766 - 780