Semiparametric Bayesian analysis of gene-environment interactions with error in measurement of environmental covariates and missing genetic data

被引:0
|
作者
Lobach, Iryna [1 ]
Mallick, Bani [2 ]
Carroll, Raymond J. [2 ]
机构
[1] NYU, Sch Med, Div Biostat, New York, NY 10016 USA
[2] Texas A&M Univ, Dept Stat, College Stn, TX 77840 USA
基金
美国国家科学基金会;
关键词
Bayesian inference; Errors in variables; Gene-environment interactions; Markov Chain Monte Carlo sampling; Missing data; Pseudo-likelihood; Semiparametric methods;
D O I
暂无
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Case-control studies are widely used to detect gene-environment interactions in the etiology of complex diseases. Many variables that are of interest to biomedical researchers are difficult to measure on an individual level, e. g. nutrient intake, cigarette smoking exposure, long-term toxic exposure. Measurement error causes bias in parameter estimates, thus masking key features of data and leading to loss of power and spurious/masked associations. We develop a Bayesian methodology for analysis of case-control studies for the case when measurement error is present in an environmental covariate and the genetic variable has missing data. This approach offers several advantages. It allows prior information to enter the model to make estimation and inference more precise. The environmental covariates measured exactly are modeled completely nonparametrically. Further, information about the probability of disease can be incorporated in the estimation procedure to improve quality of parameter estimates, what cannot be done in conventional case-control studies. A unique feature of the procedure under investigation is that the analysis is based on a pseudo-likelihood function therefore conventional Bayesian techniques may not be technically correct. We propose an approach using Markov Chain Monte Carlo sampling as well as a computationally simple method based on an asymptotic posterior distribution. Simulation experiments demonstrated that our method produced parameter estimates that are nearly unbiased even for small sample sizes. An application of our method is illustrated using a population-based case-control study of the association between calcium intake with the risk of colorectal adenoma development.
引用
收藏
页码:305 / 315
页数:11
相关论文
共 50 条
  • [1] Semiparametric Bayesian variable selection for gene-environment interactions
    Ren, Jie
    Zhou, Fei
    Li, Xiaoxi
    Chen, Qi
    Zhang, Hongmei
    Ma, Shuangge
    Jiang, Yu
    Wu, Cen
    [J]. STATISTICS IN MEDICINE, 2020, 39 (05) : 617 - 638
  • [2] A general framework for studying genetic effects and gene-environment interactions with missing data
    Hu, Y. J.
    Lin, D. Y.
    Zeng, D.
    [J]. BIOSTATISTICS, 2010, 11 (04) : 583 - 598
  • [3] Semiparametric Bayesian analysis of case-control data under conditional gene-environment independence
    Mukherjee, Bhramar
    Zhang, Li
    Ghosh, Malay
    Sinha, Samiran
    [J]. BIOMETRICS, 2007, 63 (03) : 834 - 844
  • [4] Finding the missing gene-environment interactions
    Kraft, Peter
    Aschard, Hugues
    [J]. EUROPEAN JOURNAL OF EPIDEMIOLOGY, 2015, 30 (05) : 353 - 355
  • [5] Genotype-based Bayesian Analysis of Gene-Environment Interactions with Multiple Genetic Markers and Misclassification in Environmental Factors
    Lobach, Iryna
    Fan, Ruzong
    [J]. GENETIC EPIDEMIOLOGY, 2012, 36 (07) : 764 - 764
  • [6] Genotype-Based Bayesian Analysis of Gene-Environment Interactions with Multiple Genetic Markers and Misclassification in Environmental Factors
    Lobach, Iryna
    Ruzong Fan
    [J]. JOURNAL OF PROBABILITY AND STATISTICS, 2012, 2012
  • [7] Estimation of magnitude in gene-environment interactions in the presence of measurement error
    Wong, MY
    Day, NE
    Luan, JA
    Wareham, NJ
    [J]. STATISTICS IN MEDICINE, 2004, 23 (06) : 987 - 998
  • [8] BAYESIAN SEMIPARAMETRIC ANALYSIS FOR TWO-PHASE STUDIES OF GENE-ENVIRONMENT INTERACTION
    Ahn, Jaeil
    Mukherjee, Bhramar
    Gruber, Stephen B.
    Ghosh, Malay
    [J]. ANNALS OF APPLIED STATISTICS, 2013, 7 (01): : 543 - 569
  • [9] A penalized robust semiparametric approach for gene-environment interactions
    Wu, Cen
    Shi, Xingjie
    Cui, Yuehua
    Ma, Shuangge
    [J]. STATISTICS IN MEDICINE, 2015, 34 (30) : 4016 - 4030
  • [10] A Bayesian semiparametric accelerated failure time model for arbitrarily censored data with covariates subject to measurement error
    Lin, Xiaoyan
    [J]. COMMUNICATIONS IN STATISTICS-SIMULATION AND COMPUTATION, 2017, 46 (01) : 747 - 756