Using bivariate models to understand between- and within-cluster regression coefficients, with application to twin data

Cited by: 14
Authors
Gurrin, Lyle C. [1]
Carlin, John B.
Sterne, Jonathan A. C.
Dite, Gillian S.
Hopper, John L.
Affiliations
[1] Univ Melbourne, Sch Populat Sci, Ctr Mol Environm Genet & Analyt Epidemiol, Carlton, Vic 3053, Australia
[2] Royal Childrens Hosp, Murdoch Childrens Res Inst, Clin Epidemiol & Biostat Unit, Parkville, Vic 3052, Australia
[3] Univ Bristol, Dept Social Med, Bristol BS8 2PR, Avon, England
Keywords
between-cluster regression coefficient; clustered data; genetic variance components; Markov chain Monte Carlo; mixed models; twins; WinBUGS; within-cluster regression coefficient
DOI
10.1111/j.1541-0420.2006.00561.x
Chinese Library Classification number
Q [Biological Sciences]
Discipline classification codes
07; 0710; 09
Abstract
In the regression analysis of clustered data it is important to allow for the possibility of distinct between- and within-cluster exposure effects on the outcome measure, represented, respectively, by regression coefficients for the cluster mean and the deviation of the individual-level exposure value from this mean. In twin data, the within-pair regression effect represents association conditional on exposures shared within pairs, including any common genetic or environmental influences on the outcome measure. It has therefore been proposed that a comparison of the within-pair regression effects between monozygous (MZ) and dizygous (DZ) twins can be used to examine whether the association between exposure and outcome has a genetic origin. We address this issue by proposing a bivariate model for exposure and outcome measurements in twin-pair data. The between- and within-pair regression coefficients are shown to be weighted averages of ratios of the exposure and outcome variances and covariances, from which it is straightforward to determine the conditions under which the within-pair regression effect in MZ pairs will be different from that in DZ pairs. In particular, we show that a correlation structure in twin pairs for exposure and outcome that appears to be due to genetic factors will not necessarily be reflected in distinct MZ and DZ values for the within-pair regression coefficients. We illustrate these results in a study of female twin pairs from Australia and North America relating mammographic breast density to weight and body mass index.
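The between/within decomposition described in the abstract can be illustrated with a minimal sketch. This is not the authors' bivariate model or their WinBUGS analysis: it is an ordinary least-squares fit on simulated (not real) twin-pair data, and the function name, simulation parameters, and variable names are all assumptions made for illustration. It regresses the outcome on the pair mean of the exposure and on each twin's deviation from that mean, then checks that, for complete pairs, the two coefficients equal simple ratios of sample covariances to variances.

```python
import numpy as np


def between_within_coefficients(x, y):
    """Fit y ~ 1 + pair_mean(x) + (x - pair_mean(x)) by least squares.

    x, y: arrays of shape (n_pairs, 2), one row per twin pair.
    Returns (beta_between, beta_within). Hypothetical helper, for illustration.
    """
    x = np.asarray(x, dtype=float)
    y = np.asarray(y, dtype=float)
    pair_mean = x.mean(axis=1, keepdims=True)   # cluster (pair) mean of exposure
    deviation = x - pair_mean                   # individual deviation from pair mean
    design = np.column_stack([
        np.ones(x.size),
        np.broadcast_to(pair_mean, x.shape).ravel(),
        deviation.ravel(),
    ])
    coef, *_ = np.linalg.lstsq(design, y.ravel(), rcond=None)
    return coef[1], coef[2]


# Simulated data with arbitrary, assumed parameters (not from the paper).
rng = np.random.default_rng(0)
n_pairs = 500
shared = rng.normal(size=(n_pairs, 1))          # factor shared within a pair
x = shared + rng.normal(size=(n_pairs, 2))
y = 0.8 * shared + 0.3 * (x - shared) + rng.normal(size=(n_pairs, 2))

beta_b, beta_w = between_within_coefficients(x, y)

# Within-pair coefficient as a ratio based on within-pair differences.
dx = x[:, 0] - x[:, 1]
dy = y[:, 0] - y[:, 1]
beta_w_from_differences = np.sum(dx * dy) / np.sum(dx ** 2)

# Between-pair coefficient as a ratio based on pair means.
x_bar, y_bar = x.mean(axis=1), y.mean(axis=1)
beta_b_from_means = np.cov(x_bar, y_bar)[0, 1] / np.var(x_bar, ddof=1)

print(beta_b, beta_b_from_means)
print(beta_w, beta_w_from_differences)
```

Because the deviation term sums to zero within every pair, it is orthogonal in the sample to both the intercept and the pair mean, which is why each fitted coefficient reduces to a single covariance-to-variance ratio here. Comparing the within-pair coefficient between MZ and DZ pairs would amount to running the same fit separately on each zygosity group.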
Pages: 745-751
Page count: 7