Bias due to participant overlap in two-sample Mendelian randomization

Cited by: 914
Authors
Burgess, Stephen [1]
Davies, Neil M. [2, 3]
Thompson, Simon G. [1]
Affiliations
[1] Univ Cambridge, Dept Publ Hlth & Primary Care, Cambridge, England
[2] Univ Bristol, MRC Integrat Epidemiol Unit, Bristol, Avon, England
[3] Univ Bristol, Sch Social & Community Med, Bristol, Avon, England
Funding
Medical Research Council (UK); European Research Council; Wellcome Trust (UK);
Keywords
aggregated data; instrumental variables; Mendelian randomization; summarized data; weak instrument bias; INSTRUMENTAL VARIABLES ESTIMATION; WEAK INSTRUMENTS; GENETIC-VARIANTS; REGRESSION; IDENTIFICATION; INSIGHTS; IV;
DOI
10.1002/gepi.21998
Chinese Library Classification
Q3 [Genetics];
Discipline classification codes
071007; 090102;
Abstract
Mendelian randomization analyses are often performed using summarized data. The causal estimate from a one-sample analysis (in which data are taken from a single data source) with weak instrumental variables is biased in the direction of the observational association between the risk factor and outcome, whereas the estimate from a two-sample analysis (in which data on the risk factor and outcome are taken from non-overlapping datasets) is less biased and any bias is in the direction of the null. When using genetic consortia that have partially overlapping sets of participants, the direction and extent of bias are uncertain. In this paper, we perform simulation studies to investigate the magnitude of bias and Type 1 error rate inflation arising from sample overlap. We consider both a continuous outcome and a case-control setting with a binary outcome. For a continuous outcome, bias due to sample overlap is a linear function of the proportion of overlap between the samples. So, in the case of a null causal effect, if the relative bias of the one-sample instrumental variable estimate is 10% (corresponding to an F parameter of 10), then the relative bias with 50% sample overlap is 5%, and with 30% sample overlap is 3%. In a case-control setting, if risk factor measurements are only included for the control participants, unbiased estimates are obtained even in a one-sample setting. However, if risk factor data on both control and case participants are used, then bias is similar with a binary outcome as with a continuous outcome. Consortia releasing publicly available data on the associations of genetic variants with continuous risk factors should provide estimates that exclude case participants from case-control samples.
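The linear relationship described in the abstract can be illustrated with a minimal sketch. It assumes a null causal effect and a continuous outcome, and it approximates the one-sample relative bias as 1/F, which matches the abstract's example (10% relative bias at an F parameter of 10). The function name and its arguments are hypothetical and are not taken from the paper.

```python
def relative_bias_with_overlap(f_parameter: float, overlap: float) -> float:
    """Approximate relative bias of the IV estimate under partial sample overlap.

    Assumptions (illustrative, not the paper's exact formula):
      - null causal effect, continuous outcome;
      - one-sample relative bias is approximated by 1 / F;
      - bias scales linearly with the proportion of overlapping participants.
    """
    if f_parameter <= 0:
        raise ValueError("f_parameter must be positive")
    if not 0.0 <= overlap <= 1.0:
        raise ValueError("overlap must be a proportion between 0 and 1")

    # Relative bias when both samples are the same (100% overlap).
    one_sample_relative_bias = 1.0 / f_parameter

    # Linear scaling by the proportion of overlap between the two samples.
    return overlap * one_sample_relative_bias


if __name__ == "__main__":
    # Reproduce the overlap proportions mentioned in the abstract.
    for overlap in (1.0, 0.5, 0.3):
        bias = relative_bias_with_overlap(f_parameter=10, overlap=overlap)
        print(f"overlap={overlap:.0%} -> relative bias={bias:.0%}")
```

With an F parameter of 10, this gives roughly 5% relative bias at 50% overlap and 3% at 30% overlap, in line with the figures quoted above.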
Pages: 597-608
Number of pages: 12