Participant identification in genetic association studies: improved methods and practical implications

被引:13
|
作者
Masca, Nicholas [1 ]
Burton, Paul R. [1 ]
Sheehan, Nuala A. [1 ]
机构
[1] Univ Leicester, Dept Hlth Sci, Leicester LE1 7RH, Leics, England
基金
英国惠康基金; 英国医学研究理事会;
关键词
Identification; linear regression; generalized estimating equations; linkage disequilibrium; case-control genetic association studies; GENOME-WIDE ASSOCIATION; ESTIMATING EQUATIONS; LONGITUDINAL DATA; POPULATIONS; PRIVACY; MIXTURE; DISEASE; MODELS;
D O I
10.1093/ije/dyr149
中图分类号
R1 [预防医学、卫生学];
学科分类号
1004 ; 120402 ;
摘要
Background In a recent paper by Homer et al. (Resolving individuals contributing trace amounts of DNA to highly complex mixtures using high-density SNP genotyping microarrays. PLoS Genet 2008; 4: e1000167), a method for detecting whether a given individual is a contributor to a particular genomic mixture was proposed. This prompted grave concern about the public dissemination of aggregate statistics from genome-wide association studies. It is of clear scientific importance that such data be shared widely, but the confidentiality of study participants must not be compromised. The issue of what summary genomic data can safely be posted on the web is only addressed satisfactorily when the theoretical underpinnings of the proposed method are clarified and its performance evaluated in terms of dependence on underlying assumptions. Methods The original method raised a number of concerns and several alternatives have since been proposed, including a simple linear regression approach. In our proposed generalized estimating equation approach, we maintain the simplicity of the linear regression model but obtain inferences that are more robust to approximation of the variance/covariance structure and can accommodate linkage disequilibrium. Results We affirm that, in principle, it is possible to determine that a 'candidate' individual has participated in a study, given a subset of aggregate statistics from that study. However, the methods depend critically on a number of key factors including: the ancestry of participants in the study; the absolute and relative numbers of cases and controls; and the number of single nucleotide polymorphisms. Conclusions Simple guidelines for publication that are based on a single criterion are therefore unlikely to suffice. In particular, 'directed' summary statistics should not be posted openly on the web but could be protected by an internet-based access check as proposed by the P3G_Consortium et al. (Public access to genome-wide data: five views on balancing research with privacy and protection. PLoS Genet 2009;5:e1000665).
引用
收藏
页码:1629 / 1642
页数:14
相关论文
共 50 条
  • [41] Practical implications for genetic modeling in the genomics era
    VanRaden, P. M.
    [J]. JOURNAL OF DAIRY SCIENCE, 2016, 99 (03) : 2405 - 2412
  • [42] SecureMA: protecting participant privacy in genetic association meta-analysis
    Xie, Wei
    Kantarcioglu, Murat
    Bush, William S.
    Crawford, Dana
    Denny, Joshua C.
    Heatherly, Raymond
    Malin, Bradley A.
    [J]. BIOINFORMATICS, 2014, 30 (23) : 3334 - 3341
  • [43] Adjusting for collider bias in genetic association studies using instrumental variable methods
    Cai, Siyang
    Hartley, April
    Mahmoud, Osama
    Tilling, Kate
    Dudbridge, Frank
    [J]. GENETIC EPIDEMIOLOGY, 2022, 46 (5-6) : 303 - 316
  • [44] Methods for meta-analysis in genetic association studies: a review of their potential and pitfalls
    Fotini K. Kavvoura
    John P. A. Ioannidis
    [J]. Human Genetics, 2008, 123 : 1 - 14
  • [45] Methods for meta-analysis in genetic association studies: a review of their potential and pitfalls
    Kavvoura, Fotini K.
    Ioannidis, John P. A.
    [J]. HUMAN GENETICS, 2008, 123 (01) : 1 - 14
  • [46] Implications of association studies in asthma
    Martinez, FD
    [J]. CLINICAL AND EXPERIMENTAL ALLERGY, 1998, 28 : 93 - 94
  • [47] Practical methods of gender identification in Kashmir geese
    Hamadani, Henna
    Khan, Azmat Alam
    Hamadani, Ambreen
    Rafiq, Andleeb
    [J]. INDIAN JOURNAL OF ANIMAL SCIENCES, 2017, 87 (05): : 653 - 655
  • [48] Practical Criteria and Methods for the Identification of Hemolytic Streptococci
    Coffey, Julia M.
    [J]. AMERICAN JOURNAL OF PUBLIC HEALTH AND THE NATIONS HEALTH, 1936, 26 (05): : 521 - 522
  • [49] Reliability of self-reported ancestry among siblings: Implications for genetic association studies
    Burnett, MS
    Strain, KJ
    Lesnick, TG
    de Andrade, M
    Rocca, WA
    Maraganore, DM
    [J]. AMERICAN JOURNAL OF EPIDEMIOLOGY, 2006, 163 (05) : 486 - 492
  • [50] In defence of genetic association studies
    Burgner, D
    Hull, J
    [J]. LANCET, 2000, 356 (9229): : 599 - 599