Effect of case and control definitions on genome-wide association study (GWAS) findings
Cited by: 3
Authors:
Isgut, Monica [1]
Song, Kijoung [2]
Ehm, Margaret G. [2]
Wang, May Dongmei [3]
Davitte, Jonathan [2,4]
Affiliations:
[1] Georgia Inst Technol, Dept Bioinformat, Atlanta, GA USA
[2] GlaxoSmithKline, Dept Human Genet, Collegeville, PA USA
[3] Emory Univ, Georgia Inst Technol, Sch Biomed Engn, Atlanta, GA USA
[4] GlaxoSmithKline, Dept Human Genet, Collegeville, PA 19426 USA
Keywords: genetic correlation; GWAS; selection bias; study design; UK Biobank; BIOBANK
DOI: 10.1002/gepi.22523
Chinese Library Classification: Q3 [Genetics]
Discipline classification codes: 071007; 090102
Abstract:
Genome-wide association studies (GWAS) have significantly advanced our understanding of the genetic underpinnings of diseases, but case and control cohort definitions for a given disease can vary between published studies. For example, two GWAS of the same disease using the UK Biobank data set might use different data sources (e.g., self-reported questionnaires or hospital records) or different levels of granularity (i.e., specificity of inclusion criteria) to define cases and controls. The extent to which this variability in cohort definitions affects the results of a GWAS is unclear. In this study, we systematically evaluated the effect of the data sources used for case and control definitions on GWAS findings. Using the UK Biobank, we selected three diseases: glaucoma, migraine, and iron-deficiency anemia. For each disease, we designed 13 GWAS, each using a different combination of data sources to define cases and controls, and then calculated the pairwise genetic correlations between all GWAS for each disease. We found that the data sources used to define cases for a given disease can have a significant impact on GWAS results, but the extent of this impact depends heavily on the disease in question. This suggests the need for greater scrutiny of how case cohorts are defined for GWAS.
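To give a sense of the scale of the comparison described in the abstract, the short sketch below enumerates the unordered GWAS pairs for which a genetic correlation would be computed. It only counts pairs and does not estimate any genetic correlation. The labels `gwas_01` to `gwas_13` are hypothetical placeholders, since the abstract does not name the 13 case and control definitions.

```python
from itertools import combinations

# Diseases named in the abstract.
diseases = ["glaucoma", "migraine", "iron-deficiency anemia"]

# Placeholder labels (assumption): the abstract does not name the 13 definitions.
gwas_labels = [f"gwas_{i:02d}" for i in range(1, 14)]


def pairwise_comparisons(labels):
    """Return every unordered pair of distinct GWAS labels."""
    return list(combinations(labels, 2))


# One list of pairs per disease; each pair would receive one genetic correlation.
pairs_per_disease = {d: pairwise_comparisons(gwas_labels) for d in diseases}

n_pairs = len(pairs_per_disease["glaucoma"])
total = sum(len(pairs) for pairs in pairs_per_disease.values())
print(n_pairs, total)  # 78 pairs per disease, 234 in total
```

With 13 GWAS per disease there are 13 × 12 / 2 = 78 pairs per disease, or 234 across the three diseases.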