Impact of the inaccessible genome on genotype imputation and genome-wide association studies

被引:0
|
作者
Koenig, Eva
Mitchell, Jonathan Stewart
Filosi, Michele
Fuchsberger, Christian [1 ,2 ]
机构
[1] Eurac Res, Inst Biomed, Via Volta 21, I-39100 Bolzano, Italy
[2] Univ Lubeck, Lubeck, Germany
关键词
NGS; GWAS; accessibility; genotyping chips; web tool;
D O I
10.1093/hmg/ddae062
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Genotype imputation is widely used in genome-wide association studies (GWAS). However, both the genotyping chips and imputation reference panels are dependent on next-generation sequencing (NGS). Due to the nature of NGS, some regions of the genome are inaccessible to sequencing. To date, there has been no complete evaluation of these regions and their impact on the identification of associations in GWAS remains unclear. In this study, we systematically assess the extent to which variants in inaccessible regions are underrepresented on genotyping chips and imputation reference panels, in GWAS results and in variant databases. We also determine the proportion of genes located in inaccessible regions and compare the results across variant masks defined by the 1000 Genomes Project and the TOPMed program. Overall, fewer variants were observed in inaccessible regions in all categories analyzed. Depending on the mask used and normalized for region size, only 4%-17% of the genotyped variants are located in inaccessible regions and 52 to 581 genes were almost completely inaccessible. From the Cooperative Health Research in South Tyrol (CHRIS) study, we present a case study of an association located in an inaccessible region that is driven by genotyped variants and cannot be reproduced by imputation in GRCh37. We conclude that genotyping, NGS, genotype imputation and downstream analyses such as GWAS and fine mapping are systematically biased in inaccessible regions, due to missed variants and spurious associations. To help researchers assess gene and variant accessibility, we provide an online application (https://gab.gm.eurac.edu).
引用
收藏
页码:1207 / 1214
页数:8
相关论文
共 50 条
  • [31] Replication in Genome-Wide Association Studies
    Kraft, Peter
    Zeggini, Eleftheria
    Ioannidis, John P. A.
    STATISTICAL SCIENCE, 2009, 24 (04) : 561 - 573
  • [32] Genome-wide association studies: a primer
    Corvin, A.
    Craddock, N.
    Sullivan, P. F.
    PSYCHOLOGICAL MEDICINE, 2010, 40 (07) : 1063 - 1077
  • [33] Genome-wide association studies with metabolomics
    Adamski, Jerzy
    GENOME MEDICINE, 2012, 4
  • [34] Genome-Wide Association Studies and Diet
    Ferguson, Lynnette R.
    JOURNAL OF NUTRIGENETICS AND NUTRIGENOMICS, 2010, 3 (4-6) : 144 - 150
  • [35] Guidelines for Genome-Wide Association Studies
    Barsh, Gregory S.
    Copenhaver, Gregory P.
    Gibson, Greg
    Williams, Scott M.
    PLOS GENETICS, 2012, 8 (07):
  • [36] Genome-wide association studies in neurology
    Tan, Meng-Shan
    Jiang, Teng
    Tan, Lan
    Yu, Jin-Tai
    ANNALS OF TRANSLATIONAL MEDICINE, 2014, 2 (12)
  • [37] Genome-Wide Association Studies and Diet
    Ferguson, Lynnette R.
    JOURNAL OF NUTRIGENETICS AND NUTRIGENOMICS, 2009, 2 (4-5) : 191 - 191
  • [38] The road to genome-wide association studies
    Kruglyak, Leonid
    NATURE REVIEWS GENETICS, 2008, 9 (04) : 314 - 318
  • [39] Genome-wide association studies in ADHD
    Barbara Franke
    Benjamin M. Neale
    Stephen V. Faraone
    Human Genetics, 2009, 126 : 13 - 50
  • [40] Genome-Wide Association Studies of Cancer
    Stadler, Zsofia K.
    Thom, Peter
    Robson, Mark E.
    Weitzel, Jeffrey N.
    Kauff, Noah D.
    Hurley, Karen E.
    Devlin, Vincent
    Gold, Bert
    Klein, Robert J.
    Offit, Kenneth
    JOURNAL OF CLINICAL ONCOLOGY, 2010, 28 (27) : 4255 - 4267