A reference haplotype panel for genome-wide imputation of short tandem repeats

被引:46
|
作者
Saini, Shubham [1 ]
Mitra, Ileena [2 ]
Mousavi, Nima [3 ]
Fotsing, Stephanie Feupe [2 ,4 ]
Gymrek, Melissa [1 ,5 ]
机构
[1] Univ Calif San Diego, Dept Comp Sci & Engn, 9500 Gilman Dr, La Jolla, CA 92093 USA
[2] Univ Calif San Diego, Bioinformat & Syst Biol Program, 9500 Gilman Dr, La Jolla, CA 92093 USA
[3] Univ Calif San Diego, Dept Elect & Comp Engn, 9500 Gilman Dr, La Jolla, CA 92093 USA
[4] Univ Calif San Diego, Dept Biomed Informat, 9500 Gilman Dr, La Jolla, CA 92093 USA
[5] Univ Calif San Diego, Dept Med, 9500 Gilman Dr, La Jolla, CA 92093 USA
基金
美国国家卫生研究院; 美国国家科学基金会;
关键词
GENE-EXPRESSION VARIATION; LINKAGE DISEQUILIBRIUM; DNA METHYLATION; CAG REPEAT; EXPANSION; MICROSATELLITE; VARIANTS; MUTATION; DISEASE; ASSOCIATION;
D O I
10.1038/s41467-018-06694-0
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Short tandem repeats (STRs) are involved in dozens of Mendelian disorders and have been implicated in complex traits. However, genotyping arrays used in genome-wide association studies focus on single nucleotide polymorphisms (SNPs) and do not readily allow identification of STR associations. We leverage next-generation sequencing (NGS) from 479 families to create a SNP + STR reference haplotype panel. Our panel enables imputing STR genotypes into SNP array data when NGS is not available for directly genotyping STRs. Imputed genotypes achieve mean concordance of 97% with observed genotypes in an external dataset compared to 71% expected under a naive model. Performance varies widely across STRs, with near perfect concordance at bi-allelic STRs vs. 70% at highly polymorphic repeats. Imputation increases power over individual SNPs to detect STR associations with gene expression. Imputing STRs into existing SNP datasets will enable the first large-scale STR association studies across a range of complex traits.
引用
收藏
页数:11
相关论文
共 50 条
  • [21] Genome-wide enhancer-associated tandem repeats are expanded in cardiomyopathy
    Mitina, Aleksandra
    Khan, Mahreen
    Lesurf, Robert
    Yin, Yue
    Engchuan, Worrawat
    Hamdan, Omar
    Pellecchia, Giovanna
    Trost, Brett
    Backstrom, Ian
    Guo, Keyi
    Pallotto, Linda M.
    Doong, Phoenix Hoi Lam
    Wang, Zhuozhi
    Nalpathamkalam, Thomas
    Thiruvahindrapuram, Bhooma
    Papaz, Tanya
    Pearson, Christopher E.
    Ragoussis, Jiannis
    Subbarao, Padmaja
    Azad, Meghan B.
    Turvey, Stuart E.
    Mandhane, Piushkumar
    Moraes, Theo J.
    Simons, Elinor
    Scherer, Stephen W.
    Lougheed, Jane
    Mondal, Tapas
    Smythe, John
    Altamirano-Diaz, Luis
    Oechslin, Erwin
    Mital, Seema
    Yuen, Ryan K. C.
    EBIOMEDICINE, 2024, 101
  • [22] Genome-wide contribution of common short-tandem repeats to Parkinson's disease genetic risk
    Bustos, Bernabe, I
    Billingsley, Kimberley
    Blauwendraat, Cornelis
    Gibbs, J. Raphael
    Gan-Or, Ziv
    Krainc, Dimitri
    Singleton, Andrew B.
    Lubbe, Steven J.
    BRAIN, 2023, 146 (01) : 65 - 74
  • [23] Genome-wide analysis of tandem repeats in Daphnia pulex - a comparative approach
    Mayer, Christoph
    Leese, Florian
    Tollrian, Ralph
    BMC GENOMICS, 2010, 11
  • [24] Genome-wide analysis of tandem repeats in Daphnia pulex - a comparative approach
    Christoph Mayer
    Florian Leese
    Ralph Tollrian
    BMC Genomics, 11
  • [25] SeqEntropy: Genome-Wide Assessment of Repeats for Short Read Sequencing
    Chu, Hsueh-Ting
    Hsiao, William W. L.
    Tsao, Theresa T. H.
    Hsu, D. Frank
    Chen, Chaur-Chin
    Lee, Sheng-An
    Kao, Cheng-Yan
    PLOS ONE, 2013, 8 (03):
  • [26] PHARP: a pig haplotype reference panel for genotype imputation
    Wang, Zhen
    Zhang, Zhenyang
    Chen, Zitao
    Sun, Jiabao
    Cao, Caiyun
    Wu, Fen
    Xu, Zhong
    Zhao, Wei
    Sun, Hao
    Guo, Longyu
    Zhang, Zhe
    Wang, Qishan
    Pan, Yuchun
    SCIENTIFIC REPORTS, 2022, 12 (01)
  • [27] Effect of Genome-Wide Genotyping and Reference Panels on Rare Variants Imputation
    Martin Ladouceur
    Celia M.T.Greenwood
    J.Brent Richards
    Journal of Genetics and Genomics, 2012, (10) : 545 - 550
  • [28] Effect of Genome-Wide Genotyping and Reference Panels on Rare Variants Imputation
    Martin Ladouceur
    Celia MTGreenwood
    JBrent Richards
    遗传学报, 2012, 39 (10) : 545 - 550
  • [29] Effect of Genome-Wide Genotyping and Reference Panels on Rare Variants Imputation
    Zheng, Hou-Feng
    Ladouceur, Martin
    Greenwood, Celia M. T.
    Richards, J. Brent
    JOURNAL OF GENETICS AND GENOMICS, 2012, 39 (10) : 545 - 550
  • [30] Impact of Reference Panel Choice for Imputation on Genome-wide Association Study Results for Type 2 Diabetes in Arab Population
    Almeer, Hossam
    Alsmadi, Osama
    Elkum, Naser
    Saad, Mohamad
    GENETIC EPIDEMIOLOGY, 2019, 43 (07) : 864 - 864