Comprehensive evaluation of imputation performance in African Americans

Authors
Pritam Chanda
Naoya Yuhki
Man Li
Joel S Bader
Alex Hartz
Eric Boerwinkle
WH Linda Kao
Dan E Arking
Affiliations
[1] Johns Hopkins University, Department of Biomedical Engineering
[2] High-Throughput Biology Center, Department of Epidemiology
[3] Johns Hopkins University School of Medicine, Human Genetics Center and Division of Epidemiology
[4] Johns Hopkins Bloomberg School of Public Health
[5] The University of Texas
[6] McKusick-Nathans Institute of Genetic Medicine
[7] Johns Hopkins University School of Medicine
Source
Journal of Human Genetics | 2012, Volume 57
Keywords
concordance; GWAS; HapMap; imputation; imputation accuracy; kappa; 1000 Genomes
Abstract
Imputation of genome-wide single-nucleotide polymorphism (SNP) arrays to a larger known reference panel of SNPs has become a standard and essential part of genome-wide association studies. However, little is known about the behavior of imputation in African Americans with respect to the different imputation algorithms, the reference population(s) and the reference SNP panels used. Genome-wide SNP data (Affymetrix 6.0) from 3207 African American samples in the Atherosclerosis Risk in Communities Study (ARIC) were used to systematically evaluate imputation quality and yield. Imputation was performed with the imputation algorithms MACH, IMPUTE and BEAGLE using several combinations of three reference panels of HapMap III (ASW, YRI and CEU) and 1000 Genomes Project (pilot 1 YRI June 2010 release, EUR and AFR August 2010 and June 2011 releases) panels with SNP data on chromosomes 18, 20 and 22. About 10% of the directly genotyped SNPs from each chromosome were masked, and SNPs common between the reference panels were used for evaluating the imputation quality using two statistical metrics: concordance accuracy and Cohen’s kappa (κ) coefficient. The dependencies of these metrics on the minor allele frequencies (MAF) and specific genotype categories (minor allele homozygotes, heterozygotes and major allele homozygotes) were thoroughly investigated to determine the best panel and method for imputation in African Americans. In addition, the power to detect imputed SNPs associated with simulated phenotypes was studied using the mean genotype of each masked SNP in the imputed data. Our results indicate that the genotype concordances after stratification into each genotype category and Cohen’s κ coefficient are considerably better equipped to differentiate imputation performance than the traditionally used total concordance statistic, and both statistics improved with increasing MAF irrespective of the imputation method.
We also find that MACH and IMPUTE performed equally well and consistently better than BEAGLE irrespective of the reference panel used. Among the various combinations of reference panels, for both HapMap III and the 1000 Genomes Project, the multi-ethnic panels had better imputation accuracy than those containing samples from only a single ethnic group. The most recent 1000 Genomes Project release (June 2011) yielded a substantially higher number of imputed SNPs than HapMap III and performed as well as or better than the best combined HapMap III reference panels and previous releases of the 1000 Genomes Project.
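To illustrate why the abstract favors per-genotype concordance and Cohen's κ over total concordance, here is a minimal Python sketch of the three metrics for a single masked SNP. It is not the authors' code: the function names, the 0/1/2 minor-allele-count genotype coding, and the toy data are assumptions made for illustration only.

```python
from collections import Counter


def concordance(true, imputed):
    """Fraction of samples whose imputed genotype equals the true genotype."""
    return sum(t == i for t, i in zip(true, imputed)) / len(true)


def per_genotype_concordance(true, imputed):
    """Concordance computed separately within each true genotype category."""
    totals = Counter(true)
    hits = Counter(t for t, i in zip(true, imputed) if t == i)
    return {g: hits[g] / totals[g] for g in sorted(totals)}


def cohens_kappa(true, imputed):
    """Cohen's kappa: observed agreement corrected for chance agreement."""
    n = len(true)
    p_o = concordance(true, imputed)
    true_counts, imp_counts = Counter(true), Counter(imputed)
    categories = set(true_counts) | set(imp_counts)
    # Chance agreement from the marginal genotype frequencies of both call sets.
    p_e = sum(true_counts[g] * imp_counts[g] for g in categories) / (n * n)
    if p_e == 1.0:
        return float("nan")  # kappa is undefined when only one category occurs
    return (p_o - p_e) / (1.0 - p_e)


# Hypothetical low-MAF SNP (0 = major homozygote, 1 = heterozygote):
# the imputation calls every sample a major-allele homozygote.
true = [0] * 95 + [1] * 5
imputed = [0] * 100

print(concordance(true, imputed))               # 0.95
print(per_genotype_concordance(true, imputed))  # {0: 1.0, 1: 0.0}
print(cohens_kappa(true, imputed))              # 0.0
```

In this toy case total concordance looks high (0.95) even though no heterozygote was imputed correctly, whereas the stratified concordance and κ both expose the failure.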
Pages: 411-421
Page count: 10