Accuracy of imputation to whole-genome sequence in sheep

Cited by: 40
Authors
Bolormaa, Sunduimijid [1, 2]
Chamberlain, Amanda J. [1]
Khansefid, Majid [1, 2]
Stothard, Paul [3]
Swan, Andrew A. [2, 4]
Mason, Brett [1]
Prowse-Wilkins, Claire P. [1]
Duijvesteijn, Naomi [2, 5]
Moghaddar, Nasir [2, 5]
van der Werf, Julius H. [2, 5]
Daetwyler, Hans D. [1, 2, 6]
MacLeod, Iona M. [1, 2]
Affiliations
[1] Agr Victoria, Ctr AgriBiosci, AgriBio, 5 Ring Rd, Bundoora, Vic 3083, Australia
[2] Cooperat Res Ctr Sheep Ind Innovat, Armidale, NSW 2351, Australia
[3] Univ Alberta, Fac Agr Life & Environm Sci, Edmonton, AB T6G 2R3, Canada
[4] Univ New England, Anim Genet & Breeding Unit, Armidale, NSW 2351, Australia
[5] Univ New England, Sch Environm & Rural Sci, Armidale, NSW 2351, Australia
[6] La Trobe Univ, Sch Appl Syst Biol, Bundoora, Vic 3086, Australia
Keywords
GENOTYPE IMPUTATION; PREDICTIONS; BREEDS; RELIABILITY; VARIANTS; IMPROVE; DESIGN
DOI
10.1186/s12711-018-0443-5
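Chinese Library Classification (CLC) number
S8 [Animal husbandry, veterinary medicine, hunting, sericulture, apiculture]
Discipline classification code
0905
Abstract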
Background: The use of whole-genome sequence (WGS) data for genomic prediction and association studies is highly desirable because the causal mutations should be present in the data. The sequencing of 935 sheep from a range of breeds provides the opportunity to impute sheep genotyped with single nucleotide polymorphism (SNP) arrays to WGS. This study evaluated the accuracy of imputation from SNP genotypes to WGS using this reference population of 935 sequenced sheep.

Results: The accuracy of imputation from the Ovine Infinium® HD SNP BeadChip (~500k) to WGS was assessed for three target breeds: Merino, Poll Dorset and F1 Border Leicester × Merino. Imputation accuracy was highest for the Poll Dorset breed, although the sequenced reference population contained more Merino than Poll Dorset individuals. In addition, empirical imputation accuracies were higher (by up to 1.7%) with larger multi-breed reference populations than with a smaller single-breed reference population. The mean accuracy of imputation across target breeds using either the Minimac3 or the FImpute software was 0.94. The empirical imputation accuracy varied considerably across the genome; six chromosomes carried regions of one or more Mb with a mean imputation accuracy of <0.7. Imputation accuracy in five variant annotation classes ranged from 0.87 (missense) up to 0.94 (intronic variants), where lower accuracy corresponded to higher proportions of rare alleles. The imputation quality statistic reported by Minimac3 (R²) had a clear positive relationship with the empirical imputation accuracy. Therefore, when imputed variants with an R² below 0.4 were first discarded, the mean empirical accuracy across target breeds increased to 0.97. Although accuracy of genomic prediction was less affected by filtering on R² in a multi-breed population of sheep with imputed WGS, the genomic heritability clearly tended to be lower when using variants with an R² ≤ 0.4.

Conclusions: The mean imputation accuracy was high for all target breeds and was increased by combining smaller breed sets into a multi-breed reference. We found that the Minimac3 software imputation quality statistic (R²) was a useful indicator of empirical imputation accuracy, enabling removal of very poorly imputed variants before downstream analyses.
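The R² filtering step described in the abstract can be illustrated with a minimal sketch. It assumes that empirical imputation accuracy is measured per variant as the Pearson correlation between imputed dosages and sequenced genotypes; the abstract does not spell out this definition. The variant records, field names (`r2`, `imputed`, `true`) and genotype values below are hypothetical toy data, not results from the study.

```python
from statistics import mean


def pearson(x, y):
    """Pearson correlation; returns None if either vector has zero variance."""
    mx, my = mean(x), mean(y)
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    if sxx == 0 or syy == 0:
        return None
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    return sxy / (sxx * syy) ** 0.5


def mean_empirical_accuracy(variants, r2_threshold=None):
    """Mean per-variant accuracy, optionally discarding variants with R² below the threshold.

    Returns (mean accuracy or None, number of variants kept).
    """
    kept = [v for v in variants
            if r2_threshold is None or v["r2"] >= r2_threshold]
    accs = [pearson(v["imputed"], v["true"]) for v in kept]
    accs = [a for a in accs if a is not None]
    return (mean(accs) if accs else None), len(kept)


# Hypothetical toy data: imputed dosages (0..2) and sequenced genotypes (0, 1, 2)
# for four animals at three variants, plus a reported imputation R² per variant.
variants = [
    {"id": "var1", "r2": 0.95, "imputed": [0.0, 1.0, 2.0, 1.0], "true": [0, 1, 2, 1]},
    {"id": "var2", "r2": 0.80, "imputed": [0.1, 0.9, 1.8, 0.2], "true": [0, 1, 2, 0]},
    {"id": "var3", "r2": 0.20, "imputed": [0.5, 0.4, 0.6, 0.5], "true": [0, 1, 0, 2]},
]

acc_all, n_all = mean_empirical_accuracy(variants)
acc_filtered, n_filtered = mean_empirical_accuracy(variants, r2_threshold=0.4)
print(f"all variants: n={n_all}, mean accuracy={acc_all:.3f}")
print(f"R² >= 0.4:    n={n_filtered}, mean accuracy={acc_filtered:.3f}")
```

In this toy example the poorly imputed variant is dropped by the 0.4 threshold, so the mean accuracy of the remaining variants is higher, which mirrors the direction of the effect reported in the abstract.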
Pages: 17
Related papers
50 in total
  • [1] Accuracy of imputation to whole-genome sequence in sheep
    Sunduimijid Bolormaa
    Amanda J. Chamberlain
    Majid Khansefid
    Paul Stothard
    Andrew A. Swan
    Brett Mason
    Claire P. Prowse-Wilkins
    Naomi Duijvesteijn
    Nasir Moghaddar
    Julius H. van der Werf
    Hans D. Daetwyler
    Iona M. MacLeod
    Genetics Selection Evolution, 51
  • [2] Imputation accuracy to whole-genome sequence in Nellore cattle
    Gerardo A. Fernandes Júnior
    Roberto Carvalheiro
    Henrique N. de Oliveira
    Mehdi Sargolzaei
    Roy Costilla
    Ricardo V. Ventura
    Larissa F. S. Fonseca
    Haroldo H. R. Neves
    Ben J. Hayes
    Lucia G. de Albuquerque
    Genetics Selection Evolution, 53
  • [3] Imputation accuracy to whole-genome sequence in Nellore cattle
    Fernandes Junior, Gerardo A.
    Carvalheiro, Roberto
    de Oliveira, Henrique N.
    Sargolzaei, Mehdi
    Costilla, Roy
    Ventura, Ricardo V.
    Fonseca, Larissa F. S.
    Neves, Haroldo H. R.
    Hayes, Ben J.
    de Albuquerque, Lucia G.
    GENETICS SELECTION EVOLUTION, 2021, 53 (01)
  • [4] Accuracy of imputation to whole-genome sequence data in Holstein Friesian cattle
    Rianne van Binsbergen
    Marco CAM Bink
    Mario PL Calus
    Fred A van Eeuwijk
    Ben J Hayes
    Ina Hulsegge
    Roel F Veerkamp
    Genetics Selection Evolution, 46
  • [5] Accuracy of imputation to whole-genome sequence data in Holstein Friesian cattle
    van Binsbergen, Rianne
    Bink, Marco C. A. M.
    Calus, Mario P. L.
    van Eeuwijk, Fred A.
    Hayes, Ben J.
    Hulsegge, Ina
    Veerkamp, Roel F.
    GENETICS SELECTION EVOLUTION, 2014, 46
  • [6] Novel methods for genotype imputation to whole-genome sequence and a simple linear model to predict imputation accuracy
    Steven G. Larmer
    Mehdi Sargolzaei
    Luiz F. Brito
    Ricardo V. Ventura
    Flávio S. Schenkel
    BMC Genetics, 18
  • [7] Novel methods for genotype imputation to whole-genome sequence and a simple linear model to predict imputation accuracy
    Larmer, Steven G.
    Sargolzaei, Mehdi
    Brito, Luiz F.
    Ventura, Ricardo V.
    Schenkel, Flavio S.
    BMC GENETICS, 2017, 18
  • [8] Accuracy of whole-genome sequence imputation using hybrid peeling in large pedigreed livestock populations
    Roger Ros-Freixedes
    Andrew Whalen
    Ching-Yi Chen
    Gregor Gorjanc
    William O. Herring
    Alan J. Mileham
    John M. Hickey
    Genetics Selection Evolution, 52
  • [9] Accuracy of whole-genome sequence imputation using hybrid peeling in large pedigreed livestock populations
    Ros-Freixedes, Roger
    Whalen, Andrew
    Chen, Ching-Yi
    Gorjanc, Gregor
    Herring, William O.
    Mileham, Alan J.
    Hickey, John M.
    GENETICS SELECTION EVOLUTION, 2020, 52 (01)
  • [10] Evaluation of Whole-Genome Sequence Imputation Strategies in Korean Hanwoo Cattle
    Nawaz, Muhammad Yasir
    Bernardes, Priscila Arrigucci
    Savegnago, Rodrigo Pelicioni
    Lim, Dajeong
    Lee, Seung Hwan
    Gondro, Cedric
    ANIMALS, 2022, 12 (17)