Accuracy of imputation to whole-genome sequence in sheep

被引:40
|
作者
Bolormaa, Sunduimijid [1 ,2 ]
Chamberlain, Amanda J. [1 ]
Khansefid, Majid [1 ,2 ]
Stothard, Paul [3 ]
Swan, Andrew A. [2 ,4 ]
Mason, Brett [1 ]
Prowse-Wilkins, Claire P. [1 ]
Duijvesteijn, Naomi [2 ,5 ]
Moghaddar, Nasir [2 ,5 ]
van der Werf, Julius H. [2 ,5 ]
Daetwyler, Hans D. [1 ,2 ,6 ]
MacLeod, Iona M. [1 ,2 ]
机构
[1] Agr Victoria, Ctr AgriBiosci, AgriBio, 5 Ring Rd, Bundoora, Vic 3083, Australia
[2] Cooperat Res Ctr Sheep Ind Innovat, Armidale, NSW 2351, Australia
[3] Univ Alberta, Fac Agr Life & Environm Sci, Edmonton, AB T6G 2R3, Canada
[4] Univ New England, Anim Genet & Breeding Unit, Armidale, NSW 2351, Australia
[5] Univ New England, Sch Environm & Rural Sci, Armidale, NSW 2351, Australia
[6] La Trobe Univ, Sch Appl Syst Biol, Bundoora, Vic 3086, Australia
关键词
GENOTYPE IMPUTATION; PREDICTIONS; BREEDS; RELIABILITY; VARIANTS; IMPROVE; DESIGN;
D O I
10.1186/s12711-018-0443-5
中图分类号
S8 [畜牧、 动物医学、狩猎、蚕、蜂];
学科分类号
0905 ;
摘要
BackgroundThe use of whole-genome sequence (WGS) data for genomic prediction and association studies is highly desirable because the causal mutations should be present in the data. The sequencing of 935 sheep from a range of breeds provides the opportunity to impute sheep genotyped with single nucleotide polymorphism (SNP) arrays to WGS. This study evaluated the accuracy of imputation from SNP genotypes to WGS using this reference population of 935 sequenced sheep.ResultsThe accuracy of imputation from the Ovine Infinium((R)) HD BeadChip SNP (similar to 500k) to WGS was assessed for three target breeds: Merino, Poll Dorset and F1 Border LeicesterxMerino. Imputation accuracy was highest for the Poll Dorset breed, although there were more Merino individuals in the sequenced reference population than Poll Dorset individuals. In addition, empirical imputation accuracies were higher (by up to 1.7%) when using larger multi-breed reference populations compared to using a smaller single-breed reference population. The mean accuracy of imputation across target breeds using the Minimac3 or the FImpute software was 0.94. The empirical imputation accuracy varied considerably across the genome; six chromosomes carried regions of one or more Mb with a mean imputation accuracy of <0.7. Imputation accuracy in five variant annotation classes ranged from 0.87 (missense) up to 0.94 (intronic variants), where lower accuracy corresponded to higher proportions of rare alleles. The imputation quality statistic reported from Minimac3 (R-2) had a clear positive relationship with the empirical imputation accuracy. Therefore, by first discarding imputed variants with an R-2 below 0.4, the mean empirical accuracy across target breeds increased to 0.97. Although accuracy of genomic prediction was less affected by filtering on R-2 in a multi-breed population of sheep with imputed WGS, the genomic heritability clearly tended to be lower when using variants with an R-2 0.4.ConclusionsThe mean imputation accuracy was high for all target breeds and was increased by combining smaller breed sets into a multi-breed reference. We found that the Minimac3 software imputation quality statistic (R-2) was a useful indicator of empirical imputation accuracy, enabling removal of very poorly imputed variants before downstream analyses.
引用
收藏
页数:17
相关论文
共 50 条
  • [31] Evaluation of sequencing strategies for whole-genome imputation with hybrid peeling
    Roger Ros-Freixedes
    Andrew Whalen
    Gregor Gorjanc
    Alan J. Mileham
    John M. Hickey
    Genetics Selection Evolution, 52
  • [32] Accuracy of genomic prediction using imputed whole-genome sequence data in white layers
    Heidaritabar, M.
    Calus, M. P. L.
    Megens, H-J.
    Vereijken, A.
    Groenen, M. A. M.
    Bastiaansen, J. W. M.
    JOURNAL OF ANIMAL BREEDING AND GENETICS, 2016, 133 (03) : 167 - 179
  • [33] The Whole-Genome Sequence of the Coral Acropora millepora
    Ying, Hua
    Hayward, David C.
    Cooke, Ira
    Wang, Weiwen
    Moya, Aurelie
    Siemering, Kirby R.
    Sprungala, Susanne
    Ball, Eldon E.
    Foret, Sylvain
    Miller, David J.
    GENOME BIOLOGY AND EVOLUTION, 2019, 11 (05): : 1374 - 1379
  • [34] The whole-genome sequence analysis of Morchella sextelata
    Mei-Han
    Qingshan-Wang
    Baiyintala
    Wuhanqimuge
    SCIENTIFIC REPORTS, 2019, 9 (1)
  • [35] Emergence and whole-genome sequence of Senecavirus A in Colombia
    Sun, D.
    Vannucci, F.
    Knutson, T. P.
    Corzo, C.
    Marthaler, D. G.
    TRANSBOUNDARY AND EMERGING DISEASES, 2017, 64 (05) : 1346 - 1349
  • [37] Whole-genome sequence and annotation of Penstemon davidsonii
    Ostevik, Kate L.
    Alabady, Magdy
    Zhang, Mengrui
    Rausher, Mark D.
    G3-GENES GENOMES GENETICS, 2024, 14 (03):
  • [38] Whole-Genome Resequencing to Study Brucellosis Susceptibility in Sheep
    Li, Xiaolong
    Wu, Qingmin
    Zhang, Xiaoxue
    Li, Chong
    Zhang, Deyin
    Li, Guoze
    Zhang, Yukun
    Zhao, Yuan
    Shi, Zhaoguo
    Wang, Weimin
    Li, Fadi
    FRONTIERS IN GENETICS, 2021, 12
  • [39] Functionally prioritised whole-genome sequence variants improve the accuracy of genomic prediction for heat tolerance
    Cheruiyot, Evans K.
    Haile-Mariam, Mekonnen
    Cocks, Benjamin G.
    MacLeod, Iona M.
    Mrode, Raphael
    Pryce, Jennie E.
    GENETICS SELECTION EVOLUTION, 2022, 54 (01)
  • [40] Functionally prioritised whole-genome sequence variants improve the accuracy of genomic prediction for heat tolerance
    Evans K. Cheruiyot
    Mekonnen Haile-Mariam
    Benjamin G. Cocks
    Iona M. MacLeod
    Raphael Mrode
    Jennie E. Pryce
    Genetics Selection Evolution, 54