Accuracy of imputation to whole-genome sequence in sheep

被引:40
|
作者
Bolormaa, Sunduimijid [1 ,2 ]
Chamberlain, Amanda J. [1 ]
Khansefid, Majid [1 ,2 ]
Stothard, Paul [3 ]
Swan, Andrew A. [2 ,4 ]
Mason, Brett [1 ]
Prowse-Wilkins, Claire P. [1 ]
Duijvesteijn, Naomi [2 ,5 ]
Moghaddar, Nasir [2 ,5 ]
van der Werf, Julius H. [2 ,5 ]
Daetwyler, Hans D. [1 ,2 ,6 ]
MacLeod, Iona M. [1 ,2 ]
机构
[1] Agr Victoria, Ctr AgriBiosci, AgriBio, 5 Ring Rd, Bundoora, Vic 3083, Australia
[2] Cooperat Res Ctr Sheep Ind Innovat, Armidale, NSW 2351, Australia
[3] Univ Alberta, Fac Agr Life & Environm Sci, Edmonton, AB T6G 2R3, Canada
[4] Univ New England, Anim Genet & Breeding Unit, Armidale, NSW 2351, Australia
[5] Univ New England, Sch Environm & Rural Sci, Armidale, NSW 2351, Australia
[6] La Trobe Univ, Sch Appl Syst Biol, Bundoora, Vic 3086, Australia
关键词
GENOTYPE IMPUTATION; PREDICTIONS; BREEDS; RELIABILITY; VARIANTS; IMPROVE; DESIGN;
D O I
10.1186/s12711-018-0443-5
中图分类号
S8 [畜牧、 动物医学、狩猎、蚕、蜂];
学科分类号
0905 ;
摘要
BackgroundThe use of whole-genome sequence (WGS) data for genomic prediction and association studies is highly desirable because the causal mutations should be present in the data. The sequencing of 935 sheep from a range of breeds provides the opportunity to impute sheep genotyped with single nucleotide polymorphism (SNP) arrays to WGS. This study evaluated the accuracy of imputation from SNP genotypes to WGS using this reference population of 935 sequenced sheep.ResultsThe accuracy of imputation from the Ovine Infinium((R)) HD BeadChip SNP (similar to 500k) to WGS was assessed for three target breeds: Merino, Poll Dorset and F1 Border LeicesterxMerino. Imputation accuracy was highest for the Poll Dorset breed, although there were more Merino individuals in the sequenced reference population than Poll Dorset individuals. In addition, empirical imputation accuracies were higher (by up to 1.7%) when using larger multi-breed reference populations compared to using a smaller single-breed reference population. The mean accuracy of imputation across target breeds using the Minimac3 or the FImpute software was 0.94. The empirical imputation accuracy varied considerably across the genome; six chromosomes carried regions of one or more Mb with a mean imputation accuracy of <0.7. Imputation accuracy in five variant annotation classes ranged from 0.87 (missense) up to 0.94 (intronic variants), where lower accuracy corresponded to higher proportions of rare alleles. The imputation quality statistic reported from Minimac3 (R-2) had a clear positive relationship with the empirical imputation accuracy. Therefore, by first discarding imputed variants with an R-2 below 0.4, the mean empirical accuracy across target breeds increased to 0.97. Although accuracy of genomic prediction was less affected by filtering on R-2 in a multi-breed population of sheep with imputed WGS, the genomic heritability clearly tended to be lower when using variants with an R-2 0.4.ConclusionsThe mean imputation accuracy was high for all target breeds and was increased by combining smaller breed sets into a multi-breed reference. We found that the Minimac3 software imputation quality statistic (R-2) was a useful indicator of empirical imputation accuracy, enabling removal of very poorly imputed variants before downstream analyses.
引用
收藏
页数:17
相关论文
共 50 条
  • [41] Imputation to whole-genome sequence and its use in genome-wide association studies for pork colour traits in crossbred and purebred pigs
    Heidaritabar, Marzieh
    Huisman, Abe
    Krivushin, Kirill
    Stothard, Paul
    Dervishi, Elda
    Charagu, Patrick
    Bink, Marco C. A. M.
    Plastow, Graham S.
    FRONTIERS IN GENETICS, 2022, 13
  • [42] Genome-Wide Association Study on Reproductive Traits Using Imputation-Based Whole-Genome Sequence Data in Yorkshire Pigs
    Sun, Jingchun
    Xiao, Jinhong
    Jiang, Yifan
    Wang, Yaxin
    Cao, Minghao
    Wei, Jialin
    Yu, Taiyong
    Ding, Xiangdong
    Yang, Gongshe
    GENES, 2023, 14 (04)
  • [43] Whole-genome bisulfite sequencing with improved accuracy and cost
    Suzuki, Masako
    Liao, Will
    Wos, Frank
    Johnston, Andrew D.
    DeGrazia, Justin
    Ishii, Jennifer
    Bloom, Toby
    Zody, Michael C.
    Germer, Soren
    Greally, John M.
    GENOME RESEARCH, 2018, 28 (09) : 1364 - 1371
  • [45] Using imputation-based whole-genome sequencing data to improve the accuracy of genomic prediction for combined populations in pigs
    Hailiang Song
    Shaopan Ye
    Yifan Jiang
    Zhe Zhang
    Qin Zhang
    Xiangdong Ding
    Genetics Selection Evolution, 51
  • [46] Using imputation-based whole-genome sequencing data to improve the accuracy of genomic prediction for combined populations in pigs
    Song, Hailiang
    Ye, Shaopan
    Jiang, Yifan
    Zhang, Zhe
    Zhang, Qin
    Ding, Xiangdong
    GENETICS SELECTION EVOLUTION, 2019, 51 (01)
  • [47] Whole-genome sequence of Macaca fascicularis: liver tissue
    Seo, Eun-Hye
    Kim, Jeong-Hwan
    Kim, Da-Hee
    Oh, Jung-Hwa
    Kim, Seon-Young
    BMC GENOMIC DATA, 2023, 24 (01):
  • [48] Addressing chromosome evolution in the whole-genome sequence era
    Faraut, Thomas
    CHROMOSOME RESEARCH, 2008, 16 (01) : 5 - 16
  • [49] Whole-genome sequence of six Rhizobium laguerreae strains
    Kirichek, Evgenii A.
    Flores-Felix, Jose D.
    Velazquez, Encarna
    Tsyganova, Anna V.
    Tsyganov, Viktor E.
    MICROBIOLOGY RESOURCE ANNOUNCEMENTS, 2024, 13 (06):
  • [50] A highly annotated whole-genome sequence of a Korean individual
    Jong-Il Kim
    Young Seok Ju
    Hansoo Park
    Sheehyun Kim
    Seonwook Lee
    Jae-Hyuk Yi
    Joann Mudge
    Neil A. Miller
    Dongwan Hong
    Callum J. Bell
    Hye-Sun Kim
    In-Soon Chung
    Woo-Chung Lee
    Ji-Sun Lee
    Seung-Hyun Seo
    Ji-Young Yun
    Hyun Nyun Woo
    Heewook Lee
    Dongwhan Suh
    Seungbok Lee
    Hyun-Jin Kim
    Maryam Yavartanoo
    Minhye Kwak
    Ying Zheng
    Mi Kyeong Lee
    Hyunjun Park
    Jeong Yeon Kim
    Omer Gokcumen
    Ryan E. Mills
    Alexander Wait Zaranek
    Joseph Thakuria
    Xiaodi Wu
    Ryan W. Kim
    Jim J. Huntley
    Shujun Luo
    Gary P. Schroth
    Thomas D. Wu
    HyeRan Kim
    Kap-Seok Yang
    Woong-Yang Park
    Hyungtae Kim
    George M. Church
    Charles Lee
    Stephen F. Kingsmore
    Jeong-Sun Seo
    Nature, 2009, 460 : 1011 - 1015