Imputation to whole-genome sequence using multiple pig populations and its use in genome-wide association studies

被引:54
|
作者
van den Berg, Sanne [1 ,2 ]
Vandenplas, Jeremie [1 ]
van Eeuwijk, Fred A. [2 ]
Bouwman, Aniek C. [1 ]
Lopes, Marcos S. [3 ,4 ]
Veerkamp, Roel F. [1 ]
机构
[1] Wageningen Univ & Res, Anim Breeding & Genom, POB 338, NL-6700 AH Wageningen, Netherlands
[2] Wageningen Univ & Res, Biometris, POB 16, NL-6700 AA Wageningen, Netherlands
[3] Topigs Norsvin Res Ctr, NL-6640 AA Beuningen, Netherlands
[4] Topigs Norsvin, BR-80420190 Curitiba, Parana, Brazil
关键词
QUANTITATIVE TRAIT LOCI; GENOTYPE IMPUTATION; LINKAGE DISEQUILIBRIUM; GENETIC DIVERSITY; TEAT NUMBER; ACCURACY; HOLSTEIN; INFERENCE; MARKERS; CATTLE;
D O I
10.1186/s12711-019-0445-y
中图分类号
S8 [畜牧、 动物医学、狩猎、蚕、蜂];
学科分类号
0905 ;
摘要
BackgroundUse of whole-genome sequence data (WGS) is expected to improve identification of quantitative trait loci (QTL). However, this requires imputation to WGS, often with a limited number of sequenced animals for the target population. The objective of this study was to investigate imputation to WGS in two pig lines using a multi-line reference population and, subsequently, to investigate the effect of using these imputed WGS (iWGS) for GWAS.MethodsPhenotypes and genotypes were available on 12,184 Large White pigs (LW-line) and 4943 Dutch Landrace pigs (DL-line). Imputed 660K and 80K genotypes for the LW-line and DL-line, respectively, were imputed to iWGS using Beagle v.4.1. Since only 32 LW-line and 12 DL-line boars were sequenced, 142 animals from eight commercial lines were added. GWAS were performed for each line using the 80K and 660K SNPs, the genotype scores of iWGS SNPs that had an imputation accuracy (Beagle R-2) higher than 0.6, and the dosage scores of all iWGS SNPs.ResultsFor the DL-line (LW-line), imputation of 80K genotypes to iWGS resulted in an average Beagle R-2 of 0.39 (0.49). After quality control, 2.5x10(6) (3.5x10(6)) SNPs had a Beagle R-2 higher than 0.6, resulting in an average Beagle R-2 of 0.83 (0.93). Compared to the 80K and 660K genotypes, using iWGS led to the identification of 48.9 and 64.4% more QTL regions, for the DL-line and LW-line, respectively, and the most significant SNPs in the QTL regions explained a higher proportion of phenotypic variance. Using dosage instead of genotype scores improved the identification of QTL, because the model accounted for uncertainty of imputation, and all SNPs were used in the analysis.ConclusionsImputation to WGS using the multi-line reference population resulted in relatively poor imputation, especially when imputing from 80K (DL-line). In spite of the poor imputation accuracies, using iWGS instead of a lower density SNP chip increased the number of detected QTL and the estimated proportion of phenotypic variance explained by these QTL, especially when dosage scores were used instead of genotype scores. Thus, iWGS, even with poor imputation accuracy, can be used to identify possible interesting regions for fine mapping.
引用
收藏
页数:13
相关论文
共 50 条
  • [21] Accuracy of imputation to whole-genome sequence in sheep
    Sunduimijid Bolormaa
    Amanda J. Chamberlain
    Majid Khansefid
    Paul Stothard
    Andrew A. Swan
    Brett Mason
    Claire P. Prowse-Wilkins
    Naomi Duijvesteijn
    Nasir Moghaddar
    Julius H. van der Werf
    Hans D. Daetwyler
    Iona M. MacLeod
    Genetics Selection Evolution, 51
  • [22] Genome-wide association studies and whole-genome prediction reveal the genetic architecture of KRN in maize
    Yixin An
    Lin Chen
    Yong-Xiang Li
    Chunhui Li
    Yunsu Shi
    Dengfeng Zhang
    Yu Li
    Tianyu Wang
    BMC Plant Biology, 20
  • [23] Accuracy of imputation to whole-genome sequence in sheep
    Bolormaa, Sunduimijid
    Chamberlain, Amanda J.
    Khansefid, Majid
    Stothard, Paul
    Swan, Andrew A.
    Mason, Brett
    Prowse-Wilkins, Claire P.
    Duijvesteijn, Naomi
    Moghaddar, Nasir
    van der Werf, Julius H.
    Daetwyler, Hans D.
    MacLeod, Iona M.
    GENETICS SELECTION EVOLUTION, 2019, 51 (1)
  • [24] A genome-wide scan statistic framework for whole-genome sequence data analysis
    Zihuai He
    Bin Xu
    Joseph Buxbaum
    Iuliana Ionita-Laza
    Nature Communications, 10
  • [25] A genome-wide scan statistic framework for whole-genome sequence data analysis
    He, Zihuai
    Xu, Bin
    Buxbaum, Joseph
    Ionita-Laza, Iuliana
    NATURE COMMUNICATIONS, 2019, 10 (1)
  • [26] Genome-Wide Association Study for Seed Yield of Tepary Bean Using Whole-Genome Resequencing
    Ravelombola, Waltram
    Manley, Aurora
    Pham, Hanh
    Brown, Madeline
    Ruhl, Caroline
    Ghosh, Protik
    INTERNATIONAL JOURNAL OF MOLECULAR SCIENCES, 2024, 25 (20)
  • [27] Genome-wide association studies in diverse populations
    Noah A. Rosenberg
    Lucy Huang
    Ethan M. Jewett
    Zachary A. Szpiech
    Ivana Jankovic
    Michael Boehnke
    Nature Reviews Genetics, 2010, 11 : 356 - 366
  • [28] Genome-wide association studies in diverse populations
    Rosenberg, Noah A.
    Huang, Lucy
    Jewett, Ethan M.
    Szpiech, Zachary A.
    Jankovic, Ivana
    Boehnke, Michael
    NATURE REVIEWS GENETICS, 2010, 11 (05) : 356 - 366
  • [29] Genome-wide association studies of fertility and calving traits in Brown Swiss cattle using imputed whole-genome sequences
    Frischknecht, Mirjam
    Bapst, Beat
    Seefried, Franz R.
    Signer-Hasler, Heidi
    Garrick, Dorian
    Stricker, Christian
    Fries, Ruedi
    Russ, Ingolf
    Soelkner, Johann
    Bieber, Anna
    Strillacci, Maria G.
    Gredler-Grandl, Birgit
    Flury, Christine
    BMC GENOMICS, 2017, 18
  • [30] Genome-wide association studies of fertility and calving traits in Brown Swiss cattle using imputed whole-genome sequences
    Mirjam Frischknecht
    Beat Bapst
    Franz R. Seefried
    Heidi Signer-Hasler
    Dorian Garrick
    Christian Stricker
    Ruedi Fries
    Ingolf Russ
    Johann Sölkner
    Anna Bieber
    Maria G. Strillacci
    Birgit Gredler-Grandl
    Christine Flury
    BMC Genomics, 18