Imputation to whole-genome sequence using multiple pig populations and its use in genome-wide association studies

被引:54
|
作者
van den Berg, Sanne [1 ,2 ]
Vandenplas, Jeremie [1 ]
van Eeuwijk, Fred A. [2 ]
Bouwman, Aniek C. [1 ]
Lopes, Marcos S. [3 ,4 ]
Veerkamp, Roel F. [1 ]
机构
[1] Wageningen Univ & Res, Anim Breeding & Genom, POB 338, NL-6700 AH Wageningen, Netherlands
[2] Wageningen Univ & Res, Biometris, POB 16, NL-6700 AA Wageningen, Netherlands
[3] Topigs Norsvin Res Ctr, NL-6640 AA Beuningen, Netherlands
[4] Topigs Norsvin, BR-80420190 Curitiba, Parana, Brazil
关键词
QUANTITATIVE TRAIT LOCI; GENOTYPE IMPUTATION; LINKAGE DISEQUILIBRIUM; GENETIC DIVERSITY; TEAT NUMBER; ACCURACY; HOLSTEIN; INFERENCE; MARKERS; CATTLE;
D O I
10.1186/s12711-019-0445-y
中图分类号
S8 [畜牧、 动物医学、狩猎、蚕、蜂];
学科分类号
0905 ;
摘要
BackgroundUse of whole-genome sequence data (WGS) is expected to improve identification of quantitative trait loci (QTL). However, this requires imputation to WGS, often with a limited number of sequenced animals for the target population. The objective of this study was to investigate imputation to WGS in two pig lines using a multi-line reference population and, subsequently, to investigate the effect of using these imputed WGS (iWGS) for GWAS.MethodsPhenotypes and genotypes were available on 12,184 Large White pigs (LW-line) and 4943 Dutch Landrace pigs (DL-line). Imputed 660K and 80K genotypes for the LW-line and DL-line, respectively, were imputed to iWGS using Beagle v.4.1. Since only 32 LW-line and 12 DL-line boars were sequenced, 142 animals from eight commercial lines were added. GWAS were performed for each line using the 80K and 660K SNPs, the genotype scores of iWGS SNPs that had an imputation accuracy (Beagle R-2) higher than 0.6, and the dosage scores of all iWGS SNPs.ResultsFor the DL-line (LW-line), imputation of 80K genotypes to iWGS resulted in an average Beagle R-2 of 0.39 (0.49). After quality control, 2.5x10(6) (3.5x10(6)) SNPs had a Beagle R-2 higher than 0.6, resulting in an average Beagle R-2 of 0.83 (0.93). Compared to the 80K and 660K genotypes, using iWGS led to the identification of 48.9 and 64.4% more QTL regions, for the DL-line and LW-line, respectively, and the most significant SNPs in the QTL regions explained a higher proportion of phenotypic variance. Using dosage instead of genotype scores improved the identification of QTL, because the model accounted for uncertainty of imputation, and all SNPs were used in the analysis.ConclusionsImputation to WGS using the multi-line reference population resulted in relatively poor imputation, especially when imputing from 80K (DL-line). In spite of the poor imputation accuracies, using iWGS instead of a lower density SNP chip increased the number of detected QTL and the estimated proportion of phenotypic variance explained by these QTL, especially when dosage scores were used instead of genotype scores. Thus, iWGS, even with poor imputation accuracy, can be used to identify possible interesting regions for fine mapping.
引用
收藏
页数:13
相关论文
共 50 条
  • [41] Genome-wide association studies of multiple sclerosis
    Cotsapas, Chris
    Mitrovic, Mitja
    CLINICAL & TRANSLATIONAL IMMUNOLOGY, 2018, 7 (06)
  • [42] Genome-wide association study for longevity with whole-genome sequencing in 3 cattle breeds
    Zhang, Qianqian
    Guldbrandtsen, Bernt
    Thomasen, Jorn Rind
    Lund, Mogens Sando
    Sahana, Goutam
    JOURNAL OF DAIRY SCIENCE, 2016, 99 (09) : 7289 - 7298
  • [43] Genome-wide association analysis using multiple Atlantic salmon populations
    Ajasa, Afees A.
    Gjoen, Hans M.
    Boison, Solomon A.
    Lillehammer, Marie
    GENETICS SELECTION EVOLUTION, 2025, 57 (01)
  • [44] Genome-wide association study of endo-parasite phenotypes using imputed whole-genome sequence data in dairy and beef cattle
    Twomey, Alan J.
    Berry, Donagh P.
    Evans, Ross D.
    Doherty, Michael L.
    Graham, David A.
    Purfield, Deirdre C.
    GENETICS SELECTION EVOLUTION, 2019, 51 (1)
  • [45] Genome-wide association study of endo-parasite phenotypes using imputed whole-genome sequence data in dairy and beef cattle
    Alan J. Twomey
    Donagh P. Berry
    Ross D. Evans
    Michael L. Doherty
    David A. Graham
    Deirdre C. Purfield
    Genetics Selection Evolution, 51
  • [46] Modelling Human Regulatory Variation in Mouse: Finding the Function in Genome-Wide Association Studies and Whole-Genome Sequencing
    Schmouth, Jean-Francois
    Bonaguro, Russell J.
    Corso-Diaz, Ximena
    Simpson, Elizabeth M.
    PLOS GENETICS, 2012, 8 (03):
  • [47] On the Threshold from Genome-Wide Association Studies to Whole-Genome Sequencing Looking for Signal in All the Right Places
    Hansel, Nadia N.
    Mathias, Rasika A.
    AMERICAN JOURNAL OF RESPIRATORY AND CRITICAL CARE MEDICINE, 2014, 189 (04) : 381 - 383
  • [48] Ultra Low-Coverage Whole-Genome Sequencing as an Alternative to Genotyping Arrays in Genome-Wide Association Studies
    Chat, Vylyny
    Ferguson, Robert
    Morales, Leah
    Kirchhoff, Tomas
    FRONTIERS IN GENETICS, 2022, 12
  • [49] Genome-Wide Association Studies of Cancer in Diverse Populations
    Park, Sungshim L.
    Cheng, Iona
    Haiman, Christopher A.
    CANCER EPIDEMIOLOGY BIOMARKERS & PREVENTION, 2018, 27 (04) : 405 - 417
  • [50] Development of Genome-wide Simple Sequence Repeat Markers from Whole-genome Sequence of Mungbean (Vigna radiata)
    Mayalagu, Kanimoli Mathivathana
    Adhimoolam, Karthikeyan
    Nallathambi, Jagadeeshselvam
    Rajagopalan, Veera Ranjani
    Balasubramanian, Madhumitha
    Shihabdeen, Madiha Natchi Samu
    Natesan, Senthil
    Muthurajan, Raveendran
    Manickam, Sudha
    LEGUME RESEARCH, 2023, 46 (10) : 1405 - 1409