Bayesian genomic selection: the effect of haplotype length and priors

被引:0
|
作者
Trine Michelle Villumsen
Luc Janss
机构
[1] University of Aarhus,Faculty of Agricultural Sciences, Department of Genetics & Biotechnology, Research Centre Foulum
[2] University of Copenhagen,Faculty of Life Sciences, Department of Large Animal Sciences
关键词
Full Data; Genomic Prediction; Genomic Estimate Breeding Value; Conditional Posterior Distribution; Allele Substitution Effect;
D O I
10.1186/1753-6561-3-S1-S11
中图分类号
学科分类号
摘要
Breeding values for animals with marker data are estimated using a genomic selection approach where data is analyzed using Bayesian multi-marker association models. Fourteen model scenarios with varying haplotype lengths, hyper parameter and prior distributions were compared to find the scenario expected to give the most correct genomic estimated breeding values for animals with marker information only. Five-fold cross validation was performed to assess the ability of models to estimate breeding values for animals in generation 3. In each of the five subsets, 20% of phenotypic records in generation 3 were left out. Correlations between breeding values estimated on full data and on subsets for the "leave-out" animals varied between 0.77–0.99. Regression coefficients of breeding values from full data on breeding values from subsets ranged from 0.78–1.01. Single-SNP marker models didn't perform well. Correlations were 0.77–0.89 and predicted breeding values were biased. In addition the models seemed to over fit the genomic part of the variation. Highest correlations and most unbiased results were obtained when SNP markers were joined into haplotypes. Especially the scenarios with 5-SNP haplotypes gave promising results (distance between adjacent SNPs is 0.1 cM evenly over the genome). All correlations were 0.99 and regression coefficients were 0.99–1.01. Models with 5-SNP markers seemed robust to hyper parameter and prior changes. Haplotypes up to 40 SNPs also gave good results. However, longer haplotypes are expected to have less predictive ability over several generations and therefore the 5-SNP haplotypes are expected to give the best predictions for generations 4–6.
引用
收藏
相关论文
共 50 条
  • [1] The importance of haplotype length and heritability using genomic selection in dairy cattle
    Villumsen, T. M.
    Janss, L.
    Lund, M. S.
    [J]. JOURNAL OF ANIMAL BREEDING AND GENETICS, 2009, 126 (01) : 3 - 13
  • [2] Entropic Priors and Bayesian Model Selection
    Brewer, Brendon J.
    Francis, Matthew J.
    [J]. BAYESIAN INFERENCE AND MAXIMUM ENTROPY METHODS IN SCIENCE AND ENGINEERING, 2009, 1193 : 179 - +
  • [3] BAYESIAN VARIABLE SELECTION WITH SHRINKING AND DIFFUSING PRIORS
    Narisetty, Naveen Naidu
    He, Xuming
    [J]. ANNALS OF STATISTICS, 2014, 42 (02): : 789 - 817
  • [4] Selection of model discrepancy priors in Bayesian calibration
    Ling, You
    Mullins, Joshua
    Mahadevan, Sankaran
    [J]. JOURNAL OF COMPUTATIONAL PHYSICS, 2014, 276 : 665 - 680
  • [5] Bayesian model selection using encompassing priors
    Klugkist, I
    Kato, B
    Hoijtink, H
    [J]. STATISTICA NEERLANDICA, 2005, 59 (01) : 57 - 69
  • [6] Can natural selection encode Bayesian priors?
    Ramirez, Juan Camilo
    Marshall, James A. R.
    [J]. JOURNAL OF THEORETICAL BIOLOGY, 2017, 426 : 57 - 66
  • [7] Bayesian variable selection with spherically symmetric priors
    De Kock, Michiel B.
    Eggers, Hans C.
    [J]. COMMUNICATIONS IN STATISTICS-THEORY AND METHODS, 2017, 46 (09) : 4250 - 4263
  • [8] Mixtures of g priors for Bayesian variable selection
    Liang, Feng
    Paulo, Rui
    Molina, German
    Clyde, Merlise A.
    Berger, Jim O.
    [J]. JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2008, 103 (481) : 410 - 423
  • [9] Conjugate priors and variable selection for Bayesian quantile regression
    Alhamzawi, Rahim
    Yu, Keming
    [J]. COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2013, 64 : 209 - 219
  • [10] Comparing Spike and Slab Priors for Bayesian Variable Selection
    Malsiner-Walli, Gertraud
    Wagner, Helga
    [J]. AUSTRIAN JOURNAL OF STATISTICS, 2011, 40 (04) : 241 - 264