A Function Accounting for Training Set Size and Marker Density to Model the Average Accuracy of Genomic Prediction

被引:42
|
作者
Erbe, Malena [1 ]
Gredler, Birgit [2 ]
Seefried, Franz Reinhold [2 ]
Bapst, Beat [2 ]
Simianer, Henner [1 ]
机构
[1] Univ Gottingen, Dept Anim Sci, Anim Breeding & Genet Grp, Gottingen, Germany
[2] Qualitas AG, Zug, Switzerland
来源
PLOS ONE | 2013年 / 8卷 / 12期
关键词
BREEDING VALUES; LINKAGE DISEQUILIBRIUM; RELATIONSHIP MATRIX; SELECTION; IMPACT;
D O I
10.1371/journal.pone.0081046
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Prediction of genomic breeding values is of major practical relevance in dairy cattle breeding. Deterministic equations have been suggested to predict the accuracy of genomic breeding values in a given design which are based on training set size, reliability of phenotypes, and the number of independent chromosome segments (Me). The aim of our study was to find a general deterministic equation for the average accuracy of genomic breeding values that also accounts for marker density and can be fitted empirically. Two data sets of similar to 698 Holstein Friesian bulls genotyped with 50 K SNPs and 19332 Brown Swiss bulls genotyped with 50 K SNPs and imputed to,600 K SNPs were available. Different k-fold (k = 2-10, 15, 20) cross-validation scenarios (50 replicates, random assignment) were performed using a genomic BLUP approach. A maximum likelihood approach was used to estimate the parameters of different prediction equations. The highest likelihood was obtained when using a modified form of the deterministic equation of Daetwyler et al. (2010), augmented by a weighting factor (w) based on the assumption that the maximum achievable accuracy is w < 1. The proportion of genetic variance captured by the complete SNP sets (w(2)) was 0.76 to 0.82 for Holstein Friesian and 0.72 to 0.75 for Brown Swiss. When modifying the number of SNPs, w was found to be proportional to the log of the marker density up to a limit which is population and trait specific and was found to be reached with,209000 SNPs in the Brown Swiss population studied.
引用
收藏
页数:11
相关论文
共 50 条
  • [41] Effect of training-sample size and classification difficulty on the accuracy of genomic predictors
    Popovici, Vlad
    Chen, Weijie
    Gallas, Brandon G.
    Hatzis, Christos
    Shi, Weiwei
    Samuelson, Frank W.
    Nikolsky, Yuri
    Tsyganova, Marina
    Ishkin, Alex
    Nikolskaya, Tatiana
    Hess, Kenneth R.
    Valero, Vicente
    Booser, Daniel
    Delorenzi, Mauro
    Hortobagyi, Gabriel N.
    Shi, Leming
    Symmans, W. Fraser
    Pusztai, Lajos
    BREAST CANCER RESEARCH, 2010, 12 (01)
  • [42] Training Set Construction for Genomic Prediction in Auto-Tetraploids: An Example in Potato
    Wilson, Stefan
    Malosetti, Marcos
    Maliepaard, Chris
    Mulder, Han A.
    Visser, Richard G. F.
    van Eeuwijk, Fred
    FRONTIERS IN PLANT SCIENCE, 2021, 12
  • [43] Genomic prediction in hybrid breeding: I. Optimizing the training set design
    Albrecht E. Melchinger
    Rohan Fernando
    Christian Stricker
    Chris-Carolin Schön
    Hans-Jürgen Auinger
    Theoretical and Applied Genetics, 2023, 136
  • [44] The trade-off between density marker panels size and predictive ability of genomic prediction for agronomic traits in Coffea canephora
    de Sousa, Ithalo Coelho
    Barreto, Cynthia Aparecida Valiati
    Caixeta, Eveline Teixeira
    Nascimento, Ana Carolina Campana
    Azevedo, Camila Ferreira
    Alkimim, Emilly Ruas
    Nascimento, Moyses
    EUPHYTICA, 2024, 220 (04)
  • [45] The trade-off between density marker panels size and predictive ability of genomic prediction for agronomic traits in Coffea canephora
    Ithalo Coelho de Sousa
    Cynthia Aparecida Valiati Barreto
    Eveline Teixeira Caixeta
    Ana Carolina Campana Nascimento
    Camila Ferreira Azevedo
    Emilly Ruas Alkimim
    Moysés Nascimento
    Euphytica, 2024, 220
  • [46] Genomic prediction in hybrid breeding: I. Optimizing the training set design
    Melchinger, Albrecht E. E.
    Fernando, Rohan
    Stricker, Christian
    Schoen, Chris-Carolin
    Auinger, Hans-Juergen
    THEORETICAL AND APPLIED GENETICS, 2023, 136 (08)
  • [47] CHILDRENS CONFORMITY TO THE CARDINALITY RULE AS A FUNCTION OF SET SIZE AND COUNTING ACCURACY
    FUSON, KC
    PERGAMENT, GG
    LYONS, BG
    HALL, JW
    CHILD DEVELOPMENT, 1985, 56 (06) : 1429 - 1436
  • [48] Accuracy of genomic prediction for seed oil concentration in high-oleic soybean populations using a low-density marker panel
    Hemingway, Joel
    Schnebly, Steve R.
    Rajcan, Istvan
    CROP SCIENCE, 2021, 61 (06) : 4012 - 4021
  • [49] Pedigree Effective Population Size Affects Genomic Prediction Accuracy in Katahdin Sheep
    Nilson, Sara M.
    Burke, Joan M.
    Lewis, Ronald M.
    JOURNAL OF ANIMAL SCIENCE, 2023, 101
  • [50] Pedigree Effective Population Size Affects Genomic Prediction Accuracy in Katahdin Sheep
    Nilson, Sara M.
    Burke, Joan M.
    Lewis, Ronald M.
    JOURNAL OF ANIMAL SCIENCE, 2023, 101 : 21 - 22