Genome-enabled prediction through machine learning methods considering different levels of trait complexity

被引:9
|
作者
Barbosa, Ivan de Paiva [1 ]
da Silva, Michele Jorge [1 ]
da Costa, Weverton Gomes [1 ]
de Castro Sant'Anna, Isabela [2 ]
Nascimento, Moyses [3 ]
Cruz, Cosme Damiao [1 ]
机构
[1] Fed Univ Vicosa UFV, Dept Gen Biol, Bioinformat Lab, Vicosa, MG, Brazil
[2] Agron Inst IAC, Rubber Tree & Agroforestry Ctr, Votuporana, SP, Brazil
[3] Fed Univ Vicosa UFV, Dept Stat, Lab Computat Intelligence & Stat Learning, Vicosa, MG, Brazil
基金
巴西圣保罗研究基金会;
关键词
D O I
10.1002/csc2.20488
中图分类号
S3 [农学(农艺学)];
学科分类号
0901 ;
摘要
Genomic-wide selection (GWS) consists of the use of a large number of molecular markers for the prediction of genetic values and has been shown to be highly relevant for genetic improvement. The objective of this work was to evaluate and compare the predictive performance of statistical (ridge regression-best linear unbiased predictor [RR-BLUP] and BayesB) and machine learning methods through GWS in simulated populations with traits presenting different levels of heritability and quantitative trait loci (QTL) numbers in the presence of dominant and epistatic effects. The simulated genome of population F-2 was formed by 1,000 individuals and genotyped with 2,010 single nucleotide polymorphism (SNP) markers. Twenty-six traits were simulated considering QTL numbers ranging from two to 88 and heritabilities of .3 and .6. The selective and predictive performances were evaluated using the multilayer perceptron (MLP), radial basis function (RBF), decision trees (DT), bagging (BA), random forest (RF), and boosting (BO) machine learning models and the classical RR-BLUP and BayesB methods. A high effect of heritability was observed for the results of selective accuracy when compared to the increased QTL number. In addition, the selective accuracy based on the number of QTL demonstrates that the application of alternative machine learning models, such as RBF, BA, BO, and RF, can be suitable for the analysis according to QTL number. Machine learning methods are powerful tools for predicting genetic values with epistatic gene control in traits with different degrees of heritability and different numbers of controlling genes.
引用
收藏
页码:1890 / 1902
页数:13
相关论文
共 50 条
  • [1] Genome-enabled prediction of reproductive traits in Nellore cattle using parametric models and machine learning methods
    Alves, A. A. C.
    Espigolan, R.
    Bresolin, T.
    Costa, R. M.
    Fernandes Junior, G. A.
    Ventura, R. V.
    Carvalheiro, R.
    Albuquerque, L. G.
    ANIMAL GENETICS, 2021, 52 (01) : 32 - 46
  • [2] Nonparametric and machine learning procedures for genome-enabled prediction of genetic value for quantitative traits
    Gianola, D.
    CANADIAN JOURNAL OF ANIMAL SCIENCE, 2009, 89 (01) : 123 - 124
  • [3] A Multiple-Trait Bayesian Lasso for Genome-Enabled Analysis and Prediction of Complex Traits
    Gianola, Daniel
    Fernando, Rohan L.
    GENETICS, 2020, 214 (02) : 305 - 331
  • [4] Genome-enabled prediction through quantile random forest for complex traits
    Valadares, Cristiane Botelho
    Nascimento, Moyses
    Celeri, Mauricio de Oliveira
    Nascimento, Ana Carolina Campana
    Barroso, Lais Mayara Azevedo
    Sant'Anna, Isabela de Castro
    Azevedo, Camila Ferreira
    CIENCIA RURAL, 2023, 53 (10):
  • [5] Genome-enabled classification of stayability in Nellore cattle under a machine learning framework
    Santana, Talita Estefani Zunino
    Silva, Jose Cleydson F.
    da Silva, Luiz Otavio C.
    Alvarenga, Amanda Botelho
    Menezes, Gilberto Romeiro de O.
    Torres Junior, Roberto A. A.
    Duarte, Marcio de Souza
    Fonseca e Silva, Fabyano
    LIVESTOCK SCIENCE, 2022, 260
  • [6] A non-parametric mixture model for genome-enabled prediction of genetic value for a quantitative trait
    Gianola, Daniel
    Wu, Xiao-Lin
    Manfredi, Eduardo
    Simianer, Henner
    GENETICA, 2010, 138 (9-10) : 959 - 977
  • [7] A non-parametric mixture model for genome-enabled prediction of genetic value for a quantitative trait
    Daniel Gianola
    Xiao-Lin Wu
    Eduardo Manfredi
    Henner Simianer
    Genetica, 2010, 138 : 959 - 977
  • [8] White Lupin Drought Tolerance: Genetic Variation, Trait Genetic Architecture, and Genome-Enabled Prediction
    Pecetti, Luciano
    Annicchiarico, Paolo
    Crosta, Margherita
    Notario, Tommaso
    Ferrari, Barbara
    Nazzicari, Nelson
    INTERNATIONAL JOURNAL OF MOLECULAR SCIENCES, 2023, 24 (03)
  • [9] A Review and Tutorial of Machine Learning Methods for Microbiome Host Trait Prediction
    Zhou, Yi-Hui
    Gallins, Paul
    FRONTIERS IN GENETICS, 2019, 10
  • [10] Machine Learning-Enabled Genome Mining and Bioactivity Prediction of Natural Products
    Yuan, Yujie
    Shi, Chengyou
    Zhao, Huimin
    ACS SYNTHETIC BIOLOGY, 2023, 12 (09): : 2650 - 2662