Genome-enabled prediction through machine learning methods considering different levels of trait complexity

被引:9
|
作者
Barbosa, Ivan de Paiva [1 ]
da Silva, Michele Jorge [1 ]
da Costa, Weverton Gomes [1 ]
de Castro Sant'Anna, Isabela [2 ]
Nascimento, Moyses [3 ]
Cruz, Cosme Damiao [1 ]
机构
[1] Fed Univ Vicosa UFV, Dept Gen Biol, Bioinformat Lab, Vicosa, MG, Brazil
[2] Agron Inst IAC, Rubber Tree & Agroforestry Ctr, Votuporana, SP, Brazil
[3] Fed Univ Vicosa UFV, Dept Stat, Lab Computat Intelligence & Stat Learning, Vicosa, MG, Brazil
基金
巴西圣保罗研究基金会;
关键词
D O I
10.1002/csc2.20488
中图分类号
S3 [农学(农艺学)];
学科分类号
0901 ;
摘要
Genomic-wide selection (GWS) consists of the use of a large number of molecular markers for the prediction of genetic values and has been shown to be highly relevant for genetic improvement. The objective of this work was to evaluate and compare the predictive performance of statistical (ridge regression-best linear unbiased predictor [RR-BLUP] and BayesB) and machine learning methods through GWS in simulated populations with traits presenting different levels of heritability and quantitative trait loci (QTL) numbers in the presence of dominant and epistatic effects. The simulated genome of population F-2 was formed by 1,000 individuals and genotyped with 2,010 single nucleotide polymorphism (SNP) markers. Twenty-six traits were simulated considering QTL numbers ranging from two to 88 and heritabilities of .3 and .6. The selective and predictive performances were evaluated using the multilayer perceptron (MLP), radial basis function (RBF), decision trees (DT), bagging (BA), random forest (RF), and boosting (BO) machine learning models and the classical RR-BLUP and BayesB methods. A high effect of heritability was observed for the results of selective accuracy when compared to the increased QTL number. In addition, the selective accuracy based on the number of QTL demonstrates that the application of alternative machine learning models, such as RBF, BA, BO, and RF, can be suitable for the analysis according to QTL number. Machine learning methods are powerful tools for predicting genetic values with epistatic gene control in traits with different degrees of heritability and different numbers of controlling genes.
引用
收藏
页码:1890 / 1902
页数:13
相关论文
共 50 条
  • [21] Prediction of Intrinsically Disordered Proteins Using Machine Learning Based on Low Complexity Methods
    Zeng, Xingming
    Liu, Haiyuan
    He, Hao
    ALGORITHMS, 2022, 15 (03)
  • [22] Evaluation and Optimization Methods for Applicability Domain Methods and Their Hyperparameters, Considering the Prediction Performance of Machine Learning Models
    Kaneko, Hiromasa
    ACS OMEGA, 2024, 9 (10): : 11453 - 11458
  • [23] Comparison of machine learning methods for ground settlement prediction with different tunneling datasets
    Tang, Libin
    Na, SeonHong
    JOURNAL OF ROCK MECHANICS AND GEOTECHNICAL ENGINEERING, 2021, 13 (06) : 1274 - 1289
  • [24] A comparative study of different machine learning methods for reservoir landslide displacement prediction
    Wang, Yankun
    Tang, Huiming
    Huang, Jinsong
    Wen, Tao
    Ma, Junwei
    Zhang, Junrong
    ENGINEERING GEOLOGY, 2022, 298
  • [25] Statistical and machine learning methods applied to the prediction of different tropical rainfall types
    Wang, Jiayi
    Wong, Raymond K. W.
    Jun, Mikyoung
    Schumacher, Courtney
    Saravanan, R.
    Sun, Chunmei
    ENVIRONMENTAL RESEARCH COMMUNICATIONS, 2021, 3 (11):
  • [26] Flood susceptible prediction through the use of geospatial variables and machine learning methods
    Gharakhanlou, Navid Mahdizadeh
    Perez, Liliana
    JOURNAL OF HYDROLOGY, 2023, 617
  • [27] Strength prediction of circular CFST columns through advanced machine learning methods
    Hou, Chao
    Zhou, Xiao-Guang
    JOURNAL OF BUILDING ENGINEERING, 2022, 51
  • [28] Comparing machine-learning models of different levels of complexity for crop protection: A look into the complexity-accuracy tradeoff
    Gauriau, Olivier
    Galarraga, Luis
    Brun, Francois
    Termier, Alexandre
    Davadan, Loic
    Joudelat, Francois
    SMART AGRICULTURAL TECHNOLOGY, 2024, 7
  • [29] Application of various machine learning architectures for crash prediction, considering different depths and processing layers
    Rezapour, Mahdi
    Ksaibati, Khaled
    ENGINEERING REPORTS, 2020, 2 (08)
  • [30] Prediction of low-density lipoprotein cholesterol levels using machine learning methods
    Kim, Yoori
    Lee, Won Kyung
    Lee, Woojoo
    LABORATORY MEDICINE, 2024, 55 (04) : 471 - 484