Genome-enabled prediction through machine learning methods considering different levels of trait complexity

被引:9
|
作者
Barbosa, Ivan de Paiva [1 ]
da Silva, Michele Jorge [1 ]
da Costa, Weverton Gomes [1 ]
de Castro Sant'Anna, Isabela [2 ]
Nascimento, Moyses [3 ]
Cruz, Cosme Damiao [1 ]
机构
[1] Fed Univ Vicosa UFV, Dept Gen Biol, Bioinformat Lab, Vicosa, MG, Brazil
[2] Agron Inst IAC, Rubber Tree & Agroforestry Ctr, Votuporana, SP, Brazil
[3] Fed Univ Vicosa UFV, Dept Stat, Lab Computat Intelligence & Stat Learning, Vicosa, MG, Brazil
基金
巴西圣保罗研究基金会;
关键词
D O I
10.1002/csc2.20488
中图分类号
S3 [农学(农艺学)];
学科分类号
0901 ;
摘要
Genomic-wide selection (GWS) consists of the use of a large number of molecular markers for the prediction of genetic values and has been shown to be highly relevant for genetic improvement. The objective of this work was to evaluate and compare the predictive performance of statistical (ridge regression-best linear unbiased predictor [RR-BLUP] and BayesB) and machine learning methods through GWS in simulated populations with traits presenting different levels of heritability and quantitative trait loci (QTL) numbers in the presence of dominant and epistatic effects. The simulated genome of population F-2 was formed by 1,000 individuals and genotyped with 2,010 single nucleotide polymorphism (SNP) markers. Twenty-six traits were simulated considering QTL numbers ranging from two to 88 and heritabilities of .3 and .6. The selective and predictive performances were evaluated using the multilayer perceptron (MLP), radial basis function (RBF), decision trees (DT), bagging (BA), random forest (RF), and boosting (BO) machine learning models and the classical RR-BLUP and BayesB methods. A high effect of heritability was observed for the results of selective accuracy when compared to the increased QTL number. In addition, the selective accuracy based on the number of QTL demonstrates that the application of alternative machine learning models, such as RBF, BA, BO, and RF, can be suitable for the analysis according to QTL number. Machine learning methods are powerful tools for predicting genetic values with epistatic gene control in traits with different degrees of heritability and different numbers of controlling genes.
引用
收藏
页码:1890 / 1902
页数:13
相关论文
共 50 条
  • [31] Machine learning methods and predictive ability metrics for genome-wide prediction of complex traits
    Gonzalez-Recio, Oscar
    Rosa, Guilherme J. M.
    Gianola, Daniel
    LIVESTOCK SCIENCE, 2014, 166 : 217 - 231
  • [32] Evaluation of Genome-Enabled Selection for Bacterial Cold Water Disease Resistance Using Progeny Performance Data in Rainbow Trout: Insights on Genotyping Methods and Genomic Prediction Models
    Vallejo, Roger L.
    Leeds, Timothy D.
    Fragomeni, Breno O.
    Gao, Guangtu
    Hernandez, Alvaro G.
    Misztal, Ignacy
    Welch, Timothy J.
    Wiens, Gregory D.
    Palti, Yniv
    FRONTIERS IN GENETICS, 2016, 7
  • [33] Breast cancer prediction using different machine learning methods applying multi factors
    Nazari, Elham
    Naderi, Hamid
    Tabadkani, Mahla
    ArefNezhad, Reza
    Farzin, Amir Hossein
    Dashtiahangar, Mohammad
    Khazaei, Majid
    Ferns, Gordon A.
    Mehrabian, Amin
    Tabesh, Hamed
    Avan, Amir
    JOURNAL OF CANCER RESEARCH AND CLINICAL ONCOLOGY, 2023, 149 (19) : 17133 - 17146
  • [34] Large-scale prediction of activity cliffs using machine and deep learning methods of increasing complexity
    Shunsuke Tamura
    Tomoyuki Miyao
    Jürgen Bajorath
    Journal of Cheminformatics, 15
  • [35] Breast cancer prediction using different machine learning methods applying multi factors
    Elham Nazari
    Hamid Naderi
    Mahla Tabadkani
    Reza ArefNezhad
    Amir Hossein Farzin
    Mohammad Dashtiahangar
    Majid Khazaei
    Gordon A. Ferns
    Amin Mehrabian
    Hamed Tabesh
    Amir Avan
    Journal of Cancer Research and Clinical Oncology, 2023, 149 : 17133 - 17146
  • [36] Prediction of Maximum Oxygen Uptake with Different Machine Learning Methods by Using Submaximal Data
    Yildiz, Incilay
    Akay, M. Fatih
    2015 23RD SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE (SIU), 2015, : 184 - 187
  • [37] Large-scale prediction of activity cliffs using machine and deep learning methods of increasing complexity
    Tamura, Shunsuke
    Miyao, Tomoyuki
    Bajorath, Juergen
    JOURNAL OF CHEMINFORMATICS, 2023, 15 (01)
  • [38] Reservoir Landslide Susceptibility Prediction Considering Non-Landslide Sampling and Ensemble Machine Learning Methods
    Wang Y.
    Cao Y.
    Xu F.
    Zhou C.
    Yu L.
    Wu L.
    Wang Y.
    Yin K.
    Diqiu Kexue - Zhongguo Dizhi Daxue Xuebao/Earth Science - Journal of China University of Geosciences, 2024, 49 (05): : 1619 - 1635
  • [39] High-Dimensional Multi-trait GWAS By Reverse Prediction of Genotypes Using Machine Learning Methods
    Malik, Muhammad Ammar
    Ludl, Adriaan-Alexander
    Michoel, Tom
    COMPUTATIONAL INTELLIGENCE METHODS FOR BIOINFORMATICS AND BIOSTATISTICS, CIBB 2021, 2022, 13483 : 79 - 93
  • [40] Machine Learning Methods to Improve Crystallization through the Prediction of Solute-Solvent Interactions
    Kandaswamy, Aatish
    Schwaminger, Sebastian P.
    CRYSTALS, 2024, 14 (06)