Prediction of Multiple-Trait and Multiple-Environment Genomic Data Using Recommender Systems

被引:13
|
作者
Montesinos-Lopez, Osval A. [1 ]
Montesinos-Lopez, Abelardo [2 ]
Crossa, Jose [3 ]
Montesinos-Lopez, Jose C. [4 ]
Mota-Sanchez, David [5 ]
Estrada-Gonzalez, Fermin [1 ]
Gillberg, Jussi [6 ]
Singh, Ravi [3 ]
Mondal, Suchismita [3 ]
Juliana, Philomin [3 ]
机构
[1] Univ Colima, Fac Telemat, Colima 28040, Mexico
[2] Univ Guadalajara, CUCEI, Dept Matemat, Guadalajara 44430, Jalisco, Mexico
[3] Int Maize & Wheat Improvement Ctr CIMMYT, Apdo Postal 6-641, Mexico City 06600, DF, Mexico
[4] Ctr Invest Matemat CIMAT, Dept Estadist, Guanajuato 36240, Mexico
[5] Michigan State Univ, Dept Entomol, E Lansing, MI 48824 USA
[6] Aalto Univ, Dept Comp Sci, FI-00076 Aalto, Finland
来源
G3-GENES GENOMES GENETICS | 2018年 / 8卷 / 01期
关键词
genomic information; item-based collaborative filtering; matrix factorization; multi-trait; genotype; environment interaction; prediction accuracy; collaborative filtering; GenPred; Shared Data Resources; Genomic Selection;
D O I
10.1534/g3.117.300309
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
In genomic-enabled prediction, the task of improving the accuracy of the prediction of lines in environments is difficult because the available information is generally sparse and usually has low correlations between traits. In current genomic selection, although researchers have a large amount of information and appropriate statistical models to process it, there is still limited computing efficiency to do so. Although some statistical models are usually mathematically elegant, many of them are also computationally inefficient, and they are impractical for many traits, lines, environments, and years because they need to sample from huge normal multivariate distributions. For these reasons, this study explores two recommender systems: item-based collaborative filtering (IBCF) and the matrix factorization algorithm (MF) in the context of multiple traits and multiple environments. The IBCF and MF methods were compared with two conventional methods on simulated and real data. Results of the simulated and real data sets show that the IBCF technique was slightly better in terms of prediction accuracy than the two conventional methods and the MF method when the correlation was moderately high. The IBCF technique is very attractive because it produces good predictions when there is high correlation between items (environment-trait combinations) and its implementation is computationally feasible, which can be useful for plant breeders who deal with very large data sets.
引用
收藏
页码:131 / 147
页数:17
相关论文
共 50 条
  • [31] Accuracy of genomic predictions using single and multiple-trait machine learning methods in Canadian beef cattle population
    Li, Hongwei
    Wang, Yining
    Vinsky, Michael
    Valente, Tiago
    Basarab, John A.
    Li, Changxi
    JOURNAL OF ANIMAL SCIENCE, 2024, 102 : 33 - 33
  • [32] MULTIPLE-TRAIT COEVOLUTION AND ENVIRONMENTAL GRADIENTS IN GUPPIES
    ENDLER, JA
    TRENDS IN ECOLOGY & EVOLUTION, 1995, 10 (01) : 22 - 29
  • [33] Accuracy of genomic predictions using single and multiple-trait machine learning methods in Canadian beef cattle population
    Li, Hongwei
    Wang, Yining
    Vinsky, Michael
    Valente, Tiago
    Basarab, John A.
    Li, Changxi
    JOURNAL OF ANIMAL SCIENCE, 2024, 102 : 33 - 34
  • [34] Approximation of reliability for multiple-trait animal models with missing data by canonical transformation
    Gengler, N
    Misztal, I
    JOURNAL OF DAIRY SCIENCE, 1996, 79 (02) : 317 - 328
  • [35] A mixed-model quantitative trait loci (QTL) analysis for multiple-environment trial data using environmental covariables for QTL-by-environment interactions, with an example in maize
    Boer, Martin P.
    Wright, Deanne
    Feng, Lizhi
    Podlich, Dean W.
    Luo, Lang
    Cooper, Mark
    van Eeuwijk, Fred A.
    GENETICS, 2007, 177 (03) : 1801 - 1813
  • [36] Advantages and limitations of multiple-trait genomic prediction for Fusarium head blight severity in hybrid wheat (Triticum aestivum L.)
    Albert W. Schulthess
    Yusheng Zhao
    C. Friedrich H. Longin
    Jochen C. Reif
    Theoretical and Applied Genetics, 2018, 131 : 685 - 701
  • [37] Multiple-trait BLUP in longitudinal data analysis on Jatropha curcas breeding for bioenergy
    Alves, Rodrigo Silva
    Teodoro, Paulo Eduardo
    Peixoto, Leonardo de Azevedo
    Santos de Carvalho Rocha, Joao Romero do Amaral
    Silva, Lidiane Aparecida
    Laviola, Bruno Galveas
    Vilela de Resende, Marcos Deon
    Bhering, Leonardo Lopes
    INDUSTRIAL CROPS AND PRODUCTS, 2019, 130 : 558 - 561
  • [38] Multiple-trait mapping of quantitative trait loci after selective genotyping using logistic regression
    Henshall, JM
    Goddard, ME
    GENETICS, 1999, 151 (02) : 885 - 894
  • [39] MULTIPLE-TRAIT ANALYSIS OF GENETIC-MAPPING FOR QUANTITATIVE TRAIT LOCI
    JIANG, CJ
    ZENG, ZB
    GENETICS, 1995, 140 (03) : 1111 - 1127
  • [40] Advantages and limitations of multiple-trait genomic prediction for Fusarium head blight severity in hybrid wheat (Triticum aestivum L.)
    Schulthess, Albert W.
    Zhao, Yusheng
    Longin, C. Friedrich H.
    Reif, Jochen C.
    THEORETICAL AND APPLIED GENETICS, 2018, 131 (03) : 685 - 701