Improving Genomic Prediction with Machine Learning Incorporating TPE for Hyperparameters Optimization

Cited by: 11
Authors
Liang, Mang [1]
An, Bingxing [1]
Li, Keanning [1]
Du, Lili [1]
Deng, Tianyu [1]
Cao, Sheng [1]
Du, Yueying [1]
Xu, Lingyang [1]
Gao, Xue [1]
Zhang, Lupei [1]
Li, Junya [1]
Gao, Huijiang [1]
Affiliation
[1] Chinese Acad Agr Sci, Inst Anim Sci, Beijing 100193, Peoples R China
Source
BIOLOGY-BASEL | 2022 / Volume 11 / Issue 11
Keywords
hyperparameters optimization; tree-structured Parzen estimator; genomic prediction; machine learning; SELECTION; ACCURACY; WHEAT; TOOL;
DOI
10.3390/biology11111647
Chinese Library Classification
Q [Biological sciences];
Discipline classification codes
07; 0710; 09;
Abstract
Simple Summary: Machine learning has become a crucial tool for genomic prediction. However, the complicated process of tuning hyperparameters has greatly hindered its application in actual breeding programs, especially for people without experience in tuning hyperparameters. In this study, we applied a tree-structured Parzen estimator (TPE) to tune the hyperparameters of machine learning methods. Overall, combining kernel ridge regression (KRR) with TPE achieved the highest prediction accuracy in the simulated and real datasets.

Owing to its excellent prediction ability, machine learning has been considered the most powerful tool for analyzing high-throughput genome sequencing data. However, the sophisticated process of tuning hyperparameters greatly impedes the wider application of machine learning in animal and plant breeding programs. Therefore, we integrated an automatic hyperparameter tuning algorithm, the tree-structured Parzen estimator (TPE), with machine learning to simplify the use of machine learning for genomic prediction (GP). In this study, we applied TPE to optimize the hyperparameters of kernel ridge regression (KRR) and support vector regression (SVR). To evaluate the performance of TPE, we compared the prediction accuracy of KRR-TPE and SVR-TPE with that of genomic best linear unbiased prediction (GBLUP) and of KRR-RS, KRR-Grid, SVR-RS, and SVR-Grid, which tune the hyperparameters of KRR and SVR by random search (RS) and grid search (Grid), in a simulation dataset and in the real datasets. The results indicated that KRR-TPE achieved the strongest prediction ability across all populations and was the most convenient to use. In particular, for the Chinese Simmental beef cattle and Loblolly pine populations, the prediction accuracy of KRR-TPE showed average improvements of 8.73% and 6.08%, respectively, compared with GBLUP. Our study will greatly promote the application of machine learning in GP and further accelerate breeding progress.
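The abstract describes TPE only at a high level, so the following is a minimal sketch of the general idea rather than the authors' implementation. It assumes two KRR hyperparameters searched on a log scale (the names `log_alpha` and `log_gamma` are illustrative), and it replaces the real objective, which would be a cross-validated prediction error on genotype data, with a hypothetical toy function `toy_cv_error`. Past trials are split into a "good" and a "bad" group, a Parzen (Gaussian kernel) density is fitted to each, and the next point is the candidate with the highest good-to-bad density ratio. The fixed bandwidth and the per-dimension treatment are simplifying assumptions.

```python
import math
import random


def toy_cv_error(log_alpha, log_gamma):
    """Hypothetical stand-in for a cross-validated KRR prediction error.

    Smooth bowl whose minimum (0.0) lies at log_alpha = -2, log_gamma = -3.
    """
    return (log_alpha + 2.0) ** 2 + 0.5 * (log_gamma + 3.0) ** 2


def parzen_log_density(x, centers, bandwidth):
    """Log density of an equal-weight Gaussian mixture centred on `centers`."""
    total = sum(math.exp(-0.5 * ((x - c) / bandwidth) ** 2) for c in centers)
    norm = len(centers) * bandwidth * math.sqrt(2.0 * math.pi)
    return math.log(total / norm + 1e-300)  # small constant avoids log(0)


def tpe_suggest(trials, bounds, rng, gamma=0.25, n_candidates=24):
    """Propose one point by maximising l(x) / g(x) separately per dimension."""
    ranked = sorted(trials, key=lambda t: t[1])      # lowest error first
    n_good = max(1, math.ceil(gamma * len(ranked)))  # best fraction = "good"
    good, bad = ranked[:n_good], ranked[n_good:]
    point = []
    for d, (lo, hi) in enumerate(bounds):
        bw = (hi - lo) / 5.0  # assumed fixed bandwidth, not adaptive
        good_x = [t[0][d] for t in good]
        bad_x = [t[0][d] for t in bad]
        # Draw candidates from the "good" density l(x), clipped to the bounds.
        candidates = [
            min(hi, max(lo, rng.gauss(rng.choice(good_x), bw)))
            for _ in range(n_candidates)
        ]
        # Keep the candidate with the largest log l(x) - log g(x).
        best = max(
            candidates,
            key=lambda c: parzen_log_density(c, good_x, bw)
            - parzen_log_density(c, bad_x, bw),
        )
        point.append(best)
    return point


def tpe_minimize(objective, bounds, n_trials=60, n_startup=10, seed=0):
    """Random start-up trials, then TPE-guided trials; returns (best, trials)."""
    rng = random.Random(seed)
    n_startup = max(2, n_startup)  # both groups need at least one trial
    trials = []
    for i in range(n_trials):
        if i < n_startup:
            x = [rng.uniform(lo, hi) for lo, hi in bounds]
        else:
            x = tpe_suggest(trials, bounds, rng)
        trials.append((x, objective(*x)))
    return min(trials, key=lambda t: t[1]), trials


if __name__ == "__main__":
    search_space = [(-6.0, 2.0), (-6.0, 2.0)]  # assumed log10 ranges
    (best_x, best_err), _ = tpe_minimize(toy_cv_error, search_space)
    print("best point:", best_x, "error:", best_err)
```

In practice one would more likely rely on an existing TPE implementation, such as those in Hyperopt or Optuna, which use adaptive bandwidths and prior terms that this sketch omits. Grid search and random search, the baselines named in the abstract, differ only in how the next point is chosen: from a fixed lattice or uniformly at random, without using earlier results.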
Pages: 13