Improving Genomic Prediction with Machine Learning Incorporating TPE for Hyperparameters Optimization

Cited by: 11
Authors
Liang, Mang [1]
An, Bingxing [1]
Li, Keanning [1]
Du, Lili [1]
Deng, Tianyu [1]
Cao, Sheng [1]
Du, Yueying [1]
Xu, Lingyang [1]
Gao, Xue [1]
Zhang, Lupei [1]
Li, Junya [1]
Gao, Huijiang [1]
Affiliation
[1] Chinese Acad Agr Sci, Inst Anim Sci, Beijing 100193, Peoples R China
Source
BIOLOGY-BASEL | 2022, Vol. 11, Issue 11
Keywords
hyperparameters optimization; tree-structured Parzen estimator; genomic prediction; machine learning; SELECTION; ACCURACY; WHEAT; TOOL;
DOI
10.3390/biology11111647
Chinese Library Classification
Q [Biological Sciences];
Subject classification codes
07 ; 0710 ; 09 ;
Abstract
Simple Summary: Machine learning has become a crucial tool for genomic prediction. However, the complicated process of tuning hyperparameters has greatly hindered its application in actual breeding programs, especially for people without experience in tuning hyperparameters. In this study, we applied a tree-structured Parzen estimator (TPE) to tune the hyperparameters of machine learning methods. Overall, kernel ridge regression (KRR) combined with TPE achieved the highest prediction accuracy in the simulation and real datasets.
Owing to its excellent prediction ability, machine learning has been considered the most powerful tool for analyzing high-throughput sequencing genome data. However, the sophisticated process of tuning hyperparameters greatly impedes the wider application of machine learning in animal and plant breeding programs. Therefore, we integrated an automatic hyperparameter tuning algorithm, the tree-structured Parzen estimator (TPE), with machine learning to simplify its use for genomic prediction (GP). In this study, we applied TPE to optimize the hyperparameters of kernel ridge regression (KRR) and support vector regression (SVR). To evaluate the performance of TPE, we compared the prediction accuracy of KRR-TPE and SVR-TPE with that of genomic best linear unbiased prediction (GBLUP) and of KRR-RS, KRR-Grid, SVR-RS, and SVR-Grid, which tune the hyperparameters of KRR and SVR by random search (RS) and grid search (Grid), in a simulation dataset and in real datasets. The results indicated that KRR-TPE achieved the highest prediction ability across all populations and was the most convenient to use. In particular, for the Chinese Simmental beef cattle and Loblolly pine populations, the prediction accuracy of KRR-TPE showed average improvements of 8.73% and 6.08%, respectively, compared with GBLUP. Our study will promote the application of machine learning in GP and further accelerate breeding progress.
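To make the idea of TPE-based tuning concrete, the following is a minimal sketch, not the authors' implementation. It tunes a single hyperparameter (here imagined as log10 of the KRR regularization strength) using only the Python standard library. The objective `toy_cv_error` is a hypothetical stand-in for a cross-validated prediction error; the function names, the fixed kernel bandwidth, and all default settings are assumptions made for illustration. The core TPE step is shown: split past trials into a "good" and a "bad" group by a quantile `gamma`, model each group with a Parzen (Gaussian kernel) density, and evaluate next the candidate that maximizes the density ratio l(x)/g(x).

```python
import math
import random


def toy_cv_error(log_alpha):
    """Hypothetical stand-in for the cross-validated error of a model as a
    function of log10(alpha); a real study would fit and score the model here."""
    return (log_alpha + 1.5) ** 2 + 0.1 * math.sin(5 * log_alpha)


def parzen_density(x, points, bandwidth):
    """Parzen (Gaussian kernel) density estimate at x from observed points."""
    norm = len(points) * bandwidth * math.sqrt(2 * math.pi)
    return sum(math.exp(-0.5 * ((x - p) / bandwidth) ** 2) for p in points) / norm


def tpe_minimize(objective, low, high, n_trials=60, n_startup=10,
                 gamma=0.25, n_candidates=24, seed=0):
    """Simplified one-dimensional TPE: returns (best_x, best_loss)."""
    rng = random.Random(seed)
    bandwidth = (high - low) / 10.0  # fixed bandwidth, an assumption of this sketch
    history = []                     # list of (x, loss) pairs

    for trial in range(n_trials):
        if trial < n_startup:
            # Warm-up phase: plain random search.
            x = rng.uniform(low, high)
        else:
            # Split trials into the best gamma fraction ("good") and the rest ("bad").
            ranked = sorted(history, key=lambda t: t[1])
            n_good = max(1, math.ceil(gamma * len(ranked)))
            good = [p for p, _ in ranked[:n_good]]
            bad = [p for p, _ in ranked[n_good:]]

            # Draw candidates from l(x), the density of the good group.
            candidates = []
            for _ in range(n_candidates):
                c = rng.gauss(rng.choice(good), bandwidth)
                candidates.append(min(high, max(low, c)))

            # Keep the candidate with the largest ratio l(x) / g(x).
            x = max(
                candidates,
                key=lambda c: parzen_density(c, good, bandwidth)
                / (parzen_density(c, bad, bandwidth) + 1e-12),
            )
        history.append((x, objective(x)))

    return min(history, key=lambda t: t[1])


best_log_alpha, best_error = tpe_minimize(toy_cv_error, low=-4.0, high=2.0)
print(f"best log10(alpha) = {best_log_alpha:.3f}, error = {best_error:.4f}")
```

In practice one would likely rely on an established TPE implementation rather than this sketch, and the objective would be a cross-validated accuracy of KRR or SVR on genotype and phenotype data. Grid search and random search differ only in how the next trial point is chosen: they ignore the history of earlier trials, whereas TPE uses it to concentrate evaluations in promising regions.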
Pages: 13