Improving Genomic Prediction with Machine Learning Incorporating TPE for Hyperparameters Optimization

被引:11
|
作者
Liang, Mang [1 ]
An, Bingxing [1 ]
Li, Keanning [1 ]
Du, Lili [1 ]
Deng, Tianyu [1 ]
Cao, Sheng [1 ]
Du, Yueying [1 ]
Xu, Lingyang [1 ]
Gao, Xue [1 ]
Zhang, Lupei [1 ]
Li, Junya [1 ]
Gao, Huijiang [1 ]
机构
[1] Chinese Acad Agr Sci, Inst Anim Sci, Beijing 100193, Peoples R China
来源
BIOLOGY-BASEL | 2022年 / 11卷 / 11期
关键词
hyperparameters optimization; tree-structured Parzen estimator; genomic prediction; machine learning; SELECTION; ACCURACY; WHEAT; TOOL;
D O I
10.3390/biology11111647
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Simple Summary Machine learning has been a crucial implement for genomic prediction. However, the complicated process of tuning hyperparameters tremendously hindered its application in actual breeding programs, especially for people without experience tuning hyperparameters. In this study, we applied a tree-structured Parzen estimator (TPE) to tune the hyperparameters of machine learning methods. Overall, incorporating kernel ridge regression (KRR) with TPE achieved the highest prediction accuracy in simulation and real datasets. Depending on excellent prediction ability, machine learning has been considered the most powerful implement to analyze high-throughput sequencing genome data. However, the sophisticated process of tuning hyperparameters tremendously impedes the wider application of machine learning in animal and plant breeding programs. Therefore, we integrated an automatic tuning hyperparameters algorithm, tree-structured Parzen estimator (TPE), with machine learning to simplify the process of using machine learning for genomic prediction. In this study, we applied TPE to optimize the hyperparameters of Kernel ridge regression (KRR) and support vector regression (SVR). To evaluate the performance of TPE, we compared the prediction accuracy of KRR-TPE and SVR-TPE with the genomic best linear unbiased prediction (GBLUP) and KRR-RS, KRR-Grid, SVR-RS, and SVR-Grid, which tuned the hyperparameters of KRR and SVR by using random search (RS) and grid search (Gird) in a simulation dataset and the real datasets. The results indicated that KRR-TPE achieved the most powerful prediction ability considering all populations and was the most convenient. Especially for the Chinese Simmental beef cattle and Loblolly pine populations, the prediction accuracy of KRR-TPE had an 8.73% and 6.08% average improvement compared with GBLUP, respectively. Our study will greatly promote the application of machine learning in GP and further accelerate breeding progress.
引用
收藏
页数:13
相关论文
共 50 条
  • [41] An Effective Classification of DDoS Attacks in a Distributed Network by Adopting Hierarchical Machine Learning and Hyperparameters Optimization Techniques
    Dasari, Sandeep
    Kaluri, Rajesh
    IEEE ACCESS, 2024, 12 : 10834 - 10845
  • [42] Incorporating Physical Models for Dynamic Stall Prediction Based on Machine Learning
    Wang, Xu
    Kou, Jiaqing
    Zhang, Weiwei
    Liu, Zhitao
    AIAA JOURNAL, 2022, 60 (07) : 4428 - 4439
  • [43] Movement Optimization for a Cyborg Cockroach in a Bounded Space Incorporating Machine Learning
    Ariyanto, Mochammad
    Refat, Chowdhury Mohammad Masum
    Hirao, Kazuyoshi
    Morishima, Keisuke
    CYBORG AND BIONIC SYSTEMS, 2023, 4
  • [44] Incorporating Machine Learning Methods for Predictive Maintenance and Fuzzy Inventory Optimization
    Shobana, S.
    Wavare, Mahesh Sahebrao
    Kalaiarasi, K.
    Bhaskar, T.
    Anand, M. Clement Joe
    Sindhuja, N.
    INTELLIGENT AND FUZZY SYSTEMS, VOL 2, INFUS 2024, 2024, 1089 : 666 - 678
  • [45] Heuristic hyperparameter optimization of deep learning models for genomic prediction
    Han, Junjie
    Gondro, Cedric
    Reid, Kenneth
    Steibel, Juan P.
    G3-GENES GENOMES GENETICS, 2021, 11 (07):
  • [46] Machine learning and the optimization of prediction-based policies
    Battiston, Pietro
    Gamba, Simona
    Santoro, Alessandro
    TECHNOLOGICAL FORECASTING AND SOCIAL CHANGE, 2024, 199
  • [47] Machine Learning for Complex EMI Prediction, Optimization and Localization
    Jin, Hang
    Zhang, Le
    Ma, Han-Zhi
    Yang, Si-Chen
    Yang, Xiao-Li
    Li, Er-Ping
    2017 IEEE ELECTRICAL DESIGN OF ADVANCED PACKAGING AND SYSTEMS SYMPOSIUM (EDAPS), 2017,
  • [48] Improving breast cancer diagnosis by incorporating raw ultrasound parameters into machine learning
    Baek, Jihye
    O'Connell, Avice M.
    Parker, Kevin J.
    MACHINE LEARNING-SCIENCE AND TECHNOLOGY, 2022, 3 (04):
  • [49] Machine learning to set hyperparameters for overlapping community detection algorithms
    Xiao, Chenglong
    Wang, Yajie
    Wang, Shanshan
    JOURNAL OF ENGINEERING-JOE, 2023, 2023 (08):
  • [50] Improving prediction of computational job execution times with machine learning
    Balis, Bartosz
    Lelek, Tomasz
    Bodera, Jakub
    Grabowski, Michal
    Grigoras, Costin
    CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2024, 36 (02):