A Stacking Ensemble Learning Framework for Genomic Prediction

被引:39
|
作者
Liang, Mang [1 ]
Chang, Tianpeng [1 ]
An, Bingxing [1 ]
Duan, Xinghai [1 ]
Du, Lili [1 ]
Wang, Xiaoqiao [1 ]
Miao, Jian [1 ]
Xu, Lingyang [1 ]
Gao, Xue [1 ]
Zhang, Lupei [1 ]
Li, Junya [1 ]
Gao, Huijiang [1 ]
机构
[1] Chinese Acad Agr Sci, Inst Anim Sci, Beijing, Peoples R China
基金
中国国家自然科学基金;
关键词
ensemble learning; stacking; genomic prediction; machine learning; prediction accuracy; SELECTION; ACCURACY;
D O I
10.3389/fgene.2021.600040
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
Machine learning (ML) is perhaps the most useful tool for the interpretation of large genomic datasets. However, the performance of a single machine learning method in genomic selection (GS) is currently unsatisfactory. To improve the genomic predictions, we constructed a stacking ensemble learning framework (SELF), integrating three machine learning methods, to predict genomic estimated breeding values (GEBVs). The present study evaluated the prediction ability of SELF by analyzing three real datasets, with different genetic architecture; comparing the prediction accuracy of SELF, base learners, genomic best linear unbiased prediction (GBLUP) and BayesB. For each trait, SELF performed better than base learners, which included support vector regression (SVR), kernel ridge regression (KRR) and elastic net (ENET). The prediction accuracy of SELF was, on average, 7.70% higher than GBLUP in three datasets. Except for the milk fat percentage (MFP) traits, of the German Holstein dairy cattle dataset, SELF was more robust than BayesB in all remaining traits. Therefore, we believed that SEFL has the potential to be promoted to estimate GEBVs in other animals and plants.
引用
收藏
页数:9
相关论文
共 50 条
  • [1] Enhancing genomic prediction with Stacking Ensemble Learning in Arabica Coffee
    Nascimento, Moyses
    Nascimento, Ana Carolina Campana
    Azevedo, Camila Ferreira
    de Oliveira, Antonio Carlos Baiao
    Caixeta, Eveline Teixeira
    Jarquin, Diego
    [J]. FRONTIERS IN PLANT SCIENCE, 2024, 15
  • [2] SAPPHIRE: A stacking-based ensemble learning framework for accurate prediction of thermophilic proteins
    Charoenkwan, Phasit
    Schaduangrat, Nalini
    Moni, Mohammad Ali
    Lio, Pietro
    Manavalan, Balachandran
    Shoombuatong, Watshara
    [J]. COMPUTERS IN BIOLOGY AND MEDICINE, 2022, 146
  • [3] STALLION: a stacking-based ensemble learning framework for prokaryotic lysine acetylation site prediction
    Basith, Shaherin
    Lee, Gwang
    Manavalan, Balachandran
    [J]. BRIEFINGS IN BIOINFORMATICS, 2022, 23 (01)
  • [4] SCORPION is a stacking-based ensemble learning framework for accurate prediction of phage virion proteins
    Saeed Ahmad
    Phasit Charoenkwan
    Julian M. W. Quinn
    Mohammad Ali Moni
    Md Mehedi Hasan
    Pietro Lio’
    Watshara Shoombuatong
    [J]. Scientific Reports, 12 (1)
  • [5] SCORPION is a stacking-based ensemble learning framework for accurate prediction of phage virion proteins
    Ahmad, Saeed
    Charoenkwan, Phasit
    Quinn, Julian M. W.
    Moni, Mohammad Ali
    Hasan, Md Mehedi
    Lio, Pietro
    Shoombuatong, Watshara
    [J]. SCIENTIFIC REPORTS, 2022, 12 (01):
  • [6] Spam comments prediction using stacking with ensemble learning
    Mehmood, Arif
    On, Byung-Won
    Lee, Ingyu
    Ashraf, Imran
    Choi, Gyu Sang
    [J]. 10TH INTERNATIONAL CONFERENCE ON COMPUTER AND ELECTRICAL ENGINEERING, 2018, 933
  • [7] A Stacking Ensemble Learning Model for Mobile Traffic Prediction
    Li, Zhigang
    Cai, Di
    Wang, Jialin
    Fu, Jingchang
    Qin, Linlin
    Fu, Duomin
    [J]. 2020 IEEE/CIC INTERNATIONAL CONFERENCE ON COMMUNICATIONS IN CHINA (ICCC), 2020, : 542 - 547
  • [8] MFDroid: A Stacking Ensemble Learning Framework for Android Malware Detection
    Wang, Xusheng
    Zhang, Linlin
    Zhao, Kai
    Ding, Xuhui
    Yu, Mingming
    [J]. SENSORS, 2022, 22 (07)
  • [9] Stacking based ensemble learning framework for identification of nitrotyrosine sites
    Parvez, Aiman
    Ali, Syed Danish
    Tayara, Hilal
    Chong, Kil To
    [J]. Computers in Biology and Medicine, 2024, 183
  • [10] Remaining useful life prediction based on stacking ensemble learning
    Han, Tengfei
    Li, Yaping
    [J]. Jisuanji Jicheng Zhizao Xitong/Computer Integrated Manufacturing Systems, CIMS, 2024, 30 (07): : 2464 - 2473