共 50 条
Bayesian model averaging to improve the yield prediction in wheat breeding trials
被引:26
|作者:
Fei, Shuaipeng
[1
,2
,3
]
Chen, Zhen
[1
]
Li, Lei
[2
]
Ma, Yuntao
[3
]
Xiao, Yonggui
[2
]
机构:
[1] Chinese Acad Agr Sci, Inst Farmland Irrigat, Xinxiang 453002, Peoples R China
[2] Chinese Acad Agr Sci, Inst Crop Sci, Natl Wheat Improvement Ctr, Beijing 100081, Peoples R China
[3] China Agr Univ, Coll Land Sci & Technol, Beijing 100193, Peoples R China
基金:
中国国家自然科学基金;
关键词:
Wheat yield;
Remote sensing;
Hyperspectral reflectance;
Bayesian model averaging;
Machine learning;
BIOMASS ESTIMATION;
RIDGE-REGRESSION;
ENSEMBLE;
OPTIMIZATION;
STABILITY;
SELECTION;
SYSTEM;
RGB;
D O I:
10.1016/j.agrformet.2022.109237
中图分类号:
S3 [农学(农艺学)];
学科分类号:
0901 ;
摘要:
Accurate pre-harvest prediction of wheat yield through secondary traits helps to facilitate plant breeding and reduce costs. Machine learning (ML) algorithms are increasingly applied to grain yield with remote sensing data. However, the performance of individual ML algorithms varies for different species in different environments due to different sources of uncertainty. This study proposed a novel wheat yield prediction framework based on canopy hyperspectral reflectance (350-2500 nm) and adopted the ensemble Bayesian model averaging (EBMA) method to improve model performance. To develop the yield prediction models, important bands extracted by the Boruta feature selection method were fed into four linear ML models and four nonlinear ML models. Meanwhile, Bayesian model averaging (BMA) weights obtained based on model cross-validation performance were used to combine the predictions of individual ML models. Compared to the best-performing individual model, the EBMA models obtained a weak accuracy improvement by integrating only the linear models or the nonlinear models. Additionally, the integration of two linear models and two non-linear models simultaneously was analyzed. Results indicate that most EBMA combinations of mixed linear and non-linear models achieved higher prediction accuracy than those integrating a single type of model and the best-performing individual model. The advantage of the EBMA method is that it produces a prediction distribution that reflects the un-certainty associated with deterministic predictions. With full consideration of the model diversity of ensemble members, the EBMA modeling framework provides an alternative method for predicting grain yield in plant breeding trials.
引用
收藏
页数:13
相关论文