Ensemble learning prediction of soybean yields in China based on meteorological data

Times cited: 13
Authors
Li, Qian-chuan [1]
Xu, Shi-wei [1, 2, 5]
Zhuang, Jia-yu [1, 5]
Liu, Jia-Jia [2]
Zhou, Yi [3]
Zhang, Ze-xi [4]
Affiliations
[1] Chinese Acad Agr Sci, Agr Informat Inst, Beijing 100081, Peoples R China
[2] Beijing Engn Res Ctr Agr Monitoring & Early Warnin, Beijing 100081, Peoples R China
[3] Chinese Acad Sci, Aerosp Informat Res Inst, Beijing 100094, Peoples R China
[4] Columbia Univ, Dept Math, New York, NY 10027 USA
[5] Minist Agr & Rural Affairs, Key Lab Agr Monitoring & Early Warning Technol, Beijing 100081, Peoples R China
Keywords
meteorological factors; ensemble learning; crop yield prediction; machine learning; county-level; CLIMATE DATA; WHEAT YIELD; CROP YIELD; TRENDS; TEMPERATURE; PATTERNS; RAINFALL
DOI
10.1016/j.jia.2023.02.011
Chinese Library Classification number
S [Agricultural Sciences]
Discipline classification code
09
Abstract
Accurate prediction of soybean yield is of great significance for agricultural production, monitoring, and early warning. Although previous studies have used machine learning algorithms to predict soybean yield from meteorological data, it is not clear how different models can be used to effectively separate the meteorological component of soybean yield from total soybean yield in various regions. In addition, the use of ensemble learning to combine the strengths of several machine learning algorithms and so improve prediction accuracy has not been studied in depth. This study analyzed daily meteorological data and soybean yield data from 173 county-level administrative regions and meteorological stations in two principal soybean planting areas in China (Northeast China and the Huang-Huai region), covering 34 years. Three machine learning algorithms (K-nearest neighbor, random forest, and support vector regression) were adopted as base models to establish a high-precision, highly reliable soybean meteorological yield prediction model within the stacking ensemble learning framework. The model's generalizability was further improved through 5-fold cross-validation, and the model was optimized by principal component analysis and hyperparameter optimization. The accuracy of the model was evaluated using five-year sliding prediction and four regression indicators across the 173 counties, which showed that the stacking model has higher accuracy and stronger robustness. The five-year sliding estimations of soybean yield from the stacking model in the 173 counties showed that the predictions reflect the spatiotemporal distribution of soybean yield in detail, with a mean absolute percentage error (MAPE) of less than 5%. The stacking prediction model of soybean meteorological yield provides a new approach for accurately predicting soybean yield.
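The evaluation named in the abstract can be illustrated with a minimal sketch. MAPE below follows its standard definition. The split generator is only one assumed reading of "five-year sliding prediction" (each of the last five years is predicted from all earlier years), since the abstract does not spell out the procedure. The function names, the year range, and the yield values are hypothetical and are not taken from the paper.

```python
def mape(actual, predicted):
    """Mean absolute percentage error, returned in percent."""
    if len(actual) != len(predicted) or len(actual) == 0:
        raise ValueError("inputs must be non-empty and of equal length")
    total = sum(abs((a - p) / a) for a, p in zip(actual, predicted))
    return 100.0 * total / len(actual)


def sliding_splits(years, n_test_years=5):
    """Yield (train_years, test_year) pairs for the last n_test_years.

    Assumption: every test year is predicted from all earlier years
    (an expanding training window). The paper may define it differently.
    """
    years = sorted(years)
    if len(years) <= n_test_years:
        raise ValueError("need more years than test years")
    for i in range(len(years) - n_test_years, len(years)):
        yield years[:i], years[i]


# Hypothetical county yields (kg/ha) and model predictions for five test years.
actual = [2000.0, 1800.0, 2200.0, 2100.0, 1900.0]
predicted = [2100.0, 1710.0, 2200.0, 2016.0, 1995.0]
print(f"MAPE = {mape(actual, predicted):.2f}%")  # prints "MAPE = 3.80%"

# A hypothetical 34-year record; the actual years are not given in the abstract.
for train_years, test_year in sliding_splits(range(1980, 2014)):
    print(test_year, len(train_years))
```

In this reading, a MAPE below 5% means the predictions deviate from the observed yields by less than 5% on average across the test years.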
Pages: 1909-1927
Number of pages: 19