Machine learning techniques and interpretability for maize yield estimation using Time-Series images of MODIS and Multi-Source data

被引:0
|
作者
Lyu, Yujiao [1 ,5 ]
Wang, Pengxin [1 ,4 ]
Bai, Xueyuan [2 ]
Li, Xuecao [3 ]
Ye, Xin [1 ]
Hu, Yuchen [1 ,4 ]
Zhang, Jie [1 ,4 ,5 ]
机构
[1] China Agr Univ, Coll Informat & Elect Engn, Beijing 100083, Peoples R China
[2] China Agr Univ, Coll Resources & Environm, Beijing 100193, Peoples R China
[3] China Agr Univ, Coll Land Sci & Technol, Beijing 100193, Peoples R China
[4] Minist Agr & Rural Affairs, Key Lab Agr Machinery Monitoring & Big Data Applic, Beijing 100083, Peoples R China
[5] Natl Innovat Ctr Digital Agr Prod Circulat, Beijing 100083, Peoples R China
关键词
Yield estimation; Multi-Source data; Machine learning; Model interpretability; Remote Sensing; Regional Geo-Statistics (RGS) Method; WHEAT YIELD; SATELLITE; MANAGEMENT; NETWORK;
D O I
10.1016/j.compag.2024.109063
中图分类号
S [农业科学];
学科分类号
09 ;
摘要
Timely and accurate estimation of maize yield is crucial for ensuring food security. This study integrated multisource data (satellite, meteorological, and soil data) on Google Earth Engine (GEE) and used machine learning techniques to estimate summer maize yield across 469 counties in the Huang -Huai -Hai Plain of China from 2010 to 2020. A novel method for extracting features from remote sensing images was proposed called the Regional Geo-Statistics (RGS) Method. We compared its performance with the traditional county -level averages method and explained the impact and contributions of incorporating multi -source data on yield estimation models. Firstly, Enhanced Vegetation Index (EVI) and Near -Infrared Vegetation Reflectance (NIRv) were transformed into regional geostatistical vectors and county -level averages using GEE. Then, yield was estimated using LightGBM, RF and LASSO. The results highlighted the superiority of the RGS method, with LightGBM yielding the best model (R 2 = 0.55, RMSE = 852.92 kg/ha, NRMSE = 13.66 %). Further improvements were achieved by gradually adding meteorological and soil variables, with R 2 improvements of 0.03 and 0.20, respectively. The Comprehensive Factor Model (CFM), utilizing all data as features, achieved the best results (R 2 = 0.76, RMSE = 629.33 kg/ha, NRMSE = 10.08 %). The interpretability analysis based on CFM model underscored the significance of soil -related variables, which contributed significantly (45.38 %) alongside remote sensing variables, emphasizing the crucial role of soil variables in maize yield estimation. This study presents a versatile and reliable method for integrating multi -source data for maize yield estimation, supporting agricultural management and food security.
引用
收藏
页数:12
相关论文
共 50 条
  • [1] Dynamic Maize Yield Predictions Using Machine Learning on Multi-Source Data
    Croci, Michele
    Impollonia, Giorgio
    Meroni, Michele
    Amaducci, Stefano
    [J]. REMOTE SENSING, 2023, 15 (01)
  • [2] Maize yield estimation at province scale by interpolation of TM and MODIS time-series images
    Gu X.
    He X.
    Guo W.
    Huang W.
    Yan R.
    [J]. Nongye Gongcheng Xuebao/Transactions of the Chinese Society of Agricultural Engineering, 2010, 26 (SUPPL. 2): : 53 - 58
  • [3] Machine Learning Regression Techniques for the Silage Maize Yield Prediction Using Time-Series Images of Landsat 8 OLI
    Aghighi, Hossein
    Azadbakht, Mohsen
    Ashourloo, Davoud
    Shahrabi, Hamid Salehi
    Radiom, Soheil
    [J]. IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2018, 11 (12) : 4563 - 4577
  • [4] Coffee-Yield Estimation Using High-Resolution Time-Series Satellite Images and Machine Learning
    Martello, Mauricio
    Molin, Jose Paulo
    Wei, Marcelo Chan Fu
    Canal Filho, Ricardo
    Nicoletti, Joao Vitor Moreira
    [J]. AGRIENGINEERING, 2022, 4 (04): : 888 - 902
  • [5] CROP YIELD ESTIMATION USING MULTI-SOURCE SATELLITE IMAGE SERIES AND DEEP LEARNING
    Ghazaryan, Gohar
    Skakun, Sergii
    Koenig, Simon
    Rezaei, Ehsan Eyshi
    Siebert, Stefan
    Dubovyk, Olena
    [J]. IGARSS 2020 - 2020 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM, 2020, : 5163 - 5166
  • [6] Predicting China's Maize Yield Using Multi-Source Datasets and Machine Learning Algorithms
    Miao, Lijuan
    Zou, Yangfeng
    Cui, Xuefeng
    Kattel, Giri Raj
    Shang, Yi
    Zhu, Jingwen
    [J]. REMOTE SENSING, 2024, 16 (13)
  • [7] Integrated UAV-Based Multi-Source Data for Predicting Maize Grain Yield Using Machine Learning Approaches
    Guo, Yahui
    Zhang, Xuan
    Chen, Shouzhi
    Wang, Hanxi
    Jayavelu, Senthilnath
    Cammarano, Davide
    Fu, Yongshuo
    [J]. REMOTE SENSING, 2022, 14 (24)
  • [8] Machine Learning Techniques for Fine Dead Fuel Load Estimation Using Multi-Source Remote Sensing Data
    D'Este, Marina
    Elia, Mario
    Giannico, Vincenzo
    Spano, Giuseppina
    Lafortezza, Raffaele
    Sanesi, Giovanni
    [J]. REMOTE SENSING, 2021, 13 (09)
  • [9] Semi-supervised Meta-learning for Multi-source Heterogeneity in Time-series Data
    Zhang, Lida
    Mortazavi, Bobak J.
    [J]. MACHINE LEARNING FOR HEALTHCARE CONFERENCE, VOL 219, 2023, 219
  • [10] Crop Yield Estimation Using Time-Series MODIS Data and the Effects of Cropland Masks in Ontario, Canada
    Liu, Jiangui
    Shang, Jiali
    Qian, Budong
    Huffman, Ted
    Zhang, Yinsuo
    Dong, Taifeng
    Jing, Qi
    Martin, Tim
    [J]. REMOTE SENSING, 2019, 11 (20)