Feasibility of machine learning-based rice yield prediction in India at the district level using climate reanalysis and remote sensing data

Cited by: 0
Authors
De Clercq, Djavan [1]
Mahdi, Adam [1]
Affiliations
[1] Univ Oxford, Oxford, England
Keywords
Rice; Yield prediction; Machine learning; Climate reanalysis; Remote sensing; CROP YIELD; SATELLITE DATA; MODEL; DIFFUSION; HEALTH;
DOI
10.1016/j.agsy.2024.104099
Chinese Library Classification (CLC) number
S [Agricultural Sciences];
Discipline classification code
09;
Abstract
CONTEXT: Yield forecasting, the science of predicting agricultural productivity before the crop harvest occurs, helps a wide range of stakeholders make better decisions around agricultural planning.
OBJECTIVE: This study investigates whether machine learning-based yield prediction models can reliably predict Kharif season rice yields at the district level in India several months before the rice harvest takes place.
METHODOLOGY: The methodology involved training 19 machine learning models, such as CatBoost, LightGBM, Orthogonal Matching Pursuit, and Extremely Randomized Trees, on 20 years of climate, satellite, and rice yield data across 247 of India's rice-producing districts. In addition to model-building, a dynamic dashboard was built to understand how the reliability of rice yield predictions varies across districts.
RESULTS AND CONCLUSIONS: The results of the proof-of-concept machine learning pipeline demonstrated that rice yields can be predicted with a reasonable degree of accuracy, with out-of-sample R², MAE, and MAPE values of up to 0.82, 0.29, and 0.16, respectively. This exceeded the test set performance reported in related literature on rice yield modelling in other contexts and countries. In addition, SHAP value analysis was conducted to infer both the importance and directional impact of the climate and remote sensing variables included in the model. Important features driving rice yields included temperature, soil water volume, and leaf area index. Notably, higher temperatures in August correlate with increased rice yields, especially when the leaf area index in August is also high. Building on the results, a proof-of-concept dashboard was developed to allow users to easily explore which districts may experience a rise or fall in yield relative to the previous year. The dashboard shows that the model may perform better in some regions than in others. For instance, the absolute percentage error for predicted versus actual yields ranged from an average of 7.1 % in districts in Uttarakhand to an average of 14.7 % in Uttar Pradesh.
SIGNIFICANCE: This study underscores the potential for policymakers to consider scaling and operationalizing machine learning approaches to rice yield prediction in the context of agricultural early warning systems. Such systems could deliver timely crop yield forecasts on a rolling basis throughout the season, equipping agricultural decision-makers to make informed choices on irrigation scheduling, fertilizer application, and harvest planning to optimize crop output and resource use.
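The accuracy figures quoted in the abstract (R², MAE, MAPE, and the average absolute percentage error per state) follow standard definitions. The sketch below shows one way they could be computed. It is not the authors' pipeline: the function names, the state labels, and all yield values are hypothetical and chosen only for illustration.

```python
from collections import defaultdict
from statistics import mean


def r2(actual, predicted):
    """Coefficient of determination: 1 - SS_res / SS_tot."""
    avg = mean(actual)
    ss_res = sum((a - p) ** 2 for a, p in zip(actual, predicted))
    ss_tot = sum((a - avg) ** 2 for a in actual)
    return 1 - ss_res / ss_tot


def mae(actual, predicted):
    """Mean absolute error, in the same unit as the yields."""
    return mean(abs(a - p) for a, p in zip(actual, predicted))


def mape(actual, predicted):
    """Mean absolute percentage error as a fraction (0.16 means 16 %)."""
    return mean(abs(a - p) / abs(a) for a, p in zip(actual, predicted))


def mean_ape_by_group(groups, actual, predicted):
    """Average absolute percentage error (in percent) for each group label."""
    errors = defaultdict(list)
    for g, a, p in zip(groups, actual, predicted):
        errors[g].append(abs(a - p) / abs(a) * 100)
    return {g: mean(v) for g, v in errors.items()}


# Made-up district-level yields, NOT data from the study.
states = ["State A", "State A", "State B", "State B"]
actual = [2.0, 2.5, 3.0, 4.0]
predicted = [2.2, 2.4, 2.7, 4.4]

print("R2:  ", round(r2(actual, predicted), 3))
print("MAE: ", round(mae(actual, predicted), 3))
print("MAPE:", round(mape(actual, predicted), 3))
print("Mean APE by state (%):", mean_ape_by_group(states, actual, predicted))
```

Grouping the percentage error by state, as in the last function, is what makes regional differences in model reliability visible, which is the kind of comparison the abstract reports for Uttarakhand and Uttar Pradesh.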
Pages: 13