Feasibility of machine learning-based rice yield prediction in India at the district level using climate reanalysis and remote sensing data

Authors
De Clercq, Djavan [1]
Mahdi, Adam [1]
Affiliations
[1] Univ Oxford, Oxford, England
Keywords
Rice; Yield prediction; Machine learning; Climate reanalysis; Remote sensing; CROP YIELD; SATELLITE DATA; MODEL; DIFFUSION; HEALTH;
DOI
10.1016/j.agsy.2024.104099
Chinese Library Classification number
S [Agricultural Sciences];
Discipline classification code
09;
Abstract
CONTEXT: Yield forecasting, the science of predicting agricultural productivity before the crop harvest occurs, helps a wide range of stakeholders make better decisions around agricultural planning.
OBJECTIVE: This study investigates whether machine learning-based yield prediction models can capably predict Kharif season rice yields at the district level in India several months before the rice harvest takes place.
METHODOLOGY: The methodology involved training 19 machine learning models, including CatBoost, LightGBM, Orthogonal Matching Pursuit, and Extremely Randomized Trees, on 20 years of climate, satellite, and rice yield data across 247 of India's rice-producing districts. In addition to model-building, a dynamic dashboard was built to understand how the reliability of rice yield predictions varies across districts.
RESULTS AND CONCLUSIONS: The results of the proof-of-concept machine learning pipeline demonstrated that rice yields can be predicted with a reasonable degree of accuracy, with out-of-sample R², MAE, and MAPE of up to 0.82, 0.29, and 0.16 respectively. This exceeded the test-set performance reported in related literature on rice yield modelling in other contexts and countries. In addition, SHAP value analysis was conducted to infer both the importance and the directional impact of the climate and remote sensing variables included in the model. Important features driving rice yields included temperature, soil water volume, and leaf area index. In particular, higher temperatures in August correlate with increased rice yields, especially when the leaf area index in August is also high. Building on the results, a proof-of-concept dashboard was developed to allow users to easily explore which districts may experience a rise or fall in yield relative to the previous year. The dashboard shows that the model may perform better in some regions than in others. For instance, the absolute percentage error for predicted versus actual yields ranged from an average of 7.1 % in districts in Uttarakhand to an average of 14.7 % in Uttar Pradesh.
SIGNIFICANCE: This study underscores the potential for policymakers to consider scaling and operationalizing machine learning approaches to rice yield prediction in the context of agricultural early warning systems. Such systems could deliver timely crop yield forecasts on a rolling basis throughout the season, equipping agricultural decision-makers to make informed choices on irrigation scheduling, fertilizer application, and harvest planning to optimize crop output and resource use.
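The evaluation metrics named in the abstract (R², MAE, MAPE, and the average absolute percentage error per state) can be illustrated with a minimal sketch. This is not the authors' pipeline: the function names, the placeholder state labels, and the yield values below are hypothetical, and MAPE is expressed as a fraction to match the 0.16 figure reported above.

```python
from collections import defaultdict


def r2_score(actual, predicted):
    """Coefficient of determination: 1 - SS_res / SS_tot."""
    mean_actual = sum(actual) / len(actual)
    ss_res = sum((a - p) ** 2 for a, p in zip(actual, predicted))
    ss_tot = sum((a - mean_actual) ** 2 for a in actual)
    return 1.0 - ss_res / ss_tot


def mae(actual, predicted):
    """Mean absolute error, in the same units as the yields."""
    return sum(abs(a - p) for a, p in zip(actual, predicted)) / len(actual)


def mape(actual, predicted):
    """Mean absolute percentage error, returned as a fraction (0.16 = 16 %)."""
    return sum(abs(a - p) / abs(a) for a, p in zip(actual, predicted)) / len(actual)


def mean_ape_by_group(records):
    """Average absolute percentage error per group (e.g. per state)."""
    errors = defaultdict(list)
    for group, actual, predicted in records:
        errors[group].append(abs(actual - predicted) / abs(actual))
    return {group: sum(vals) / len(vals) for group, vals in errors.items()}


# Hypothetical held-out records: (state label, actual yield, predicted yield).
# These numbers are made up for illustration and are not from the study.
records = [
    ("State A", 2.0, 2.2),
    ("State A", 4.0, 3.6),
    ("State B", 2.5, 2.0),
    ("State B", 5.0, 5.5),
]
actual = [r[1] for r in records]
predicted = [r[2] for r in records]

overall = {
    "r2": r2_score(actual, predicted),
    "mae": mae(actual, predicted),
    "mape": mape(actual, predicted),
}
per_state = mean_ape_by_group(records)
```

Grouping the percentage errors by state, as `mean_ape_by_group` does, is one simple way to see the kind of regional variation in reliability that the abstract describes for Uttarakhand and Uttar Pradesh.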
Pages: 13