Using machine learning to generate an open-access cropland map from satellite images time series in the Indian Himalayan region

被引:0
|
作者
Li, Danya [1 ,2 ]
Gajardo, Joaquin [1 ]
Volpi, Michele [3 ,4 ]
Defraeye, Thijs [1 ]
机构
[1] Empa, Swiss Fed Labs Mat Sci & Technol, Lab Biomimet Membranes & Text, Lerchenfeldstr 5, CH-9014 St Gallen, Switzerland
[2] Ecole Polytech Fed Lausanne, Lausanne, Switzerland
[3] Swiss Fed Inst Technol, Swiss Data Sci Ctr, Zurich, Switzerland
[4] Ecole Polytech Fed Lausanne, Lausanne, Switzerland
关键词
Cropland mapping; Smallholders; Remote sensing; High-altitude region; Random forest; Feature engineering; Google earth engine; Sentinel-2; EXTENT;
D O I
10.1016/j.rsase.2023.101057
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
Crop maps are crucial for agricultural monitoring and food management and can additionally support domain-specific applications, such as setting cold supply chain infrastructure in developing countries. Machine learning (ML) models, combined with freely-available satellite imagery, can be used to produce cost-effective and high spatial-resolution crop maps. However, accessing ground truth data for supervised learning is especially challenging in developing countries due to factors such as smallholding and fragmented geography, which often results in a lack of crop type maps or even reliable cropland maps. Our area of interest for this study lies in Himachal Pradesh, India, where we aim at producing an open-access binary cropland map at 10-m resolution for the Kullu, Shimla, and Mandi districts. To this end, we developed an ML pipeline that relies on Sen-tinel-2 satellite images time series. We investigated two pixel-based supervised classifiers, sup-port vector machines (SVM) and random forest (RF), which are used to classify per-pixel time series for binary cropland mapping. The ground truth data used for training, validation and testing was manually annotated from a combination of field survey reference points and visual interpretation of very high resolution (VHR) imagery. We trained and validated the models via spatial cross-validation to account for local spatial autocorrelation and improve the generalization capability of the model. We tested the model on hold out test sets of each district, achieving an aver-age accuracy for the RF (our best model) of 87%. We noticed NIR band at the early and late stage of the apple harvest season (main crop in the region) to be of critical importance for the model. Finally, we used this model to generate a cropland map for three districts of Himachal Pradesh, spanning 14,600 km2, which improves the resolution and quality of existing public maps, and made the code open-source.
引用
收藏
页数:13
相关论文
共 50 条
  • [21] Assessment of Three Machine Learning Techniques with Open-Access Geographic Data for Forest Fire Susceptibility Monitoring-Evidence from Southern Ecuador
    Reyes-Bueno, Fabian
    Lojan-Cordova, Julia
    FORESTS, 2022, 13 (03):
  • [22] Estimation of correlation matrices from limited time series data using machine learning
    Easaw, Nikhil
    Lee, Woo Seok
    Lohiya, Prashant Singh
    Jalan, Sarika
    Pradhan, Priodyuti
    JOURNAL OF COMPUTATIONAL SCIENCE, 2023, 71
  • [23] DEM generation using point cloud from ICESat and high-resolution satellite stereo images for Indian region
    Kaitheri, Athul
    Ramiya, Anandakumar M.
    IMAGE AND SIGNAL PROCESSING FOR REMOTE SENSING XXV, 2019, 11155
  • [24] Land use land cover mapping and snow cover detection in Himalayan region using machine learning and multispectral Sentinel-2 satellite imagery
    Saini R.
    Singh S.
    International Journal of Information Technology, 2024, 16 (2) : 675 - 686
  • [25] Advancing food security: Rice yield estimation framework using time-series satellite data & machine learning
    Tiwari, Varun
    Thorp, Kelly
    Tulbure, Mirela G.
    Gray, Joshua
    Kamruzzaman, Mohammad
    Krupnik, Timothy J.
    Sankarasubramanian, A.
    Ardon, Marcelo
    PLOS ONE, 2024, 19 (12):
  • [26] A multimodality test outperforms three machine learning classifiers for identifying and mapping paddocks using time series satellite imagery
    O'Hara, Rob
    Zimmermann, Jesko
    Green, Stuart
    GEOCARTO INTERNATIONAL, 2022, 37 (25) : 9748 - 9766
  • [27] Correction to: Multitemporal time series analysis using machine learning models for ground deformation in the Erhai Region, China
    Yahui Guo
    Shunqiang Hu
    Wenxiang Wu
    Yuyi Wang
    J. Senthilnath
    Environmental Monitoring and Assessment, 2020, 192
  • [28] Using deep learning to map retrogressive thaw slumps in the Beiluhe region (Tibetan Plateau) from CubeSat images
    Huang, Lingcao
    Luo, Jing
    Lin, Zhanju
    Niu, Fujun
    Liu, Lin
    REMOTE SENSING OF ENVIRONMENT, 2020, 237 (237)
  • [29] Effectiveness of hybrid ensemble machine learning models for landslide susceptibility analysis: Evidence from Shimla district of North-west Indian Himalayan region
    Sharma, Aastha
    Sajjad, Haroon
    Rahaman, Md Hibjur
    Saha, Tamal Kanti
    Bhuyan, Nirsobha
    JOURNAL OF MOUNTAIN SCIENCE, 2024, 21 (07) : 2368 - 2393
  • [30] Forecasting fire risk with machine learning and dynamic information derived from satellite vegetation index time-series
    Michael, Yaron
    Helman, David
    Glickman, Oren
    Gabay, David
    Brenner, Steve
    Lensky, Itamar M.
    SCIENCE OF THE TOTAL ENVIRONMENT, 2021, 764