Using machine learning to generate an open-access cropland map from satellite images time series in the Indian Himalayan region

Cited by: 0

Authors:

Li, Danya ^{[1,2]}

Gajardo, Joaquin ^{[1]}

Volpi, Michele ^{[3,4]}

Defraeye, Thijs ^{[1]}

Affiliations:

[1] Empa, Swiss Fed Labs Mat Sci & Technol, Lab Biomimet Membranes & Text, Lerchenfeldstr 5, CH-9014 St Gallen, Switzerland

[2] Ecole Polytech Fed Lausanne, Lausanne, Switzerland

[3] Swiss Fed Inst Technol, Swiss Data Sci Ctr, Zurich, Switzerland

[4] Ecole Polytech Fed Lausanne, Lausanne, Switzerland

Source:

REMOTE SENSING APPLICATIONS-SOCIETY AND ENVIRONMENT | 2023 / Volume 32

Keywords:

Cropland mapping; Smallholders; Remote sensing; High-altitude region; Random forest; Feature engineering; Google Earth Engine; Sentinel-2; EXTENT

DOI:

10.1016/j.rsase.2023.101057

Chinese Library Classification (CLC) number:

X [Environmental Science, Safety Science];

Discipline classification code:

08; 0830

Abstract:

Crop maps are crucial for agricultural monitoring and food management and can additionally support domain-specific applications, such as setting up cold supply chain infrastructure in developing countries. Machine learning (ML) models, combined with freely available satellite imagery, can be used to produce cost-effective, high-spatial-resolution crop maps. However, accessing ground truth data for supervised learning is especially challenging in developing countries due to factors such as smallholding and fragmented geography, which often results in a lack of crop type maps or even reliable cropland maps. Our area of interest for this study lies in Himachal Pradesh, India, where we aim to produce an open-access binary cropland map at 10-m resolution for the Kullu, Shimla, and Mandi districts. To this end, we developed an ML pipeline that relies on Sentinel-2 satellite image time series. We investigated two pixel-based supervised classifiers, support vector machines (SVM) and random forest (RF), which are used to classify per-pixel time series for binary cropland mapping. The ground truth data used for training, validation, and testing were manually annotated from a combination of field survey reference points and visual interpretation of very high resolution (VHR) imagery. We trained and validated the models via spatial cross-validation to account for local spatial autocorrelation and improve the generalization capability of the model. We tested the model on held-out test sets for each district, achieving an average accuracy of 87% for the RF (our best model). We found the NIR band at the early and late stages of the apple harvest season (apple being the main crop in the region) to be of critical importance for the model. Finally, we used this model to generate a cropland map for three districts of Himachal Pradesh, spanning 14,600 km², which improves on the resolution and quality of existing public maps, and made the code open-source.
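The abstract mentions spatial cross-validation as a way to account for local spatial autocorrelation. The record does not describe the authors' exact scheme, so the following is only a minimal sketch of one common variant (block-wise hold-out), written with the Python standard library. The function name `spatial_block_folds`, the 20 m block size, and the toy coordinates are all hypothetical choices for illustration, not values from the paper.

```python
from collections import defaultdict


def spatial_block_folds(coords, block_size, n_folds):
    """Yield (train_idx, val_idx) pairs in which whole spatial blocks are held out.

    coords: list of (x, y) positions in map units (for example metres).
    block_size: side length of the square blocks (assumed value, same units).
    n_folds: number of folds; blocks are dealt to folds round-robin.
    """
    # Group sample indices by the grid block that contains them.
    blocks = defaultdict(list)
    for i, (x, y) in enumerate(coords):
        blocks[(int(x // block_size), int(y // block_size))].append(i)

    # Assign each block (not each pixel) to a fold, in a deterministic order.
    fold_of_block = {b: k % n_folds for k, b in enumerate(sorted(blocks))}

    for fold in range(n_folds):
        train, val = [], []
        for b, idx in blocks.items():
            (val if fold_of_block[b] == fold else train).extend(idx)
        yield sorted(train), sorted(val)


# Toy example (hypothetical data): 16 pixels on a 4 x 4 grid at 10 m spacing,
# grouped into 20 m blocks and split into 2 folds.
coords = [(10 * c, 10 * r) for r in range(4) for c in range(4)]
folds = list(spatial_block_folds(coords, block_size=20, n_folds=2))
```

Because neighbouring pixels fall into the same block, they are never split between training and validation, which is the property that distinguishes this from a random per-pixel split. A classifier such as a random forest would then be fitted on each `train` index set and scored on the matching `val` set.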


Pages: 13

