Large-scale rice mapping under spatiotemporal heterogeneity using multi-temporal SAR images and explainable deep learning

被引:1
|
作者
Ge, Ji [1 ,2 ,3 ]
Zhang, Hong [1 ,2 ,3 ]
Zuo, Lijun [1 ,2 ,3 ]
Xu, Lu [1 ,2 ,3 ]
Jiang, Jingling [1 ,2 ,3 ]
Song, Mingyang [1 ,2 ,3 ]
Ding, Yinhaibin [1 ,2 ,3 ]
Xie, Yazhe [1 ,2 ,3 ]
Wu, Fan [1 ,2 ,3 ]
Wang, Chao [1 ,2 ,3 ]
Huang, Wenjiang [1 ,2 ,3 ]
机构
[1] Chinese Acad Sci, Aerosp Informat Res Inst, Key Lab Digital Earth Sci, Beijing 100094, Peoples R China
[2] Int Res Ctr Big Data Sustainable Dev Goals, Beijing 100094, Peoples R China
[3] Univ Chinese Acad Sci, Coll Resources & Environm, Beijing 100049, Peoples R China
关键词
Synthetic aperture radar; Rice mapping; Explainable deep learning; Feature importance; UNSUPERVISED DOMAIN ADAPTATION; MULTISOURCE; RESOLUTION; RADAR; MAP;
D O I
10.1016/j.isprsjprs.2024.12.021
中图分类号
P9 [自然地理学];
学科分类号
0705 ; 070501 ;
摘要
Timely and accurate mapping of rice cultivation distribution is crucial for ensuring global food security and achieving SDG2. From a global perspective, rice areas display high heterogeneity in spatial pattern and SAR timeseries characteristics, posing substantial challenges to deep learning (DL) models' performance, efficiency, and transferability. Moreover, due to their "black box" nature, DL often lack interpretability and credibility. To address these challenges, this paper constructs the first SAR rice dataset with spatiotemporal heterogeneity and proposes an explainable, lightweight model for rice area extraction, the eXplainable Mamba UNet (XM-UNet). The dataset is based on the 2023 multi-temporal Sentinel-1 data, covering diverse rice samples from the United States, Kenya, and Vietnam. A Temporal Feature Importance Explainer (TFI-Explainer) based on the Selective State Space Model is designed to enhance adaptability to the temporal heterogeneity of rice and the model's interpretability. This explainer, coupled with the DL model, provides interpretations of the importance of SAR temporal features and facilitates crucial time phase screening. To overcome the spatial heterogeneity of rice, an Attention Sandglass Layer (ASL) combining CNN and self-attention mechanisms is designed to enhance the local spatial feature extraction capabilities. Additionally, the Parallel Visual State Space Layer (PVSSL) utilizes 2D-Selective-Scan (SS2D) cross-scanning to capture the global spatial features of rice multi-directionally, significantly reducing computational complexity through parallelization. Experimental results demonstrate that the XM-UNet adapts well to the spatiotemporal heterogeneity of rice globally, with OA and F1-score of 94.26 % and 90.73 %, respectively. The model is extremely lightweight, with only 0.190 M parameters and 0.279 GFLOPs. Mamba's selective scanning facilitates feature screening, and its integration with CNN effectively balances rice's local and global spatial characteristics. The interpretability experiments prove that the explanations of the importance of the temporal features provided by the model are crucial for guiding rice distribution mapping and filling a gap in the related field. The code is available in https://github.com/SAR-RICE/XM-UNet.
引用
收藏
页码:395 / 412
页数:18
相关论文
共 50 条
  • [1] Large-Scale Rice Mapping Using Multi-Task Spatiotemporal Deep Learning and Sentinel-1 SAR Time Series
    Lin, Zhixian
    Zhong, Renhai
    Xiong, Xingguo
    Guo, Changqiang
    Xu, Jinfan
    Zhu, Yue
    Xu, Jialu
    Ying, Yibin
    Ting, K. C.
    Huang, Jingfeng
    Lin, Tao
    REMOTE SENSING, 2022, 14 (03)
  • [2] Mapping Paddy Rice Varieties Using Multi-temporal RADARSAT SAR Images
    Jang, Min-Won
    Kim, Yi-Hyun
    Park, No-Wook
    Hong, Suk-Young
    KOREAN JOURNAL OF REMOTE SENSING, 2012, 28 (06) : 653 - 660
  • [3] A Large-Scale Deep-Learning Approach for Multi-Temporal Aqua and Salt-Culture Mapping
    Diniz, Cesar
    Cortinhas, Luiz
    Pinheiro, Maria Luize
    Sadeck, Luis
    Fernandes Filho, Alexandre
    Baumann, Luis R. F.
    Adami, Marcos
    Souza-Filho, Pedro Walfir M.
    REMOTE SENSING, 2021, 13 (08)
  • [4] Multi-Temporal SAR Data Large-Scale Crop Mapping Based on U-Net Model
    Wei, Sisi
    Zhang, Hong
    Wang, Chao
    Wang, Yuanyuan
    Xu, Lu
    REMOTE SENSING, 2019, 11 (01)
  • [5] Mapping Large-Scale Forest Disturbance Types with Multi-Temporal CNN Framework
    Chen, Xi
    Zhao, Wenzhi
    Chen, Jiage
    Qu, Yang
    Wu, Dinghui
    Chen, Xuehong
    REMOTE SENSING, 2021, 13 (24)
  • [6] Towards a Multi-Temporal Deep Learning Approach for Mapping Urban Fabric Using Sentinel 2 Images
    El Mendili, Lamiae
    Puissant, Anne
    Chougrad, Mehdi
    Sebari, Imane
    REMOTE SENSING, 2020, 12 (03)
  • [7] Mapping paddy rice agriculture in southern China using multi-temporal MODIS images
    Xiao, XM
    Boles, S
    Liu, JY
    Zhuang, DF
    Frolking, S
    Li, CS
    Salas, W
    Moore, B
    REMOTE SENSING OF ENVIRONMENT, 2005, 95 (04) : 480 - 492
  • [8] Mapping paddy rice agriculture using multi-temporal FORMOSAT-2 images
    Shiu, Yi-Shiang
    Lin, Meng-Lung
    Chang, Kang-Tsung
    Chu, Tzu-How
    World Academy of Science, Engineering and Technology, 2010, 43 : 621 - 627
  • [9] Rice-Planted Area Mapping Using Small Sets of Multi-Temporal SAR Data
    Miyaoka, Kanae
    Maki, Masayasu
    Susaki, Junichi
    Homma, Koki
    Noda, Keigo
    Oki, Kazuo
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2013, 10 (06) : 1507 - 1511
  • [10] Monitoring terrain deformations using multi-temporal SAR images
    Ferretti, A
    Prati, C
    Rocca, F
    CEOS SAR WORKSHOP, 2000, 450 : 15 - 18