Looking Outside the Window: Wide-Context Transformer for the Semantic Segmentation of High-Resolution Remote Sensing Images

被引:76
|
作者
Ding, Lei [1 ]
Lin, Dong [2 ,3 ]
Lin, Shaofu [4 ]
Zhang, Jing [5 ]
Cui, Xiaojie [6 ]
Wang, Yuebin [7 ]
Tang, Hao [8 ]
Bruzzone, Lorenzo [5 ]
机构
[1] PLA Strateg Force Informat Engn Univ, Zhengzhou 450001, Peoples R China
[2] Space Engn Univ, Beijing 102249, Peoples R China
[3] Xian Inst Surveying & Mapping, State Key Lab Geoinformat Engn, Xian 710054, Peoples R China
[4] Beijing Univ Technol, Fac Informat Technol, Beijing 100022, Peoples R China
[5] Univ Trento, Dept Informat Engn & Comp Sci, I-38123 Trento, Italy
[6] Beijing Inst Remote Sensing Informat, Beijing 100011, Peoples R China
[7] China Univ Geosci Beijing, Sch Land Sci & Technol, Beijing 100084, Peoples R China
[8] Swiss Fed Inst Technol, Dept Informat Technol & Elect Engn, CH-8092 Zurich, Switzerland
关键词
Transformers; Semantics; Image segmentation; Feature extraction; Task analysis; Convolutional neural networks; Context modeling; Convolutional neural network; remote sensing; semantic segmentation; vision transformer (ViT);
D O I
10.1109/TGRS.2022.3168697
中图分类号
P3 [地球物理学]; P59 [地球化学];
学科分类号
0708 ; 070902 ;
摘要
Long-range contextual information is crucial for the semantic segmentation of high-resolution (HR) remote sensing images (RSIs). However, image cropping operations, commonly used for training neural networks, limit the perception of long-range contexts in large RSIs. To overcome this limitation, we propose a wide-context network (WiCoNet) for the semantic segmentation of HR RSIs. Apart from extracting local features with a conventional convolutional neural network (CNN), the WiCoNet has an extra context branch to aggregate information from a larger image area. Moreover, we introduce a context transformer to embed contextual information from the context branch and selectively project it onto the local features. The context transformer extends the vision transformer, an emerging kind of neural networks, to model the dual-branch semantic correlations. It overcomes the locality limitation of CNNs and enables the WiCoNet to see the bigger picture before segmenting the land-cover/land-use (LCLU) classes. Ablation studies and comparative experiments conducted on several benchmark datasets demonstrate the effectiveness of the proposed method. In addition, we present a new Beijing Land-Use (BLU) dataset. This is a large-scale HR satellite dataset with high-quality and fine-grained reference labels, which can facilitate future studies in this field.
引用
收藏
页数:13
相关论文
共 50 条
  • [1] SEMANTIC SEGMENTATION OF HIGH-RESOLUTION REMOTE SENSING IMAGES USING AN IMPROVED TRANSFORMER
    Liu, Yuheng
    Mei, Shaohui
    Zhang, Shun
    Wang, Ye
    He, Mingyi
    Du, Qian
    [J]. 2022 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS 2022), 2022, : 3496 - 3499
  • [2] Multiscale Global Context Network for Semantic Segmentation of High-Resolution Remote Sensing Images
    Zeng, Qiaolin
    Zhou, Jingxiang
    Tao, Jinhua
    Chen, Liangfu
    Niu, Xuerui
    Zhang, Yumeng
    [J]. IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2024, 62
  • [3] HRCNet: High-Resolution Context Extraction Network for Semantic Segmentation of Remote Sensing Images
    Xu, Zhiyong
    Zhang, Weicun
    Zhang, Tianxiang
    Li, Jiangyun
    [J]. REMOTE SENSING, 2021, 13 (01) : 1 - 23
  • [4] Spatial-specific Transformer with involution for semantic segmentation of high-resolution remote sensing images
    Wu, Xinjia
    Zhang, Jing
    Li, Wensheng
    Li, Jiafeng
    Zhuo, Li
    Zhang, Jie
    [J]. INTERNATIONAL JOURNAL OF REMOTE SENSING, 2023, 44 (04) : 1280 - 1307
  • [5] HCANet: A Hierarchical Context Aggregation Network for Semantic Segmentation of High-Resolution Remote Sensing Images
    Bai, Haiwei
    Cheng, Jian
    Huang, Xia
    Liu, Siyu
    Deng, Changjian
    [J]. IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2022, 19
  • [6] Semantic Segmentation for High-Resolution Remote-Sensing Images via Dynamic Graph Context Reasoning
    Su, Yanzhou
    Cheng, Jian
    Wang, Wen
    Bai, Haiwei
    Liu, Haijun
    [J]. IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2022, 19
  • [7] UNeXt: An Efficient Network for the Semantic Segmentation of High-Resolution Remote Sensing Images
    Chang, Zhanyuan
    Xu, Mingyu
    Wei, Yuwen
    Lian, Jie
    Zhang, Chongming
    Li, Chuanjiang
    [J]. Sensors, 2024, 24 (20)
  • [8] Dual decoupling semantic segmentation model for high-resolution remote sensing images
    Liu S.
    Li X.
    Yu M.
    Xing G.
    [J]. Cehui Xuebao/Acta Geodaetica et Cartographica Sinica, 2023, 52 (04): : 638 - 647
  • [9] Edge Guidance Network for Semantic Segmentation of High-Resolution Remote Sensing Images
    Ni, Yue
    Liu, Jiahang
    Cui, Jian
    Yang, Yuze
    Wang, Xiaozhen
    [J]. IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2023, 16 : 9809 - 9822
  • [10] Dynamic High-Resolution Network for Semantic Segmentation in Remote-Sensing Images
    Guo, Shichen
    Yang, Qi
    Xiang, Shiming
    Wang, Pengfei
    Wang, Xuezhi
    [J]. REMOTE SENSING, 2023, 15 (09)