TCNet: Multiscale Fusion of Transformer and CNN for Semantic Segmentation of Remote Sensing Images

被引:12
|
作者
Xiang, Xuyang [1 ]
Gong, Wenping [1 ]
Li, Shuailong [1 ]
Chen, Jun [2 ]
Ren, Tianhe [1 ]
机构
[1] China Univ Geosci, Fac Engn, Wuhan 430074, Peoples R China
[2] China Univ Geosci, Sch Automat, Wuhan 430074, Peoples R China
关键词
Convolutional Neural Network (CNN); feature fusion; remote sensing images; semantic segmentation; Transformer;
D O I
10.1109/JSTARS.2024.3349625
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Semantic segmentation of remote sensing images plays a critical role in areas such as urban change detection, environmental protection, and geohazard identification. Convolutional Neural Networks (CNNs) have been excessively employed for semantic segmentation over the past few years; however, a limitation of the CNN is that there exists a challenge in extracting the global context of remote sensing images, which is vital for semantic segmentation, due to the locality of the convolution operation. It is informed that the recently developed Transformer is equipped with powerful global modeling capabilities. A network called TCNet is proposed in this article, and a parallel-in-branch architecture of the Transformer and the CNN is adopted in the TCNet. As such, the TCNet takes advantage of both Transformer and CNN, and both global context and low-level spatial details could be captured in a much shallower manner. In addition, a novel fusion technique called Interactive Self-attention is advanced to fuse the multilevel features extracted from both branches. To bridge the semantic gap between regions, a skip connection module called Windowed Self-attention Gating is further developed and added to the progressive upsampling network. Experiments on three public datasets (i.e., Bijie Landslide Dataset, WHU Building Dataset, and Massachusetts Buildings Dataset) depict that TCNet yields superior performance over state-of-the-art models. The IoU values obtained by TCNet for these three datasets are 75.34% (ranked first among 10 models compared), 91.16% (ranked first among 13 models compared), and 76.21% (ranked first among 13 models compared), respectively.
引用
收藏
页码:3123 / 3136
页数:14
相关论文
共 50 条
  • [41] Multiscale feature fusion network for automatic port segmentation from remote sensing images
    Ju, Haoran
    Bi, Fukun
    Bian, Mingming
    Shi, Yinni
    JOURNAL OF APPLIED REMOTE SENSING, 2022, 16 (04)
  • [42] CIMFNet: Cross-Layer Interaction and Multiscale Fusion Network for Semantic Segmentation of High-Resolution Remote Sensing Images
    Zhou, Wujie
    Jin, Jianhui
    Lei, Jingsheng
    Yu, Lu
    IEEE JOURNAL OF SELECTED TOPICS IN SIGNAL PROCESSING, 2022, 16 (04) : 666 - 676
  • [43] CCTNet: CNN and Cross-Shaped Transformer Hybrid Network for Remote Sensing Image Semantic Segmentation
    Wu, Honglin
    Zeng, Zhaobin
    Huang, Peng
    Yu, Xinyu
    Zhang, Min
    IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2024, 17 : 19986 - 19997
  • [44] Multiscale Global Context Network for Semantic Segmentation of High-Resolution Remote Sensing Images
    Zeng, Qiaolin
    Zhou, Jingxiang
    Tao, Jinhua
    Chen, Liangfu
    Niu, Xuerui
    Zhang, Yumeng
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2024, 62 : 1 - 13
  • [45] Multi-scale attention fusion network for semantic segmentation of remote sensing images
    Wen, Zhiqiang
    Huang, Hongxu
    Liu, Shuai
    INTERNATIONAL JOURNAL OF REMOTE SENSING, 2023, 44 (24) : 7909 - 7926
  • [46] Dual-Path Feature Fusion Network for Semantic Segmentation of Remote Sensing Images
    Li, Boyang
    Zhang, Yu
    Zhang, Youmei
    Li, Bin
    Li, Zhenhao
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2024, 21 : 1 - 5
  • [47] A Novel Transformer Based Semantic Segmentation Scheme for Fine-Resolution Remote Sensing Images
    Wang, Libo
    Li, Rui
    Duan, Chenxi
    Zhang, Ce
    Meng, Xiaoliang
    Fang, Shenghui
    IEEE Geoscience and Remote Sensing Letters, 2022, 19
  • [48] Multiscale Adapter Based on SAM for Remote Sensing Semantic Segmentation
    Chen, Shanjuan
    Yu, Yunlong
    Li, Yingming
    Wang, Zhao
    Li, Xi
    Han, Jungong
    IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2025, 18 : 6806 - 6819
  • [49] A Novel Transformer Based Semantic Segmentation Scheme for Fine-Resolution Remote Sensing Images
    Wang, Libo
    Li, Rui
    Duan, Chenxi
    Zhang, Ce
    Meng, Xiaoliang
    Fang, Shenghui
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2022, 19
  • [50] A Multitask CNN-Transformer Network for Semantic Change Detection From Bitemporal Remote Sensing Images
    Liu, Wei
    Kang, Ziwen
    Liu, Jiawei
    Lin, Yiyuan
    Yu, Yongtao
    Li, Jonathan
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2024, 62