FEST: Feature Enhancement Swin Transformer for Remote Sensing Image Semantic Segmentation

被引:0
|
作者
Zhang, Ronghuan [1 ,2 ,3 ]
Zhao, Jing [1 ,2 ,3 ]
Li, Ming [4 ]
Zou, Qingzhi [1 ,2 ,3 ]
机构
[1] Qilu Univ Technol, Shandong Acad Sci, Shandong Comp Sci Ctr, Key Lab Comp Power Network & Informat Secur,Minis, Jinan, Peoples R China
[2] Qilu Univ Technol, Shandong Acad Sci, Fac Comp Sci & Technol, Shandong Engn Res Ctr Big Data Appl Technol, Jinan, Peoples R China
[3] Shandong Fundamental Res Ctr Comp Sci, Shandong Prov Key Lab Comp Networks, Jinan, Peoples R China
[4] Shandong Univ Tradit Chinese Med, Sch Intelligence & Informat Engn, Jinan, Peoples R China
关键词
global information; semantic segmentation; Swin Transformer;
D O I
10.1109/CSCWD61410.2024.10580494
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
The global context is crucial for the precise segmentation of remote sensing images. However, the large volumes and high spatial resolutions of remote sensing images make efficient analysis of the entire scene challenging for most convolutional neural network (CNN)-based methods. To address this issue, we propose to design an innovative framework for semantic segmentation of remote sensing images called Feature Enhancement Swin Transformer (FEST). Firstly, we utilize the Swin Transformer as the encoder and incorporates a Global Information Enhancement Model (GIEM) within each Swin Transformer block to reduce information loss and enable encoding of more accurate spatial information. Secondly, we introduce an enhanced decoding structure called Enhanced Feature Fusion Module (EFFM) with added enhanced channel and spatial attention modules to retain localized information while obtaining extensive contextual information. Finally, for loss calculation, we utilize the dice and cross-entropy loss to jointly supervise the model, aiming to achieve a competitive performance. We comprehensively evaluated FEST on the ISPRS-Vaihingen and Potsdam datasets. The results indicate that our approach has achieved significant improvements in semantic segmentation tasks compared to existing methods.
引用
收藏
页码:1177 / 1182
页数:6
相关论文
共 50 条
  • [31] Semantic segmentation of multi-scale remote sensing images with contextual feature enhancement
    Zhang, Mei
    Liu, Lingling
    Pei, Yongtao
    Xie, Guojing
    Wen, Jinghua
    VISUAL COMPUTER, 2025, 41 (02): : 1303 - 1317
  • [32] A Semantic Segmentation Method of Remote Sensing Image Based on Feature Fusion and Attention Mechanism
    Wang, Yiqin
    Dong, Yunyun
    JOURNAL OF INFORMATION PROCESSING SYSTEMS, 2024, 20 (05): : 640 - 653
  • [33] Dual-Path Feature Aware Network for Remote Sensing Image Semantic Segmentation
    Geng, Jie
    Song, Shuai
    Jiang, Wen
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (05) : 3674 - 3686
  • [34] SFA-Net: Semantic Feature Adjustment Network for Remote Sensing Image Segmentation
    Hwang, Gyutae
    Jeong, Jiwoo
    Lee, Sang Jun
    REMOTE SENSING, 2024, 16 (17)
  • [35] Global and edge enhanced transformer for semantic segmentation of remote sensing
    Wang, Hengyou
    Li, Xiao
    Huo, Lianzhi
    Hu, Changmiao
    APPLIED INTELLIGENCE, 2024, 54 (07) : 5658 - 5673
  • [36] Unsupervised Domain Adaptation for Remote Sensing Semantic Segmentation with Transformer
    Li, Weitao
    Gao, Hui
    Su, Yi
    Momanyi, Biffon Manyura
    REMOTE SENSING, 2022, 14 (19)
  • [37] A Multilevel Multimodal Fusion Transformer for Remote Sensing Semantic Segmentation
    Ma, Xianping
    Zhang, Xiaokang
    Pun, Man-On
    Liu, Ming
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2024, 62 : 1 - 15
  • [38] CMTFNet: CNN and Multiscale Transformer Fusion Network for Remote-Sensing Image Semantic Segmentation
    Wu, Honglin
    Huang, Peng
    Zhang, Min
    Tang, Wenlong
    Yu, Xinyu
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2023, 61
  • [39] CTFNet: CNN-Transformer Fusion Network for Remote-Sensing Image Semantic Segmentation
    Wu, Honglin
    Huang, Peng
    Zhang, Min
    Tang, Wenlong
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2024, 21 : 1 - 5
  • [40] Remote Sensing Image Recognition Algorithm Based on Pseudo Global Swin Transformer
    Wang K.
    Zuo X.
    Yang Y.
    Fei S.
    Moshi Shibie yu Rengong Zhineng/Pattern Recognition and Artificial Intelligence, 2023, 36 (09): : 818 - 831