FEST: Feature Enhancement Swin Transformer for Remote Sensing Image Semantic Segmentation

Cited by: 0
Authors
Zhang, Ronghuan [1, 2, 3]
Zhao, Jing [1, 2, 3]
Li, Ming [4]
Zou, Qingzhi [1, 2, 3]
Affiliations
[1] Qilu Univ Technol, Shandong Acad Sci, Shandong Comp Sci Ctr, Key Lab Comp Power Network & Informat Secur, Minis, Jinan, Peoples R China
[2] Qilu Univ Technol, Shandong Acad Sci, Fac Comp Sci & Technol, Shandong Engn Res Ctr Big Data Appl Technol, Jinan, Peoples R China
[3] Shandong Fundamental Res Ctr Comp Sci, Shandong Prov Key Lab Comp Networks, Jinan, Peoples R China
[4] Shandong Univ Tradit Chinese Med, Sch Intelligence & Informat Engn, Jinan, Peoples R China
Keywords
global information; semantic segmentation; Swin Transformer
DOI
10.1109/CSCWD61410.2024.10580494
Chinese Library Classification (CLC) number
TP39 [Computer applications]
Discipline classification code
081203; 0835
Abstract
Global context is crucial for the precise segmentation of remote sensing images. However, the large volumes and high spatial resolutions of remote sensing images make efficient analysis of the entire scene challenging for most convolutional neural network (CNN)-based methods. To address this issue, we propose the Feature Enhancement Swin Transformer (FEST), a framework for semantic segmentation of remote sensing images. First, we use the Swin Transformer as the encoder and incorporate a Global Information Enhancement Model (GIEM) within each Swin Transformer block to reduce information loss and encode more accurate spatial information. Second, we introduce an enhanced decoding structure, the Enhanced Feature Fusion Module (EFFM), which adds enhanced channel and spatial attention modules to retain local information while capturing extensive contextual information. Finally, for loss calculation, we jointly supervise the model with Dice and cross-entropy losses, aiming for competitive performance. We comprehensively evaluated FEST on the ISPRS Vaihingen and Potsdam datasets. The results indicate that our approach achieves significant improvements in semantic segmentation over existing methods.
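The joint Dice and cross-entropy supervision mentioned in the abstract can be illustrated with a minimal NumPy sketch. The abstract does not give the exact formulation, so everything here is an assumption for illustration: the function names, the soft (probability-based) Dice variant averaged over classes, and the equal default weights `ce_weight` and `dice_weight` are hypothetical and are not taken from the paper.

```python
import numpy as np


def softmax(logits, axis=1):
    """Numerically stable softmax over the class axis."""
    shifted = logits - logits.max(axis=axis, keepdims=True)
    exp = np.exp(shifted)
    return exp / exp.sum(axis=axis, keepdims=True)


def cross_entropy_loss(probs, onehot, eps=1e-7):
    """Mean over all pixels of -log p(true class). Inputs are (N, C, H, W)."""
    per_pixel = -(onehot * np.log(probs + eps)).sum(axis=1)
    return float(per_pixel.mean())


def dice_loss(probs, onehot, eps=1e-7):
    """Soft Dice loss: 1 - mean per-class Dice coefficient (assumed variant)."""
    axes = (0, 2, 3)  # sum over batch and spatial dimensions, keep classes
    intersection = (probs * onehot).sum(axis=axes)
    denominator = probs.sum(axis=axes) + onehot.sum(axis=axes)
    dice_per_class = (2.0 * intersection + eps) / (denominator + eps)
    return float(1.0 - dice_per_class.mean())


def joint_loss(logits, labels, num_classes, ce_weight=1.0, dice_weight=1.0):
    """Weighted sum of cross-entropy and Dice losses.

    logits: float array of shape (N, C, H, W)
    labels: int array of shape (N, H, W) with values in [0, num_classes)
    The weights are illustrative assumptions, not values from the paper.
    """
    probs = softmax(logits, axis=1)
    onehot = np.moveaxis(np.eye(num_classes)[labels], -1, 1)  # (N, C, H, W)
    return (ce_weight * cross_entropy_loss(probs, onehot)
            + dice_weight * dice_loss(probs, onehot))


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    labels = rng.integers(0, 3, size=(2, 4, 4))
    logits = rng.normal(size=(2, 3, 4, 4))
    print(joint_loss(logits, labels, num_classes=3))
```

The intuition behind pairing the two terms is that cross-entropy scores each pixel independently, while Dice measures region overlap per class and is less dominated by frequent classes. How FEST actually balances them is not stated in this record.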
Pages: 1177-1182
Page count: 6