FEST: Feature Enhancement Swin Transformer for Remote Sensing Image Semantic Segmentation

Cited by: 0
Authors
Zhang, Ronghuan [1, 2, 3]
Zhao, Jing [1, 2, 3]
Li, Ming [4]
Zou, Qingzhi [1, 2, 3]
Affiliations
[1] Qilu Univ Technol, Shandong Acad Sci, Shandong Comp Sci Ctr, Key Lab Comp Power Network & Informat Secur,Minis, Jinan, Peoples R China
[2] Qilu Univ Technol, Shandong Acad Sci, Fac Comp Sci & Technol, Shandong Engn Res Ctr Big Data Appl Technol, Jinan, Peoples R China
[3] Shandong Fundamental Res Ctr Comp Sci, Shandong Prov Key Lab Comp Networks, Jinan, Peoples R China
[4] Shandong Univ Tradit Chinese Med, Sch Intelligence & Informat Engn, Jinan, Peoples R China
Keywords
global information; semantic segmentation; Swin Transformer;
DOI
10.1109/CSCWD61410.2024.10580494
Chinese Library Classification (CLC) number
TP39 [Computer applications];
Discipline classification code
081203; 0835;
Abstract
The global context is crucial for the precise segmentation of remote sensing images. However, the large volumes and high spatial resolutions of remote sensing images make efficient analysis of the entire scene challenging for most convolutional neural network (CNN)-based methods. To address this issue, we propose Feature Enhancement Swin Transformer (FEST), a framework for semantic segmentation of remote sensing images. First, we use the Swin Transformer as the encoder and incorporate a Global Information Enhancement Model (GIEM) within each Swin Transformer block to reduce information loss and encode more accurate spatial information. Second, we introduce an enhanced decoding structure, the Enhanced Feature Fusion Module (EFFM), which adds enhanced channel and spatial attention modules to retain local information while capturing extensive contextual information. Finally, we jointly supervise the model with Dice and cross-entropy losses to achieve competitive performance. We comprehensively evaluated FEST on the ISPRS Vaihingen and Potsdam datasets. The results indicate that our approach achieves significant improvements in semantic segmentation over existing methods.
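The joint Dice and cross-entropy supervision mentioned in the abstract can be illustrated with a minimal NumPy sketch. This is not the authors' implementation: the equal loss weights, the smoothing constant `eps`, the six-class setup, and all function names (`joint_loss`, `dice_loss`, and so on) are assumptions made for illustration, since the abstract does not specify them.

```python
import numpy as np


def softmax(logits, axis=1):
    """Numerically stable softmax over the class axis."""
    shifted = logits - logits.max(axis=axis, keepdims=True)
    exp = np.exp(shifted)
    return exp / exp.sum(axis=axis, keepdims=True)


def one_hot(labels, num_classes):
    """Convert integer labels (N, H, W) to one-hot (N, C, H, W)."""
    return np.moveaxis(np.eye(num_classes)[labels], -1, 1)


def cross_entropy_loss(probs, target, eps=1e-7):
    """Mean per-pixel cross-entropy between probabilities and one-hot targets."""
    return float(-(target * np.log(probs + eps)).sum(axis=1).mean())


def dice_loss(probs, target, eps=1e-7):
    """Soft Dice loss, averaged over classes (eps avoids division by zero)."""
    dims = (0, 2, 3)  # sum over batch and spatial axes, keep the class axis
    intersection = (probs * target).sum(axis=dims)
    denom = probs.sum(axis=dims) + target.sum(axis=dims)
    dice = (2.0 * intersection + eps) / (denom + eps)
    return float(1.0 - dice.mean())


def joint_loss(logits, labels, ce_weight=1.0, dice_weight=1.0):
    """Weighted sum of cross-entropy and Dice losses.

    The weights are hypothetical; the abstract only says both losses are used.
    """
    probs = softmax(logits)
    target = one_hot(labels, logits.shape[1])
    return (ce_weight * cross_entropy_loss(probs, target)
            + dice_weight * dice_loss(probs, target))


# Toy example: batch of 2 label maps, 8x8 pixels, 6 classes (assumed).
labels = np.arange(2 * 8 * 8).reshape(2, 8, 8) % 6
confident_logits = 20.0 * one_hot(labels, 6)   # strongly favors the true class
uniform_logits = np.zeros((2, 6, 8, 8))        # no preference between classes
print(joint_loss(confident_logits, labels))
print(joint_loss(uniform_logits, labels))
```

In this toy example the confident prediction yields a loss close to zero, while the uniform prediction is penalized by both terms. Cross-entropy scores each pixel independently, whereas the Dice term measures region overlap per class, which is a common motivation for combining the two when classes are imbalanced.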
Pages: 1177-1182
Number of pages: 6