LSTFormer:Lightweight Semantic Segmentation Network Based on Swin Transformer

被引:0
|
作者
Yang, Cheng [1 ]
Gao, Jianlin [1 ]
Zheng, Meilin [1 ]
Ding, Rong [1 ]
机构
[1] College of Big Data and Information Engineering, Guizhou University, Guiyang,550025, China
关键词
Semantics;
D O I
10.3778/j.issn.1002-8331.2210-0331
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Aiming at the general problem of high computational complexity in existing semantic segmentation networks based on Transformer, a lightweight semantic segmentation network based on Swin Transformer is proposed. Firstly, feature maps of multiple scales are obtained by Swin Transformer. Secondly, the full perception module and the improved cascading fusion module are used to fuse the feature maps of different scales across layers, reducing the semantic gap between the feature maps of different levels. Then, a single Swin Transformer block is introduced to optimize the initial segmentation feature mapping and improve the ability of the network to classify different pixels through the moving window autoattention mechanism. Finally, Dice loss function and cross-entropy loss function are added in the training stage to improve the segmentation performance and convergence speed of the network. The experimental results show that the mIoU of LSTFormer on ADE20K and Cityscapes reaches 49.47% and 81.47%. Compared with similar networks such as SETR and Swin-UPerNet, LSTFormer has lower parameters and computation while maintaining the same segmentation accuracy. © 2023 Journal of Computer Engineering and Applications Beijing Co., Ltd.; Science Press. All rights reserved.
引用
收藏
页码:166 / 175
相关论文
共 50 条
  • [1] Indoor semantic segmentation based on Swin-Transformer
    Zheng, Yunping
    Xu, Yuan
    Shu, Shiqiang
    Sarem, Mudar
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2024, 98
  • [2] ER-Swin: Feature Enhancement and Refinement Network Based on Swin Transformer for Semantic Segmentation of Remote Sensing Images
    Liu, Jiang
    Cheng, Shuli
    Du, Anyu
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2024, 21 : 1 - 5
  • [3] Hybrid semantic segmentation for tunnel lining cracks based on Swin Transformer and convolutional neural network
    Zhou, Zhong
    Zhang, Junjie
    Gong, Chenjie
    COMPUTER-AIDED CIVIL AND INFRASTRUCTURE ENGINEERING, 2023, 38 (17) : 2491 - 2510
  • [4] Swin Transformer Based Pyramid Pooling Network for Food Segmentation
    Wang, Qiankun
    Dong, Xiaoxiao
    Wang, Ruimin
    Sun, Hao
    2022 2ND IEEE INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING AND ARTIFICIAL INTELLIGENCE (SEAI 2022), 2022, : 64 - 68
  • [5] SSformer: A Lightweight Transformer for Semantic Segmentation
    Shi, Wentao
    Xu, Jing
    Gao, Pan
    2022 IEEE 24TH INTERNATIONAL WORKSHOP ON MULTIMEDIA SIGNAL PROCESSING (MMSP), 2022,
  • [6] Semantic Segmentation Method for Remote Sensing Images Based on Improved Swin Transformer
    Wang, Yizhong
    Hu, Yaqi
    Wu, Xiaosuo
    Yan, Haowen
    Wang, Xiaocheng
    Computer Engineering and Applications, 2024, 60 (11) : 194 - 203
  • [7] P-Swin: Parallel Swin transformer multi-scale semantic segmentation network for land cover classification
    Wang, Di
    Yang, Ronghao
    Zhang, Zhenxin
    Liu, Hanhu
    Tan, Junxiang
    Li, Shaoda
    Yang, Xiaoxia
    Wang, Xiao
    Tang, Kangqi
    Qiao, Yichun
    Su, Po
    COMPUTERS & GEOSCIENCES, 2023, 175
  • [8] STransFuse: Fusing Swin Transformer and Convolutional Neural Network for Remote Sensing Image Semantic Segmentation
    Gao, Liang
    Liu, Hui
    Yang, Minhang
    Chen, Long
    Wan, Yaling
    Xiao, Zhengqing
    Qian, Yurong
    IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2021, 14 (14) : 10990 - 11003
  • [9] Semantic Segmentation Method of UAV Image Based on Window Attention Aggregation Swin Transformer
    Li, Junjie
    Yi, Shi
    He, Runhua
    Liu, Xi
    Computer Engineering and Applications, 2024, 60 (15) : 198 - 210
  • [10] Speech Semantic Communication Based on Swin Transformer
    Zhou, Ziliang
    Zheng, Shilian
    Chen, Jie
    Zhao, Zhijin
    Yang, Xiaoniu
    IEEE TRANSACTIONS ON COGNITIVE COMMUNICATIONS AND NETWORKING, 2024, 10 (03) : 756 - 768