SSformer: A Lightweight Transformer for Semantic Segmentation

被引:11
|
作者
Shi, Wentao [1 ]
Xu, Jing [1 ]
Gao, Pan [1 ]
机构
[1] Nanjing Univ Aeronaut & Astronaut, Coll Comp Sci & Technol, Nanjing, Peoples R China
关键词
Image Segmentation; Transformer; Multilayer perceptron; Lightweight model;
D O I
10.1109/MMSP55362.2022.9949177
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
It is well believed that Transformer performs better in semantic segmentation compared to convolutional neural networks. Nevertheless, the original Vision Transformer [2] may lack of inductive biases of local neighborhoods and possess a high time complexity. Recently, Swin Transformer [3] sets a new record in various vision tasks by using hierarchical architecture and shifted windows while being more efficient. However, as Swin Transformer is specifically designed for image classification, it may achieve suboptimal performance on dense prediction-based segmentation task. Further, simply combing Swin Transformer with existing methods would lead to the boost of model size and parameters for the final segmentation model. In this paper, we rethink the Swin Transformer for semantic segmentation, and design a lightweight yet effective transformer model, called SSformer. In this model, considering the inherent hierarchical design of Swin Transformer, we propose a decoder to aggregate information from different layers, thus obtaining both local and global attentions. Experimental results show the proposed SSformer yields comparable mIoU performance with state-of-the-art models, while maintaining a smaller model size and lower compute. Source code and pretrained models are available at: https://github.com/shiwt03/SSformer
引用
收藏
页数:5
相关论文
共 50 条
  • [31] LACTNet: A Lightweight Real-Time Semantic Segmentation Network Based on an Aggregated Convolutional Neural Network and Transformer
    Zhang, Xiangyue
    Li, Hexiao
    Ru, Jingyu
    Ji, Peng
    Wu, Chengdong
    ELECTRONICS, 2024, 13 (12)
  • [32] A Lightweight CNN-Transformer Network With Laplacian Loss for Low-Altitude UAV Imagery Semantic Segmentation
    Lu, Wen
    Zhang, Zhiqi
    Nguyen, Minh
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2024, 62 : 1 - 20
  • [33] TransBridge: A Lightweight Transformer for Left Ventricle Segmentation in Echocardiography
    Deng, Kaizhong
    Meng, Yanda
    Gao, Dongxu
    Bridge, Joshua
    Shen, Yaochun
    Lip, Gregory
    Zhao, Yitian
    Zheng, Yalin
    SIMPLIFYING MEDICAL ULTRASOUND, 2021, 12967 : 63 - 72
  • [34] Lightweight Self-Attention Network for Semantic Segmentation
    Zhou, Yan
    Zhou, Haibin
    Li, Nanjun
    Li, Jianxun
    Wang, Dongli
    2022 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2022,
  • [35] LSSMask: a lightweight semantic segmentation network for dynamic object
    Xiaofeng Lian
    Maomao Kang
    Li Tan
    Xiao Sun
    Yanli Wang
    Signal, Image and Video Processing, 2025, 19 (3)
  • [36] A Graph-Involved Lightweight Semantic Segmentation Network
    Xia, Xue
    You, Jiayu
    Fang, Yuming
    PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2023, PT VII, 2024, 14431 : 372 - 383
  • [37] Lightweight and Progressively-Scalable Networks for Semantic Segmentation
    Zhang, Yiheng
    Yao, Ting
    Qiu, Zhaofan
    Mei, Tao
    INTERNATIONAL JOURNAL OF COMPUTER VISION, 2023, 131 (08) : 2153 - 2171
  • [38] Mobile-SegFormer: A Lightweight Semantic Segmentation Network
    Lin, Zhenyuan
    Li, Weikun
    Gao, Dahua
    ADVANCED INTELLIGENT COMPUTING TECHNOLOGY AND APPLICATIONS, PT XII, ICIC 2024, 2024, 14873 : 294 - 305
  • [39] Lightweight and Progressively-Scalable Networks for Semantic Segmentation
    Yiheng Zhang
    Ting Yao
    Zhaofan Qiu
    Tao Mei
    International Journal of Computer Vision, 2023, 131 : 2153 - 2171
  • [40] Lightweight Semantic Segmentation Network Based on Attention Coding
    Chen Xiaolong
    Zhao Ji
    Chen Siyi
    LASER & OPTOELECTRONICS PROGRESS, 2021, 58 (14)