A Transformer-Based Decoder for Semantic Segmentation with Multi-level Context Mining

被引:9
|
作者
Shi, Bowen [1 ]
Jiang, Dongsheng [2 ]
Zhang, Xiaopeng [2 ]
Li, Han [1 ]
Dai, Wenrui [1 ]
Zou, Junni [1 ]
Xiong, Hongkai [1 ]
Tian, Qi [2 ]
机构
[1] Shanghai Jiao Tong Univ, Shanghai, Peoples R China
[2] Huawei Cloud EI, Shenzhen, Peoples R China
来源
基金
中国国家自然科学基金;
关键词
D O I
10.1007/978-3-031-19815-1_36
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Transformers have recently shown superior performance than CNN on semantic segmentation. However, previous works mostly focus on the deliberate design of the encoder, while seldom considering the decoder part. In this paper, we find that a light weighted decoder counts for segmentation, and propose a pure transformer-based segmentation decoder, named SegDeformer, to seamlessly incorporate into current varied transformer-based encoders. The highlight is that SegDeformer is able to conveniently utilize the tokenized input and the attention mechanism of the transformer for effective context mining. This is achieved by two key component designs, i.e., the internal and external context mining modules. The former is equipped with internal attention within an image to better capture global-local context, while the latter introduces external tokens from other images to enhance current representation. To enable SegDeformer in a scalable way, we further provide performance/efficiency optimization modules for flexible deployment. Experiments on widely used benchmarks ADE20K, COCO-Stuff and Cityscapes and different transformer encoders (e.g., ViT, MiT and Swin) demonstrate that SegDeformer can bring consistent performance gains. Code is available at https://github.com/lygsbw/segdeformer.
引用
收藏
页码:624 / 639
页数:16
相关论文
共 50 条
  • [1] MUSTER: A Multi-Scale Transformer-Based Decoder for Semantic Segmentation
    Xu, Jing
    Shi, Wentao
    Gao, Pan
    Li, Qizhu
    Wang, Zhengwei
    [J]. IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTATIONAL INTELLIGENCE, 2024,
  • [2] Transformer-Based Decoder Designs for Semantic Segmentation on Remotely Sensed Images
    Panboonyuen, Teerapong
    Jitkajornwanich, Kulsawasd
    Lawawirojwong, Siam
    Srestasathiern, Panu
    Vateekul, Peerapon
    [J]. REMOTE SENSING, 2021, 13 (24)
  • [3] Multi-Level Transformer-Based Social Relation Recognition
    Wang, Yuchen
    Qing, Linbo
    Wang, Zhengyong
    Cheng, Yongqiang
    Peng, Yonghong
    [J]. SENSORS, 2022, 22 (15)
  • [4] TransRSS: Transformer-based Radar Semantic Segmentation
    Zou, Hao
    Xie, Zhen
    Ou, Jiarong
    Gao, Yutao
    [J]. 2023 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION, ICRA, 2023, : 6965 - 6972
  • [5] MLCRNet: Multi-Level Context Refinement for Semantic Segmentation in Aerial Images
    Huang, Zhifeng
    Zhang, Qian
    Zhang, Guixu
    [J]. REMOTE SENSING, 2022, 14 (06)
  • [6] TransVPR: Transformer-Based Place Recognition with Multi-Level Attention Aggregation
    Wang, Ruotong
    Shen, Yanqing
    Zuo, Weiliang
    Zhou, Sanping
    Zheng, Nanning
    [J]. 2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2022, : 13638 - 13647
  • [7] Transformer-Based Semantic Segmentation for Recycling Materials in Construction
    Wang, Xin
    Han, Wei
    Mo, Sicheng
    Cai, Ting
    Gong, Yijing
    Li, Yin
    Zhu, Zhenhua
    [J]. COMPUTING IN CIVIL ENGINEERING 2023-DATA, SENSING, AND ANALYTICS, 2024, : 25 - 33
  • [8] Evaluating Transformer-based Semantic Segmentation Networks for Pathological Image Segmentation
    Cam Nguyen
    Asad, Zuhayr
    Deng, Ruining
    Huo, Yuankai
    [J]. MEDICAL IMAGING 2022: IMAGE PROCESSING, 2022, 12032
  • [9] A Transformer-based Semantic Segmentation Model for Street Fashion Images
    Peng, Dingjie
    Kameyama, Wataru
    [J]. INTERNATIONAL WORKSHOP ON ADVANCED IMAGING TECHNOLOGY, IWAIT 2023, 2023, 12592
  • [10] TRANSFORMER-BASED METHOD FOR SEMANTIC SEGMENTATION AND RECONSTRUCTION OF THE MARTIAN SURFACE
    Li, Z.
    Wu, B.
    Chen, Z.
    Ma, Y.
    [J]. GEOSPATIAL WEEK 2023, VOL. 48-1, 2023, : 1643 - 1649