Laformer: Vision Transformer for Panoramic Image Semantic Segmentation

被引:1
|
作者
Yuan, Zheng [1 ]
Wang, Junhua [3 ]
Lv, Yuxin [2 ]
Wang, Ding [2 ]
Fang, Yi [2 ]
机构
[1] Fudan Univ, Acad Engn & Technol, Shanghai 200433, Peoples R China
[2] Fudan Univ, Sch Informat Sci & Technol, Shanghai 200433, Peoples R China
[3] Fudan Univ, Inst Optoelect, Shanghai Frontiers Sci Res Base Intelligent Optoel, Shanghai 200438, Peoples R China
关键词
Deformable convolution; panoramic images; prototype adaptation; self-training; semantic segmentation;
D O I
10.1109/LSP.2023.3337716
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Recent years have seen great advances in the area of semantic segmentation. However, general methods are targeted at pinhole images and tend to underperform when directly adopted to panoramic images. And with the wide applications of panoramic cameras, it is important to develop feasible approaches to train segmentation models for their real-time applications. To address this problem, we propose a novel method using self-training and achieve comparable results on DensePASS dataset. Namely, we propose a deformable merge module tailored for panoramic images by efficiently and accurately incorporating features of different levels. We design a novel prototype adaptation term that aids the model to better learn the class-wise feature embeddings of distorted objects. Finally, we use a simple and valid evaluation method to achieve real-time and improved inference performance. All combined, we can reach 58.27% of mIoU scores on DensePASS dataset and achieve new state of the art results.
引用
收藏
页码:1792 / 1796
页数:5
相关论文
共 50 条
  • [1] PASTS: TOWARD EFFECTIVE DISTILLING TRANSFORMER FOR PANORAMIC SEMANTIC SEGMENTATION
    Kim, Jihyun
    Jeong, Somi
    Sohn, Kwanghoon
    2022 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2022, : 2881 - 2885
  • [2] Automatic Medical Image Segmentation with Vision Transformer
    Zhang, Jie
    Li, Fan
    Zhang, Xin
    Wang, Huaijun
    Hei, Xinhong
    APPLIED SCIENCES-BASEL, 2024, 14 (07):
  • [3] ViTBIS: Vision Transformer for Biomedical Image Segmentation
    Sagar, Abhinav
    CLINICAL IMAGE-BASED PROCEDURES, DISTRIBUTED AND COLLABORATIVE LEARNING, ARTIFICIAL INTELLIGENCE FOR COMBATING COVID-19 AND SECURE AND PRIVACY-PRESERVING MACHINE LEARNING, CLIP 2021, DCL 2021, LL-COVID19 2021, PPML 2021, 2021, 12969 : 34 - 45
  • [4] Semantic and structural image segmentation for prosthetic vision
    Sanchez-Garcia, Melani
    Martinez-Cantin, Ruben
    Guerrero, Jose J.
    PLOS ONE, 2020, 15 (01):
  • [5] Privacy-Preserving Semantic Segmentation Using Vision Transformer
    Kiya, Hitoshi
    Nagamori, Teru
    Imaizumi, Shoko
    Shiota, Sayaka
    JOURNAL OF IMAGING, 2022, 8 (09)
  • [6] Semantic Image Segmentation for Information Presentation in Enhanced Vision
    Vygolov, Oleg V.
    Gorbatsevich, Vladimir S.
    Kostromov, Nikita A.
    Lebedev, Maxim A.
    Vizilter, Yury V.
    Knyaz, Vladimir A.
    Zheltov, Sergey Y.
    DEGRADED ENVIRONMENTS: SENSING, PROCESSING, AND DISPLAY 2017, 2017, 10197
  • [7] Evaluating Transformer-based Semantic Segmentation Networks for Pathological Image Segmentation
    Cam Nguyen
    Asad, Zuhayr
    Deng, Ruining
    Huo, Yuankai
    MEDICAL IMAGING 2022: IMAGE PROCESSING, 2022, 12032
  • [8] SemiCVT: Semi-Supervised Convolutional Vision Transformer for Semantic Segmentation
    Huang, Huimin
    Xie, Shiao
    Lin, Lanfen
    Tong, Ruofeng
    Chen, Yen-Wei
    Li, Yuexiang
    Wang, Hong
    Huang, Yawen
    Zheng, Yefeng
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 11340 - 11349
  • [9] Cross-scale sampling transformer for semantic image segmentation
    Ma, Yizhe
    Yu, Long
    Lin, Fangjian
    Tian, Shengwei
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2023, 44 (02) : 2895 - 2907
  • [10] Efficient Depth Fusion Transformer for Aerial Image Semantic Segmentation
    Yan, Li
    Huang, Jianming
    Xie, Hong
    Wei, Pengcheng
    Gao, Zhao
    REMOTE SENSING, 2022, 14 (05)