Depth-Wise Asymmetric Bottleneck With Point-Wise Aggregation Decoder for Real-Time Semantic Segmentation in Urban Scenes

被引:37
|
作者
Li, Gen [1 ]
Jiang, Shenlu [1 ]
Yun, Inyong [1 ]
Kim, Jonghyun [1 ]
Kim, Joongkyu [1 ]
机构
[1] Sungkyunkwan Univ, Dept Elect Elect & Comp Engn, Suwon 16419, South Korea
基金
新加坡国家研究基金会;
关键词
Real-time semantic segmentation; encoder-decoder network; convolutional neural network; urban scenes; lightweight network;
D O I
10.1109/ACCESS.2020.2971760
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Semantic segmentation is a process of linking each pixel in an image to a class label, and is widely used in the field of autonomous vehicles and robotics. Although deep learning methods have already made great progress for semantic segmentation, they either achieve great results with numerous parameters or design lightweight models but heavily sacrifice the segmentation accuracy. Because of the strict requirements of real-world applications, it is critical to design an effective real-time model with both competitive segmentation accuracy and small model capacity. In this paper, we propose a lightweight network named DABNet, which employs Depth-wise Asymmetric Bottleneck (DAB) and Point-wise Aggregation Decoder (PAD) module to tackle the challenging real-time semantic segmentation in urban scenes. Specifically, the DAB module creates a sufficient receptive field and densely utilizes the contextual information, and the PAD module aggregates the feature maps of different scales to optimize performance through the attention mechanism. Compared with existing methods, our network substantially reduces the number of parameters but still achieves high accuracy with real-time inference ability. Extensive ablation experiments on two challenging urban scene datasets (Cityscapes and CamVid) have proved the effectiveness of the proposed approach in real-time semantic segmentation.
引用
收藏
页码:27495 / 27506
页数:12
相关论文
共 50 条
  • [1] DAABNet: depth-wise asymmetric attention bottleneck for real-time semantic segmentation
    Tang, Qingsong
    Chen, Yingli
    Zhao, Minghui
    Min, Shitong
    Jiang, Wuming
    INTERNATIONAL JOURNAL OF MULTIMEDIA INFORMATION RETRIEVAL, 2024, 13 (01)
  • [2] DAABNet: depth-wise asymmetric attention bottleneck for real-time semantic segmentation
    Qingsong Tang
    Yingli Chen
    Minghui Zhao
    Shitong Min
    Wuming Jiang
    International Journal of Multimedia Information Retrieval, 2024, 13
  • [3] Real-time Image Semantic Segmentation Networks with Residual Depth-wise Separable Blocks
    Van-Viet Doan
    Duy-Hung Nguyen
    Quoc-Long Tran
    Do-Van Nguyen
    Thanh-Ha Le
    2018 JOINT 10TH INTERNATIONAL CONFERENCE ON SOFT COMPUTING AND INTELLIGENT SYSTEMS (SCIS) AND 19TH INTERNATIONAL SYMPOSIUM ON ADVANCED INTELLIGENT SYSTEMS (ISIS), 2018, : 174 - 179
  • [4] STN: Saliency-Guided Transformer Network for Point-Wise Semantic Segmentation of Urban Scenes
    Ma, Lingfei
    Li, Jonathan
    Guan, Haiyan
    Yu, Yongtao
    Chen, Yiping
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2022, 19
  • [5] PPANet: Point-Wise Pyramid Attention Network for Semantic Segmentation
    Elhassan, Mohammed A. M.
    Chen, YuXuan
    Chen, Yunyi
    Huang, Chenxi
    Yang, Jane
    Yao, Xingcong
    Yang, Chenhui
    Cheng, Yinuo
    WIRELESS COMMUNICATIONS & MOBILE COMPUTING, 2021, 2021
  • [6] PDS-Net: A novel point and depth-wise separable convolution for real-time object detection
    Junayed, Masum Shah
    Islam, Md Baharul
    Imani, Hassan
    Aydin, Tarkan
    INTERNATIONAL JOURNAL OF MULTIMEDIA INFORMATION RETRIEVAL, 2022,
  • [7] PDS-Net: A novel point and depth-wise separable convolution for real-time object detection
    Junayed, Masum Shah
    Islam, Md Baharul
    Imani, Hassan
    Aydin, Tarkan
    INTERNATIONAL JOURNAL OF MULTIMEDIA INFORMATION RETRIEVAL, 2022, 11 (02) : 171 - 188
  • [8] PDS-Net: A novel point and depth-wise separable convolution for real-time object detection
    Masum Shah Junayed
    Md Baharul Islam
    Hassan Imani
    Tarkan Aydin
    International Journal of Multimedia Information Retrieval, 2022, 11 : 171 - 188
  • [9] HyperSeg: Patch-wise Hypernetwork for Real-time Semantic Segmentation
    Nirkin, Yuval
    Wolf, Lior
    Hassner, Tal
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 4060 - 4069
  • [10] Research on Efficient Asymmetric Attention Module for Real-Time Semantic Segmentation Networks in Urban Scenes
    Su, Xu
    Li, Lihong
    Xiao, Jiejie
    Wang, Pengtao
    JOURNAL OF ADVANCED COMPUTATIONAL INTELLIGENCE AND INTELLIGENT INFORMATICS, 2024, 28 (03) : 562 - 572