Depth-Wise Asymmetric Bottleneck With Point-Wise Aggregation Decoder for Real-Time Semantic Segmentation in Urban Scenes

被引:37
|
作者
Li, Gen [1 ]
Jiang, Shenlu [1 ]
Yun, Inyong [1 ]
Kim, Jonghyun [1 ]
Kim, Joongkyu [1 ]
机构
[1] Sungkyunkwan Univ, Dept Elect Elect & Comp Engn, Suwon 16419, South Korea
基金
新加坡国家研究基金会;
关键词
Real-time semantic segmentation; encoder-decoder network; convolutional neural network; urban scenes; lightweight network;
D O I
10.1109/ACCESS.2020.2971760
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Semantic segmentation is a process of linking each pixel in an image to a class label, and is widely used in the field of autonomous vehicles and robotics. Although deep learning methods have already made great progress for semantic segmentation, they either achieve great results with numerous parameters or design lightweight models but heavily sacrifice the segmentation accuracy. Because of the strict requirements of real-world applications, it is critical to design an effective real-time model with both competitive segmentation accuracy and small model capacity. In this paper, we propose a lightweight network named DABNet, which employs Depth-wise Asymmetric Bottleneck (DAB) and Point-wise Aggregation Decoder (PAD) module to tackle the challenging real-time semantic segmentation in urban scenes. Specifically, the DAB module creates a sufficient receptive field and densely utilizes the contextual information, and the PAD module aggregates the feature maps of different scales to optimize performance through the attention mechanism. Compared with existing methods, our network substantially reduces the number of parameters but still achieves high accuracy with real-time inference ability. Extensive ablation experiments on two challenging urban scene datasets (Cityscapes and CamVid) have proved the effectiveness of the proposed approach in real-time semantic segmentation.
引用
收藏
页码:27495 / 27506
页数:12
相关论文
共 50 条
  • [21] Triple-Branch Asymmetric Network for Real-time Semantic Segmentation of Road Scenes
    Yazhi Zhang
    Xuguang Zhang
    Hui Yu
    Instrumentation, 2024, 11 (02) : 72 - 82
  • [22] Real-Time Joint Semantic Segmentation and Depth Estimation Using Asymmetric Annotations
    Nekrasov, Vladimir
    Dharmasiri, Thanuja
    Spek, Andrew
    Drummond, Tom
    Shen, Chunhua
    Reid, Ian
    2019 INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2019, : 7101 - 7107
  • [23] DSANet: Dilated spatial attention for real-time semantic segmentation in urban street scenes
    Elhassan, Mohammed A. M.
    Huang, Chenxi
    Yang, Chenhui
    Munea, Tewodros Legesse
    EXPERT SYSTEMS WITH APPLICATIONS, 2021, 183
  • [24] Real-Time High-Performance Semantic Image Segmentation of Urban Street Scenes
    Dong, Genshun
    Yan, Yan
    Shen, Chunhua
    Wang, Hanzi
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2021, 22 (06) : 3258 - 3274
  • [25] PUA-MOS: End-to-End Point-wise Uncertainty Weighted Aggregation for Moving Object Segmentation
    Chi, Cheng
    Li, Peiliang
    Chen, Xiaozhi
    Yang, Xin
    2022 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2022, : 12456 - 12463
  • [26] A lightweight network with attention decoder for real-time semantic segmentation
    Kang Wang
    Jinfu Yang
    Shuai Yuan
    Mingai Li
    The Visual Computer, 2022, 38 : 2329 - 2339
  • [27] ADFNet: accumulated decoder features for real-time semantic segmentation
    Choi, Hyunguk
    Ahn, Hoyeon
    Kim, Joonmo
    Jeon, Moongu
    IET COMPUTER VISION, 2020, 14 (08) : 555 - 563
  • [28] Deep Multi-Branch Aggregation Network for Real-Time Semantic Segmentation in Street Scenes
    Weng, Xi
    Yan, Yan
    Dong, Genshun
    Shu, Chang
    Wang, Biao
    Wang, Hanzi
    Zhang, Ji
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2022, 23 (10) : 17224 - 17240
  • [29] Design of Real-time Semantic Segmentation Decoder for Automated Driving
    Das, Arindam
    Kandan, Saranya
    Yogamani, Senthil
    Krizek, Pavel
    PROCEEDINGS OF THE 14TH INTERNATIONAL JOINT CONFERENCE ON COMPUTER VISION, IMAGING AND COMPUTER GRAPHICS THEORY AND APPLICATIONS (VISAPP), VOL 5, 2019, : 393 - 400
  • [30] A lightweight network with attention decoder for real-time semantic segmentation
    Wang, Kang
    Yang, Jinfu
    Yuan, Shuai
    Li, Mingai
    VISUAL COMPUTER, 2022, 38 (07): : 2329 - 2339