A lightweight network with attention decoder for real-time semantic segmentation

被引:0
|
作者
Kang Wang
Jinfu Yang
Shuai Yuan
Mingai Li
机构
[1] Beijing University of Technology,Faculty of Information Technology
来源
The Visual Computer | 2022年 / 38卷
关键词
Semantic segmentation; Encoder–decoder structure; Depth-wise separable asymmetric convolution; Dilated convolution; Attention mechanism;
D O I
暂无
中图分类号
学科分类号
摘要
As an important task in scene understanding, semantic segmentation requires a large amount of computation to achieve high performance. In recent years, with the rise of autonomous systems, it is crucial to make a trade-off in terms of accuracy and speed. In this paper, we propose a novel asymmetric encoder–decoder network structure to address this problem. In the encoder, we design a Separable Asymmetric Module, which combines depth-wise separable asymmetric convolution with dilated convolution to greatly reduce computation cost while maintaining accuracy. On the other hand, an attention mechanism is also used in the decoder to further improve segmentation performance. Experimental results on CityScapes and CamVid datasets show that the proposed method can achieve a better balance between segmentation precision and speed compared with state-of-the-art semantic segmentation methods. Specifically, our model obtains mean IoU of 72.5% and 66.3% on CityScapes and CamVid test dataset, respectively, with less than 1M parameters.
引用
收藏
页码:2329 / 2339
页数:10
相关论文
共 50 条
  • [31] LARFNet: Lightweight asymmetric refining fusion network for real-time semantic segmentation
    Hu, Xuegang
    Gong, Juelin
    [J]. COMPUTERS & GRAPHICS-UK, 2022, 109 : 55 - 64
  • [32] LDPNet: A Lightweight Densely Connected Pyramid Network for Real-Time Semantic Segmentation
    Hu, Xuegang
    Jing, Liyuan
    [J]. IEEE ACCESS, 2020, 8 : 212647 - 212658
  • [33] DARSegNet: A Real-Time Semantic Segmentation Method Based on Dual Attention Fusion Module and Encoder-Decoder Network
    Xing, Yongfeng
    Zhong, Luo
    Zhong, Xian
    [J]. MATHEMATICAL PROBLEMS IN ENGINEERING, 2022, 2022
  • [34] Fast Real-time Semantic Segmentation Network with an Asymmetric Encoder-Decoder Structure
    Rui, Tang
    Yan, Li Hui
    Kai, Xu
    Yi, Ding
    [J]. 2020 5TH INTERNATIONAL CONFERENCE ON MECHANICAL, CONTROL AND COMPUTER ENGINEERING (ICMCCE 2020), 2020, : 2408 - 2413
  • [35] PPEDNet: Pyramid Pooling Encoder-Decoder Network for Real-Time Semantic Segmentation
    Tan, Zhentao
    Liu, Bin
    Yu, Nenghai
    [J]. IMAGE AND GRAPHICS (ICIG 2017), PT I, 2017, 10666 : 328 - 339
  • [36] Block attention network: A lightweight deep network for real-time semantic segmentation of road scenes in resource-constrained devices
    Mazhar, Saquib
    Atif, Nadeem
    Bhuyan, M. K.
    Ahamed, Shaik Rafi
    [J]. ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2023, 126
  • [37] ADFNet: accumulated decoder features for real-time semantic segmentation
    Choi, Hyunguk
    Ahn, Hoyeon
    Kim, Joonmo
    Jeon, Moongu
    [J]. IET COMPUTER VISION, 2020, 14 (08) : 555 - 563
  • [38] Design of Real-time Semantic Segmentation Decoder for Automated Driving
    Das, Arindam
    Kandan, Saranya
    Yogamani, Senthil
    Krizek, Pavel
    [J]. PROCEEDINGS OF THE 14TH INTERNATIONAL JOINT CONFERENCE ON COMPUTER VISION, IMAGING AND COMPUTER GRAPHICS THEORY AND APPLICATIONS (VISAPP), VOL 5, 2019, : 393 - 400
  • [39] Joint pyramid attention network for real-time semantic segmentation of urban scenes
    Hu, Xuegang
    Jing, Liyuan
    Sehar, Uroosa
    [J]. APPLIED INTELLIGENCE, 2022, 52 (01) : 580 - 594
  • [40] Joint pyramid attention network for real-time semantic segmentation of urban scenes
    Xuegang Hu
    Liyuan Jing
    Uroosa Sehar
    [J]. Applied Intelligence, 2022, 52 : 580 - 594