Bilateral attention decoder: A lightweight decoder for real-time semantic segmentation

Cited by: 44
Authors
Peng, Chengli [1]
Tian, Tian [2]
Chen, Chen [3]
Guo, Xiaojie [4]
Ma, Jiayi [1]
Affiliations
[1] Wuhan Univ, Elect Informat Sch, Wuhan 430072, Peoples R China
[2] Huazhong Univ Sci & Technol, Sch Artificial Intelligence & Automat, Wuhan 430074, Peoples R China
[3] Univ N Carolina, Dept Elect & Comp Engn, Charlotte, NC 28223 USA
[4] Tianjin Univ, Sch Comp Software, Tianjin 300350, Peoples R China
Keywords
Semantic segmentation; Real time; Deep learning; Attention mechanism; NETWORK;
DOI
10.1016/j.neunet.2021.01.021
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
The encoder-decoder structure has been introduced into semantic segmentation to improve the spatial accuracy of the network by fusing high- and low-level feature maps. However, recent state-of-the-art encoder-decoder-based methods can hardly meet real-time requirements because their decoders are complex and inefficient. To address this issue, we propose a lightweight bilateral attention decoder for real-time semantic segmentation. It consists of two blocks and fuses feature maps from different levels in two steps: information refinement and information fusion. In the first step, we propose a channel attention branch to refine the high-level feature maps and a spatial attention branch to refine the low-level ones. The refined high-level feature maps capture more exact semantic information and the refined low-level ones capture more accurate spatial information, which significantly improves the information-capturing ability of these feature maps. In the second step, we develop a new fusion module, named the pooling fusing block, to fuse the refined high- and low-level feature maps. This fusion block takes full advantage of the high- and low-level feature maps, leading to high-quality fusion results. To verify the efficiency of the proposed bilateral attention decoder, we adopt a lightweight network as the backbone and compare our method with other state-of-the-art real-time semantic segmentation methods on the Cityscapes and CamVid datasets. Experimental results demonstrate that our method achieves better performance with a higher inference speed. Moreover, we compare our network with several state-of-the-art non-real-time semantic segmentation methods and find that it can also attain better segmentation performance. (C) 2021 Elsevier Ltd. All rights reserved.
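The two-step flow described in the abstract (refine each level with its own attention branch, then fuse) can be illustrated with a rough NumPy sketch. The abstract does not give the internals of either block, so everything below is an assumption made for illustration: the gating functions, the weights `w` and `v`, the equal channel counts, and especially `fuse`, which is a plain upsample-and-add placeholder and not the authors' pooling fusing block.

```python
import numpy as np


def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))


def channel_attention(high, w):
    """Reweight the channels of a (C, H, W) map.

    Assumed design: global average pooling followed by a linear layer
    and a sigmoid gate. `w` is a hypothetical (C, C) weight matrix.
    """
    pooled = high.mean(axis=(1, 2))       # (C,) one descriptor per channel
    gate = sigmoid(w @ pooled)            # (C,) values in (0, 1)
    return high * gate[:, None, None]


def spatial_attention(low, v):
    """Reweight the pixels of a (C, H, W) map.

    Assumed design: a 1x1 projection over channels followed by a sigmoid
    gate. `v` is a hypothetical (C,) weight vector.
    """
    gate = sigmoid(np.tensordot(v, low, axes=1))  # (H, W) values in (0, 1)
    return low * gate[None, :, :]


def upsample_nearest(x, factor):
    """Nearest-neighbour upsampling of a (C, H, W) map."""
    return x.repeat(factor, axis=1).repeat(factor, axis=2)


def fuse(high, low):
    """Placeholder fusion (NOT the paper's block): upsample and add."""
    factor = low.shape[1] // high.shape[1]
    return low + upsample_nearest(high, factor)


rng = np.random.default_rng(0)
high = rng.standard_normal((8, 4, 4))     # coarse, semantic features
low = rng.standard_normal((8, 16, 16))    # fine, spatial features
w = rng.standard_normal((8, 8))
v = rng.standard_normal(8)

refined_high = channel_attention(high, w)   # step 1a: refine high level
refined_low = spatial_attention(low, v)     # step 1b: refine low level
fused = fuse(refined_high, refined_low)     # step 2: fuse the two levels
print(fused.shape)  # (8, 16, 16)
```

The sketch only shows the data flow and tensor shapes; a real decoder would learn the gating weights and would typically handle differing channel counts between levels.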
Pages: 188-199
Page count: 12