Lightweight convolutional neural networks with context broadcast transformer for real-time semantic segmentation

被引:0
|
作者
Hu, Kaidi [1 ,2 ]
Xie, Zongxia [1 ,2 ]
Hu, Qinghua [1 ,2 ]
机构
[1] Tianjin Univ, Coll Intelligence & Comp, Tianjin 300350, Peoples R China
[2] Minist Educ, Engn Res Ctr Urban Intelligence & Digital Governan, Tianjin 300350, Peoples R China
基金
中国国家自然科学基金;
关键词
Lightweight neural network; Vision transformer; Real-time semantic segmentation; Multi -scale fusion; Attention mechanism; FUSION NETWORK;
D O I
10.1016/j.imavis.2024.105053
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
With the increasing application of embedded mobile devices in various fields, lightweight real-time semantic segmentation systems have attracted more and more attention. Many current methods have successfully reduced the model's parameters, but they have led to low model accuracy, diminishing their practical value. In recent years, the Transformer architecture has achieved good results in many tasks, effectively capturing long-range dependencies and enhancing accuracy. However, the Transformer is not adept at extracting local features, and the model's computational cost is generally too high, hindering real-time inference implementation. We propose a lightweight semantic segmentation network called LCBFormer-Net, which embeds Transformer units between asymmetric encoders and decoders to fully leverage their advantages. On the encoder side, we design the Lightweight Multi-Fusion Unit (LMFU) and Partition Grouping Shuffle Channel Attention (PGSCA). The former fully utilizes input features, merging information multiple times through multiple branches and employing depthwise convolutions with dilation rate to further obtain sufficient features. The latter includes a lightweight grouped channel attention, better guide feature extraction. The Lightweight Context Broadcast Transformer (LCB Transformer) is the Transformer unit we designed, with a lightweight structure that significantly reduces GPU memory consumption. It also improves self-attention and feed-forward networks, enhancing the model's robustness. The decoder includes the Multi-scale Semantic Information Attention Fusion (MSIAF) module, guiding the fusion of features at three different scales and employing a hybrid attention mechanism with both channel and spatial attention to guide feature extraction. LCBFormer-Net achieves good segmentation results with a parameter count of 0.88 M on multiple challenging datasets with diverse scenes.
引用
收藏
页数:14
相关论文
共 50 条
  • [21] U2ESPNet-A lightweight and high-accuracy convolutional neural network for real-time semantic segmentation of visible branches
    Wan, Hao
    Zeng, Xilei
    Fan, Zeming
    Zhang, Shanshan
    Kang, Meilin
    [J]. COMPUTERS AND ELECTRONICS IN AGRICULTURE, 2023, 204
  • [22] Bilateral attention decoder: A lightweight decoder for real-time semantic segmentation
    Peng, Chengli
    Tian, Tian
    Chen, Chen
    Guo, Xiaojie
    Ma, Jiayi
    [J]. NEURAL NETWORKS, 2021, 137 : 188 - 199
  • [23] Lightweight and efficient feature fusion real-time semantic segmentation network
    Zhong, Jie
    Chen, Aiguo
    Jiang, Yizhang
    Sun, Chengcheng
    Peng, Yuheng
    [J]. Image and Vision Computing, 2025, 154
  • [24] Attention based lightweight asymmetric network for real-time semantic segmentation
    Liu, Qian
    Wang, Cunbao
    Li, Zhensheng
    Qi, Youwei
    Fang, Jiongtao
    [J]. ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2024, 130
  • [25] Lightweight and efficient asymmetric network design for real-time semantic segmentation
    Xiu-Ling Zhang
    Bing-Ce Du
    Zhao-Ci Luo
    Kai Ma
    [J]. Applied Intelligence, 2022, 52 : 564 - 579
  • [26] MDRNet: a lightweight network for real-time semantic segmentation in street scenes
    Dai, Yingpeng
    Wang, Junzheng
    Li, Jiehao
    Li, Jing
    [J]. ASSEMBLY AUTOMATION, 2021, 41 (06) : 725 - 733
  • [27] ELANet: an efficiently lightweight asymmetrical network for real-time semantic segmentation
    Chen, Jiafei
    Yu, Junyang
    Wang, Yingqi
    He, Xin
    [J]. JOURNAL OF ELECTRONIC IMAGING, 2024, 33 (01)
  • [28] Lightweight and efficient asymmetric network design for real-time semantic segmentation
    Zhang, Xiu-Ling
    Du, Bing-Ce
    Luo, Zhao-Ci
    Ma, Kai
    [J]. APPLIED INTELLIGENCE, 2022, 52 (01) : 564 - 579
  • [29] A Real-Time Semantic Segmentation Algorithm Based on Improved Lightweight Network
    Liu, Cheng
    Gao, Hongxia
    Chen, An
    [J]. 2020 INTERNATIONAL SYMPOSIUM ON AUTONOMOUS SYSTEMS (ISAS), 2020, : 249 - 253
  • [30] Multi-scale deep context convolutional neural networks for semantic segmentation
    Quan Zhou
    Wenbing Yang
    Guangwei Gao
    Weihua Ou
    Huimin Lu
    Jie Chen
    Longin Jan Latecki
    [J]. World Wide Web, 2019, 22 : 555 - 570