Lightweight convolutional neural networks with context broadcast transformer for real-time semantic segmentation

被引:0
|
作者
Hu, Kaidi [1 ,2 ]
Xie, Zongxia [1 ,2 ]
Hu, Qinghua [1 ,2 ]
机构
[1] Tianjin Univ, Coll Intelligence & Comp, Tianjin 300350, Peoples R China
[2] Minist Educ, Engn Res Ctr Urban Intelligence & Digital Governan, Tianjin 300350, Peoples R China
基金
中国国家自然科学基金;
关键词
Lightweight neural network; Vision transformer; Real-time semantic segmentation; Multi -scale fusion; Attention mechanism; FUSION NETWORK;
D O I
10.1016/j.imavis.2024.105053
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
With the increasing application of embedded mobile devices in various fields, lightweight real-time semantic segmentation systems have attracted more and more attention. Many current methods have successfully reduced the model's parameters, but they have led to low model accuracy, diminishing their practical value. In recent years, the Transformer architecture has achieved good results in many tasks, effectively capturing long-range dependencies and enhancing accuracy. However, the Transformer is not adept at extracting local features, and the model's computational cost is generally too high, hindering real-time inference implementation. We propose a lightweight semantic segmentation network called LCBFormer-Net, which embeds Transformer units between asymmetric encoders and decoders to fully leverage their advantages. On the encoder side, we design the Lightweight Multi-Fusion Unit (LMFU) and Partition Grouping Shuffle Channel Attention (PGSCA). The former fully utilizes input features, merging information multiple times through multiple branches and employing depthwise convolutions with dilation rate to further obtain sufficient features. The latter includes a lightweight grouped channel attention, better guide feature extraction. The Lightweight Context Broadcast Transformer (LCB Transformer) is the Transformer unit we designed, with a lightweight structure that significantly reduces GPU memory consumption. It also improves self-attention and feed-forward networks, enhancing the model's robustness. The decoder includes the Multi-scale Semantic Information Attention Fusion (MSIAF) module, guiding the fusion of features at three different scales and employing a hybrid attention mechanism with both channel and spatial attention to guide feature extraction. LCBFormer-Net achieves good segmentation results with a parameter count of 0.88 M on multiple challenging datasets with diverse scenes.
引用
收藏
页数:14
相关论文
共 50 条
  • [1] FDDWNET: A LIGHTWEIGHT CONVOLUTIONAL NEURAL NETWORK FOR REAL-TIME SEMANTIC SEGMENTATION
    Liu, Jia
    Zhou, Quan
    Qiang, Yong
    Kang, Bin
    Wu, Xiaofu
    Zheng, Baoyu
    [J]. 2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 2373 - 2377
  • [2] LACTNet: A Lightweight Real-Time Semantic Segmentation Network Based on an Aggregated Convolutional Neural Network and Transformer
    Zhang, Xiangyue
    Li, Hexiao
    Ru, Jingyu
    Ji, Peng
    Wu, Chengdong
    [J]. ELECTRONICS, 2024, 13 (12)
  • [3] A Lightweight and Dynamic Convolutional Network for Real-time Semantic Segmentation
    Zhang, Chunyu
    Xu, Fang
    Wu, Chengdong
    [J]. 2023 35TH CHINESE CONTROL AND DECISION CONFERENCE, CCDC, 2023, : 4062 - 4067
  • [4] Lightweight Real-Time Semantic Segmentation Network With Efficient Transformer and CNN
    Xu, Guoan
    Li, Juncheng
    Gao, Guangwei
    Lu, Huimin
    Yang, Jian
    Yue, Dong
    [J]. IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2023, 24 (12) : 15897 - 15906
  • [5] Deep Context Convolutional Neural Networks for Semantic Segmentation
    Yang, Wenbin
    Zhou, Quan
    Fan, Yawen
    Gao, Guangwei
    Wu, Songsong
    Ou, Weihua
    Lu, Huimin
    Cheng, Jie
    Latecki, Longin Jan
    [J]. COMPUTER VISION, PT I, 2017, 771 : 696 - 704
  • [6] Real-time iris segmentation model based on lightweight convolutional neural network
    Huo, Guang
    Lin, Dawei
    Liu, Yuanning
    Zhu, Xiaodong
    Yuan, Meng
    [J]. JOURNAL OF ELECTRONIC IMAGING, 2022, 31 (04)
  • [7] A lightweight multi-dimension dynamic convolutional network for real-time semantic segmentation
    Zhang, Chunyu
    Xu, Fang
    Wu, Chengdong
    Xu, Chenglong
    [J]. FRONTIERS IN NEUROROBOTICS, 2022, 16
  • [8] Lightweight Bilateral Network for Real-Time Semantic Segmentation
    Wang, Pengtao
    Li, Lihong
    Pan, Feiyang
    Wang, Lin
    [J]. JOURNAL OF ADVANCED COMPUTATIONAL INTELLIGENCE AND INTELLIGENT INFORMATICS, 2023, 27 (04) : 673 - 682
  • [9] ICANet: A LIGHTWEIGHT INCREASING CONTEXT AIDED NETWORK FOR REAL-TIME IMAGE SEMANTIC SEGMENTATION
    Chen, Lei
    Dai, Huhe
    Zheng, Yuan
    [J]. 2023 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, ICME, 2023, : 414 - 419
  • [10] Exploring Compact and Efficient Neural Networks for Real-Time Semantic Segmentation
    Li, Zijia
    Xiao, ZhenZhong
    Song, Zhan
    [J]. 2023 IEEE 26TH INTERNATIONAL CONFERENCE ON INTELLIGENT TRANSPORTATION SYSTEMS, ITSC, 2023, : 291 - 297