Real-time semantic segmentation in traffic scene using Cross Stage Partial-based encoder-decoder network

被引:7
|
作者
Zhou, Liguo [1 ]
Chen, Guang [2 ]
Liu, Lian [1 ]
Wang, Ruining [1 ]
Knoll, Alois [1 ]
机构
[1] Tech Univ Munich, Chair Robot Artificial Intelligence & Realtime Sys, Garching, Germany
[2] Tongji Univ, Sch Automot Studies, Shanghai, Peoples R China
基金
中国国家自然科学基金;
关键词
Convolutional neural network; Semantic segmentation; Cross stage partial; Traffic scene; BACKPROPAGATION;
D O I
10.1016/j.engappai.2023.106901
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Real-time semantic segmentation in traffic scenes plays an essential part in autonomous driving. The encoder-decoder-based network architecture can well combine the context information and detailed information required for the semantic segmentation task. Achieving a good balance between inference speed and accuracy is a crucial challenge, as considerable real-time semantic segmentation models process information in real-time at the expense of accuracy degradation. This paper presents an encoder-decoder network model based on Cross Stage Partial (CSP) block for real-time semantic segmentation in traffic scenes. Integrating the CSP block can not only lessen the computational overhead but also enhance the feature extraction ability of the network. In addition, we append the Fast Spatial Pyramid Pooling module to the backbone of the network, which can aggregate global information at a low computational cost. On NVIDIA RTX 3090, the middle model of our method can achieve a mean intersection over union (mIOU) of 80.8% at 64.3 frames per second (FPS) on the Cityscapes test set and an mIOU of 81.3% at 105.3 FPS on the CamVid Test Set. The large model of our method can realize an mIOU of 81.5% at 48.4 FPS on the test set of Cityscapes. Our source code is available at https://github.com/zhouliguo/cspsg.
引用
收藏
页数:8
相关论文
共 50 条
  • [1] LEDNET: A LIGHTWEIGHT ENCODER-DECODER NETWORK FOR REAL-TIME SEMANTIC SEGMENTATION
    Wang, Yu
    Zhou, Quan
    Liu, Jia
    Xiong, Jian
    Gao, Guangwei
    Wu, Xiaofu
    Latecki, Longin Jan
    2019 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2019, : 1860 - 1864
  • [2] PPEDNet: Pyramid Pooling Encoder-Decoder Network for Real-Time Semantic Segmentation
    Tan, Zhentao
    Liu, Bin
    Yu, Nenghai
    IMAGE AND GRAPHICS (ICIG 2017), PT I, 2017, 10666 : 328 - 339
  • [3] Fast Real-time Semantic Segmentation Network with an Asymmetric Encoder-Decoder Structure
    Rui, Tang
    Yan, Li Hui
    Kai, Xu
    Yi, Ding
    2020 5TH INTERNATIONAL CONFERENCE ON MECHANICAL, CONTROL AND COMPUTER ENGINEERING (ICMCCE 2020), 2020, : 2408 - 2413
  • [4] Real-time semantic segmentation of microvascular decompression images based on encoder-decoder structure
    Bai Rui-feng
    Jiang Shan
    Sun Hai-jiang
    Liu Xin-rui
    CHINESE OPTICS, 2022, 15 (05) : 1055 - 1065
  • [5] DARSegNet: A Real-Time Semantic Segmentation Method Based on Dual Attention Fusion Module and Encoder-Decoder Network
    Xing, Yongfeng
    Zhong, Luo
    Zhong, Xian
    MATHEMATICAL PROBLEMS IN ENGINEERING, 2022, 2022
  • [6] Deep Encoder-Decoder Network-Based Wildfire Segmentation Using Drone Images in Real-Time
    Muksimova, Shakhnoza
    Mardieva, Sevara
    Cho, Young-Im
    REMOTE SENSING, 2022, 14 (24)
  • [7] An Encoder-Decoder Network Based FCN Architecture for Semantic Segmentation
    Xing, Yongfeng
    Zhong, Luo
    Zhong, Xian
    WIRELESS COMMUNICATIONS & MOBILE COMPUTING, 2020, 2020
  • [8] Attention Based Encoder-decoder Network for Cardiac Semantic Segmentation
    Yuan, Xiaohan
    Zhu, Yinsu
    Wang, Yangang
    2020 CHINESE AUTOMATION CONGRESS (CAC 2020), 2020, : 4578 - 4582
  • [9] SqueezedText: A Real-Time Scene Text Recognition by Binary Convolutional Encoder-Decoder Network
    Liu, Zichuan
    Li, Yixing
    Ren, Fengbo
    Goh, Wang Ling
    Yu, Hao
    THIRTY-SECOND AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTIETH INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / EIGHTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2018, : 7194 - 7201
  • [10] Pooling Attention-based Encoder-Decoder Network for semantic segmentation
    Xu, Haixia
    Huang, Yunjia
    Hancock, Edwin R.
    Wang, Shuailong
    Xuan, Qijun
    Zhou, Wei
    COMPUTERS & ELECTRICAL ENGINEERING, 2021, 93