Optimization of Image Semantic Segmentation Algorithms Based on Deeplab v3+

被引:5
|
作者
Meng Junxi [1 ]
Zhang Li [1 ]
Cao Yang [1 ]
Zhang Letian [1 ]
Song Qian [1 ]
机构
[1] Xian Polytech Univ, Coll Elect & Informat, Xian 710600, Shaanxi, Peoples R China
关键词
deep learning; image semantic segmentation; Deeplab v3+; attentional mechanism;
D O I
10.3788/LOP202259.1610009
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Herein, a new semantic segmentation model N-Deeplab v3+ was proposed based on the existing Deeplab v3+ algorithm. The proposed model can be used to address some severe problems of Deeplab v3+ related to the loss of details, such as missing and incorrect segmentations, during image semantic segmentation. The new model designed an atrous spatial pyramid pooling structure with heteroreceptive field splicing to enhance the correlation between different-level data. The feature fusion of multiple crosslayers is performed to improve the characterization of image details. A feature alignment module based on the attention mechanism was developed to guide the alignment of high- and low-level features and enhance the learning process for important channel features in a targeted manner, thus improving the learning ability of the model. Experimental results based on the Cityscapes dataset show that the proposed model can effectively increase the attention for small-scale targets, alleviate the problem of target mis-segmentation, and show improved semantic segmentation accuracy. The generalization capability of the proposed model is further verified on the PASCAL VOC 2012 dataset. The mean intersection over union of N-Deeplab v3+ on the Cityscapes dataset and PASCAL VOC 2012 dataset reaches 76. 31% and 81. 97%, respectively, showing improvements of 1. 69 percentage points and 2. 14 percentage points, respectively, compared with the original model.
引用
收藏
页数:10
相关论文
共 19 条
  • [1] SegNet: A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation
    Badrinarayanan, Vijay
    Kendall, Alex
    Cipolla, Roberto
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2017, 39 (12) : 2481 - 2495
  • [2] Importance-Aware Semantic Segmentation for Autonomous Vehicles
    Chen, Bike
    Gong, Chen
    Yang, Jian
    [J]. IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2019, 20 (01) : 137 - 148
  • [3] Chen H, ACTA OPT SIN, V41
  • [4] Chen LC, 2016, Arxiv, DOI arXiv:1412.7062
  • [5] Chen LC, 2017, Arxiv, DOI [arXiv:1706.05587, DOI 10.48550/ARXIV.1706.05587]
  • [6] Encoder-Decoder with Atrous Separable Convolution for Semantic Image Segmentation
    Chen, Liang-Chieh
    Zhu, Yukun
    Papandreou, George
    Schroff, Florian
    Adam, Hartwig
    [J]. COMPUTER VISION - ECCV 2018, PT VII, 2018, 11211 : 833 - 851
  • [7] DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs
    Chen, Liang-Chieh
    Papandreou, George
    Kokkinos, Iasonas
    Murphy, Kevin
    Yuille, Alan L.
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2018, 40 (04) : 834 - 848
  • [8] Xception: Deep Learning with Depthwise Separable Convolutions
    Chollet, Francois
    [J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 1800 - 1807
  • [9] Deformable Convolutional Networks
    Dai, Jifeng
    Qi, Haozhi
    Xiong, Yuwen
    Li, Yi
    Zhang, Guodong
    Hu, Han
    Wei, Yichen
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, : 764 - 773
  • [10] Image Semantic Description Algorithm with Integrated Spatial Attention Mechanism
    Guo Lie
    Zhang Tuanshan
    Sun Weizhen
    Guo Jielong
    [J]. LASER & OPTOELECTRONICS PROGRESS, 2021, 58 (12)