Multi-scale image semantic segmentation based on ASPP and improved HRNet

被引:5
|
作者
Shi Jian-feng [1 ]
Gao Zhi-ming [2 ]
Wang A-chuan [1 ]
机构
[1] Northeast Forestry Univ, Coll Informat & Comp Engn, Harbin 150040, Peoples R China
[2] Northeast Forestry Univ, Coll Mech & Elect Engn, Harbin 150040, Peoples R China
关键词
semanticsegmentation; deeplearning; neuralnetwork; highresolutionnetwork;
D O I
10.37188/CJLCD.2021-0093
中图分类号
O7 [晶体学];
学科分类号
0702 ; 070205 ; 0703 ; 080501 ;
摘要
Aiming at the problems of huge model and multi-scale object segmentation in the classical semantic segmentation algorithm, an efficient multi-scale image semantic segmentation method based on ASPP and HRNet is proposed. Firstly, the basic block is improved by using deep separable convolution combined with 1 * 1 convolution to reduce the model parameters. Secondly, a batch normalization layer (BN) is added after all convolution layers and before the relu activation function to improve the dead relu problem. Finally, the improved ASPP module based on the hybrid dilated convolution is added, and the advantages of the two are fused by using the parallel upsampling channels to obtain the spatial accurate segmentation results. The RE-ASPP-HRNet is proposed. Results on Pascal voc2012 and CityScapes show that the proposed method is effective. Compared with the original HRNet, it can improve the accuracy of 0.8% or 0.5% MIoU, and reduce the number of parameters by 1/2 and memory by 1/3. We implement a more efficient and reliable multi-scale semantic segmentation algorithm.
引用
收藏
页码:1497 / 1505
页数:9
相关论文
共 16 条
  • [1] SegNet: A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation
    Badrinarayanan, Vijay
    Kendall, Alex
    Cipolla, Roberto
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2017, 39 (12) : 2481 - 2495
  • [2] DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs
    Chen, Liang-Chieh
    Papandreou, George
    Kokkinos, Iasonas
    Murphy, Kevin
    Yuille, Alan L.
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2018, 40 (04) : 834 - 848
  • [3] Ultrasound image segmentation based on a multi-parameter Gabor filter and multiscale local level set method
    Chen Xiao-dong
    Sheng Jing
    Yang Jin
    Cai Huai-yu
    Jin Hao
    [J]. CHINESE OPTICS, 2020, 13 (05): : 1075 - 1084
  • [4] CHENLC ZHUYK, 2018, 15 EUR C COMP VIS
  • [5] Xception: Deep Learning with Depthwise Separable Convolutions
    Chollet, Francois
    [J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 1800 - 1807
  • [6] The Cityscapes Dataset for Semantic Urban Scene Understanding
    Cordts, Marius
    Omran, Mohamed
    Ramos, Sebastian
    Rehfeld, Timo
    Enzweiler, Markus
    Benenson, Rodrigo
    Franke, Uwe
    Roth, Stefan
    Schiele, Bernt
    [J]. 2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 3213 - 3223
  • [7] Ioffe S, 2015, PR MACH LEARN RES, V37, P448
  • [8] U-Net: Convolutional Networks for Biomedical Image Segmentation
    Ronneberger, Olaf
    Fischer, Philipp
    Brox, Thomas
    [J]. MEDICAL IMAGE COMPUTING AND COMPUTER-ASSISTED INTERVENTION, PT III, 2015, 9351 : 234 - 241
  • [9] Fully Convolutional Networks for Semantic Segmentation
    Shelhamer, Evan
    Long, Jonathan
    Darrell, Trevor
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2017, 39 (04) : 640 - 651
  • [10] Deep High-Resolution Representation Learning for Human Pose Estimation
    Sun, Ke
    Xiao, Bin
    Liu, Dong
    Wang, Jingdong
    [J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 5686 - 5696