DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs

Cited by: 13335
Authors
Chen, Liang-Chieh [1]
Papandreou, George [1]
Kokkinos, Iasonas [2]
Murphy, Kevin [1]
Yuille, Alan L. [3, 4]
Affiliations
[1] Google Inc, Mountain View, CA 94043 USA
[2] UCL, London WC1E 6BT, England
[3] Johns Hopkins Univ, Dept Cognit Sci, Baltimore, MD 21218 USA
[4] Johns Hopkins Univ, Dept Comp Sci, Baltimore, MD 21218 USA
Funding
European Union Horizon 2020
Keywords
Convolutional neural networks; semantic segmentation; atrous convolution; conditional random fields; discrete wavelet transform; networks
DOI
10.1109/TPAMI.2017.2699184
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
In this work we address the task of semantic image segmentation with Deep Learning and make three main contributions that are experimentally shown to have substantial practical merit. First, we highlight convolution with upsampled filters, or 'atrous convolution', as a powerful tool in dense prediction tasks. Atrous convolution allows us to explicitly control the resolution at which feature responses are computed within Deep Convolutional Neural Networks. It also allows us to effectively enlarge the field of view of filters to incorporate larger context without increasing the number of parameters or the amount of computation. Second, we propose atrous spatial pyramid pooling (ASPP) to robustly segment objects at multiple scales. ASPP probes an incoming convolutional feature layer with filters at multiple sampling rates and effective fields-of-views, thus capturing objects as well as image context at multiple scales. Third, we improve the localization of object boundaries by combining methods from DCNNs and probabilistic graphical models. The commonly deployed combination of max-pooling and downsampling in DCNNs achieves invariance but has a toll on localization accuracy. We overcome this by combining the responses at the final DCNN layer with a fully connected Conditional Random Field (CRF), which is shown both qualitatively and quantitatively to improve localization performance. Our proposed "DeepLab" system sets the new state-of-art at the PASCAL VOC-2012 semantic image segmentation task, reaching 79.7 percent mIOU in the test set, and advances the results on three other datasets: PASCAL-Context, PASCAL-Person-Part, and Cityscapes. All of our code is made publicly available online.
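The abstract's claim that atrous convolution enlarges the field of view without adding parameters can be illustrated with a small 1-D sketch. This is not the authors' released code: the function names `atrous_conv1d`, `effective_kernel_size`, and `aspp_1d` are hypothetical, the zero padding and centered kernel are assumptions made for illustration, and fusing the ASPP branches by summation is likewise an assumption of this sketch.

```python
def atrous_conv1d(signal, kernel, rate=1):
    """1-D atrous (dilated) convolution, same-length output, zero padding.

    Computes y[i] = sum_k signal[i + rate * (k - center)] * kernel[k],
    where out-of-range samples count as zero. The kernel keeps the same
    number of weights for every rate; only the spacing between taps grows.
    """
    if rate < 1:
        raise ValueError("rate must be >= 1")
    center = (len(kernel) - 1) // 2
    n = len(signal)
    out = []
    for i in range(n):
        acc = 0.0
        for k, w in enumerate(kernel):
            j = i + rate * (k - center)  # taps are spaced `rate` samples apart
            if 0 <= j < n:
                acc += signal[j] * w
        out.append(acc)
    return out


def effective_kernel_size(kernel_size, rate):
    """Span covered by a kernel once (rate - 1) gaps sit between its taps."""
    return kernel_size + (kernel_size - 1) * (rate - 1)


def aspp_1d(signal, kernel, rates):
    """Toy ASPP: probe the same input at several rates, then fuse.

    Summation is used as the fusion step here (an assumption of this sketch).
    """
    branches = [atrous_conv1d(signal, kernel, r) for r in rates]
    return [sum(values) for values in zip(*branches)]


if __name__ == "__main__":
    impulse = [0, 0, 0, 1, 0, 0, 0]
    weights = [1, 2, 3]
    print(atrous_conv1d(impulse, weights, rate=1))
    print(atrous_conv1d(impulse, weights, rate=2))
    print(effective_kernel_size(3, 2))
    print(aspp_1d(impulse, weights, rates=[1, 2]))
```

With a rate of 2, the same three weights respond over a span of five samples, which is the sense in which the field of view grows while the parameter count stays fixed.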
Pages: 834-848
Page count: 15