Filling the Gaps in Atrous Convolution: Semantic Segmentation With a Better Context

被引:4
|
作者
Liu, Liyuan [1 ]
Pang, Yanwei [1 ]
Zamir, Syed Waqas [2 ]
Khan, Salman [2 ]
Khan, Fahad Shahbaz [2 ]
Shao, Ling [2 ]
机构
[1] Tianjin Univ, Sch Elect & Informat Engn, Tianjin Key Lab Brain Inspired Intelligence Techn, Tianjin 300072, Peoples R China
[2] Incept Inst Artificial Intelligence, Abu Dhabi 999041, U Arab Emirates
来源
IEEE ACCESS | 2020年 / 8卷
基金
中国国家自然科学基金;
关键词
Semantics; Kernel; Context modeling; Image segmentation; Task analysis; Decoding; Spatial resolution; Image processing; neural networks; semantic segmentation; supervised learning;
D O I
10.1109/ACCESS.2019.2946031
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The main challenge for scene parsing arises when complex scenes with highly diverse objects are encountered. The objects not only differ in scale and appearance but also in semantics. Previous works focus on encoding the multi-scale contextual information (via pooling or atrous convolutions) generally on top of compact high-level features (i.e., at a single stage). In this work, we argue that a rich set of cues exist at multiple stages of the network, encapsulating low, mid and high-level scene details. Therefore, an optimal scene parsing model must aggregate multi-scale context at all three levels of the feature hierarchy; a capability that lacks in state-of-the-art scene parsing models. To address this limitation, we introduce a novel architecture with three new blocks that systematically aggregate low, mid and high tier features. The heart of our approach is a high-level feature aggregation module that augments sparsely connected atrous convolution with dense local and layer-wise connections to avoid gridding artifacts. Besides, we employ a novel feature pyramid augmentation and semantic refinement unit to generate low- and mid-level features that are mixed with high-level features at the decoder. We extensively evaluate our proposed approach on the large-scale Cityscapes and ADE2K benchmarks. Our approach surpasses many latest models on both datasets, achieving mean intersection-over-union (mIoU) scores of 80.5% and 44.0% on Cityscapes and ADE20K, respectively.
引用
收藏
页码:34019 / 34028
页数:10
相关论文
共 50 条
  • [1] SPACEMESHLAB: SPATIAL CONTEXT MEMOIZATION AND MESHGRID ATROUS CONVOLUTION CONSENSUS FOR SEMANTIC SEGMENTATION
    Kim, Taehun
    Kim, Jinseong
    Kim, Daijin
    [J]. 2021 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2021, : 2259 - 2263
  • [2] ATROUS CONVOLUTION FOR BINARY SEMANTIC SEGMENTATION OF LUNG NODULE
    Hesamian, Mohammad Hesam
    Jia, Wenjing
    He, Xiangjian
    Kennedy, Paul J.
    [J]. 2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 1015 - 1019
  • [3] Encoder-Decoder with Atrous Separable Convolution for Semantic Image Segmentation
    Chen, Liang-Chieh
    Zhu, Yukun
    Papandreou, George
    Schroff, Florian
    Adam, Hartwig
    [J]. COMPUTER VISION - ECCV 2018, PT VII, 2018, 11211 : 833 - 851
  • [4] Image Semantic Segmentation Method Based on Atrous Algorithm and Convolution CRF
    Lv, Linjue
    Li, Xingwei
    Jin, Jiating
    Li, Xinlong
    [J]. PROCEEDINGS OF 2019 IEEE 7TH INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND NETWORK TECHNOLOGY (ICCSNT 2019), 2019, : 160 - 165
  • [5] An enhancement model based on dense atrous and inception convolution for image semantic segmentation
    Erjing Zhou
    Xiang Xu
    Baomin Xu
    Hongwei Wu
    [J]. Applied Intelligence, 2023, 53 : 5519 - 5531
  • [6] An enhancement model based on dense atrous and inception convolution for image semantic segmentation
    Zhou, Erjing
    Xu, Xiang
    Xu, Baomin
    Wu, Hongwei
    [J]. APPLIED INTELLIGENCE, 2023, 53 (05) : 5519 - 5531
  • [7] Cascaded Multiscale Structure With Self-Smoothing Atrous Convolution for Semantic Segmentation
    Li, Zhiqiang
    Chen, Xi
    Jiang, Jie
    Han, Zhen
    Li, Zhihong
    Fang, Tao
    Huo, Hong
    Li, Qingli
    Liu, Min
    [J]. IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
  • [8] Semantic Segmentation of Tennis Scene Based on Series Atrous Convolution Neural Network
    Li, Yuyan
    Zhang, Yinhui
    He, Zifen
    [J]. Jisuanji Fuzhu Sheji Yu Tuxingxue Xuebao/Journal of Computer-Aided Design and Computer Graphics, 2020, 32 (04): : 606 - 615
  • [9] See more than once: Kernel-sharing atrous convolution for semantic segmentation
    Huang, Ye
    Wang, Qingqing
    Jia, Wenjing
    Lu, Yue
    Li, Yuxin
    He, Xiangjian
    [J]. NEUROCOMPUTING, 2021, 443 : 26 - 34
  • [10] A Novel Upsampling and Context Convolution for Image Semantic Segmentation
    Sediqi, Khwaja Monib
    Lee, Hyo Jong
    [J]. SENSORS, 2021, 21 (06) : 1 - 16