Scale-aware spatial pyramid pooling with both encoder-mask and scale-attention for semantic segmentation

被引:17
|
作者
Zhou, Feng [1 ]
Hu, Yong [1 ,2 ]
Shen, Xukun [1 ,2 ]
机构
[1] Beihang Univ, State Key Lab Virtual Real Technol & Syst, Beijing, Peoples R China
[2] Beihang Univ, Sch New Media Art & Design, Beijing, Peoples R China
关键词
Scene understanding; Semantic segmentation; Encoder-decoder; Scale selection; Attention; FEATURES;
D O I
10.1016/j.neucom.2019.11.042
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper focuses on semantic segmentation of scenes by capturing the appropriate scale, rich detail and contextual dependencies in a feature representation. Semantic segmentation is a pixel-level classification task and has made steady progress on the basis of fully convolutional networks (FCNs). However, we find there is still room for improvement in the following aspects. The first is that the pixel itself has not enough information for semantic prediction, it needs to look around to determine which category it belongs to. However, the fixed size of the receptive field defined by the network is not suitable for all pixels when an image contains objects with various scales. The second is that the extracted scale-aware features do not handle sharper object boundaries due to low-resolution. The final aspect is regarding the ability to model long-range dependencies. In order to solve the above challenges, in this paper we propose three modules: Scale-aware spatial pyramid pool module, Encoder mask module and Scale-attention module (SSPP-ES). Extensive experiments on the Cityscapes and ADE20K benchmarks demonstrate the effectiveness of our approach for semantic segmentation. (C) 2019 Elsevier B.V. All rights reserved.
引用
收藏
页码:174 / 182
页数:9
相关论文
共 35 条
  • [1] Attention to Scale: Scale-aware Semantic Image Segmentation
    Chen, Liang-Chieh
    Yang, Yi
    Wang, Jiang
    Xu, Wei
    Yuille, Alan L.
    [J]. 2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 3640 - 3649
  • [2] Scale-aware attention network for weakly supervised semantic segmentation
    Cao, Zhiyuan
    Gao, Yufei
    Zhang, Jiacai
    [J]. NEUROCOMPUTING, 2022, 492 : 34 - 49
  • [3] Scale-aware attention network for weakly supervised semantic segmentation
    Cao, Zhiyuan
    Gao, Yufei
    Zhang, Jiacai
    [J]. Neurocomputing, 2022, 492 : 34 - 49
  • [4] Pyramid Scale-aware and Soft-channel spatial attention for traffic sign detection
    Liu, Yalei
    Wu, Jinghua
    Sheng, Xueliang
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 83 (12) : 37201 - 37224
  • [5] Pyramid Scale-aware and Soft-channel spatial attention for traffic sign detection
    Yalei Liu
    Jinghua Wu
    Xueliang Sheng
    [J]. Multimedia Tools and Applications, 2024, 83 : 37201 - 37224
  • [6] Scale-Aware Feature Network for Weakly Supervised Semantic Segmentation
    Xu, Lian
    Bennamoun, Mohammed
    Boussaid, Farid
    Sohel, Ferdous
    [J]. IEEE ACCESS, 2020, 8 : 75957 - 75967
  • [7] Semantic segmentation using stride spatial pyramid pooling and dual attention decoder
    Peng, Chengli
    Ma, Jiayi
    [J]. PATTERN RECOGNITION, 2020, 107
  • [8] Multi scale-aware attention for pyramid convolution network on finger vein recognition
    Zhang, Huijie
    Sun, Weizhen
    Lv, Ling
    [J]. SCIENTIFIC REPORTS, 2024, 14 (01)
  • [9] Multi scale-aware attention for pyramid convolution network on finger vein recognition
    Huijie Zhang
    Weizhen Sun
    Ling Lv
    [J]. Scientific Reports, 14
  • [10] Instance Semantic Segmentation via Scale-Aware Patch Fusion Network
    Yang, Jinfu
    Zhang, Jingling
    Li, Mingai
    Wang, Meijie
    [J]. COMPUTER VISION, PT II, 2017, 772 : 521 - 532