Attention to Scale: Scale-aware Semantic Image Segmentation

被引:990
|
作者
Chen, Liang-Chieh [1 ]
Yang, Yi [1 ]
Wang, Jiang [1 ]
Xu, Wei [1 ]
Yuille, Alan L. [1 ]
机构
[1] Baidu USA, Sunnyvale, CA 94089 USA
关键词
D O I
10.1109/CVPR.2016.396
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Incorporating multi-scale features in fully convolutional neural networks (FCNs) has been a key element to achieving state-of-the-art performance on semantic image segmentation. One common way to extract multi-scale features is to feed multiple resized input images to a shared deep network and then merge the resulting features for pixelwise classification. In this work, we propose an attention mechanism that learns to softly weight the multi-scale features at each pixel location. We adapt a state-of-the-art semantic image segmentation model, which we jointly train with multi-scale input images and the attention model. The proposed attention model not only outperforms average-and max-pooling, but allows us to diagnostically visualize the importance of features at different positions and scales. Moreover, we show that adding extra supervision to the output at each scale is essential to achieving excellent performance when merging multi-scale features. We demonstrate the effectiveness of our model with extensive experiments on three challenging datasets, including PASCAL-Person-Part, PASCAL VOC 2012 and a subset of MS-COCO 2014.
引用
收藏
页码:3640 / 3649
页数:10
相关论文
共 50 条
  • [1] Scale-aware attention network for weakly supervised semantic segmentation
    Cao, Zhiyuan
    Gao, Yufei
    Zhang, Jiacai
    [J]. NEUROCOMPUTING, 2022, 492 : 34 - 49
  • [2] Scale-aware attention network for weakly supervised semantic segmentation
    Cao, Zhiyuan
    Gao, Yufei
    Zhang, Jiacai
    [J]. Neurocomputing, 2022, 492 : 34 - 49
  • [3] Scale-Aware Alignment of Hierarchical Image Segmentation
    Chen, Yuhua
    Dai, Dengxin
    Pont-Tuset, Jordi
    Van Gool, Luc
    [J]. 2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 364 - 372
  • [4] Scale-aware spatial pyramid pooling with both encoder-mask and scale-attention for semantic segmentation
    Zhou, Feng
    Hu, Yong
    Shen, Xukun
    [J]. NEUROCOMPUTING, 2020, 383 : 174 - 182
  • [5] Scale-Aware Detailed Matching for Few-Shot Aerial Image Semantic Segmentation
    Yao, Xiwen
    Cao, Qinglong
    Feng, Xiaoxu
    Cheng, Gong
    Han, Junwei
    [J]. IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
  • [6] Scale-Aware Feature Network for Weakly Supervised Semantic Segmentation
    Xu, Lian
    Bennamoun, Mohammed
    Boussaid, Farid
    Sohel, Ferdous
    [J]. IEEE ACCESS, 2020, 8 : 75957 - 75967
  • [7] Instance Semantic Segmentation via Scale-Aware Patch Fusion Network
    Yang, Jinfu
    Zhang, Jingling
    Li, Mingai
    Wang, Meijie
    [J]. COMPUTER VISION, PT II, 2017, 772 : 521 - 532
  • [8] Collaborative multi-feature extraction and scale-aware semantic information mining for medical image segmentation
    Zhang, Ruijun
    He, Zixuan
    Zhu, Jian
    Yuan, Xiaochen
    Huang, Guoheng
    Pun, Chi-Man
    Peng, Jianhong
    Lin, Junzhong
    Zhou, Jian
    [J]. PHYSICS IN MEDICINE AND BIOLOGY, 2022, 67 (20):
  • [9] Scale-Aware Graph Neural Network for Few-Shot Semantic Segmentation
    Xie, Guo-Sen
    Liu, Jie
    Xiong, Huan
    Shao, Ling
    [J]. 2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 5471 - 5480
  • [10] DEEP SCALE-AWARE IMAGE SMOOTHING
    Li, Jiachun
    Qin, Kunkun
    Xu, Ruotao
    Ji, Hui
    [J]. 2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 2105 - 2109