Attention to Scale: Scale-aware Semantic Image Segmentation

被引:990
|
作者
Chen, Liang-Chieh [1 ]
Yang, Yi [1 ]
Wang, Jiang [1 ]
Xu, Wei [1 ]
Yuille, Alan L. [1 ]
机构
[1] Baidu USA, Sunnyvale, CA 94089 USA
关键词
D O I
10.1109/CVPR.2016.396
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Incorporating multi-scale features in fully convolutional neural networks (FCNs) has been a key element to achieving state-of-the-art performance on semantic image segmentation. One common way to extract multi-scale features is to feed multiple resized input images to a shared deep network and then merge the resulting features for pixelwise classification. In this work, we propose an attention mechanism that learns to softly weight the multi-scale features at each pixel location. We adapt a state-of-the-art semantic image segmentation model, which we jointly train with multi-scale input images and the attention model. The proposed attention model not only outperforms average-and max-pooling, but allows us to diagnostically visualize the importance of features at different positions and scales. Moreover, we show that adding extra supervision to the output at each scale is essential to achieving excellent performance when merging multi-scale features. We demonstrate the effectiveness of our model with extensive experiments on three challenging datasets, including PASCAL-Person-Part, PASCAL VOC 2012 and a subset of MS-COCO 2014.
引用
收藏
页码:3640 / 3649
页数:10
相关论文
共 50 条
  • [41] Scale-Aware RPN for Vehicle Detection
    Ding, Lu
    Wang, Yong
    Laganiere, Robert
    Luo, Xinbin
    Fu, Shan
    [J]. ADVANCES IN VISUAL COMPUTING, ISVC 2018, 2018, 11241 : 487 - 499
  • [42] Scale-Aware Distillation Network for Lightweight Image Super-Resolution
    Lu, Haowei
    Lu, Yao
    Li, Gongping
    Sun, Yanbei
    Wang, Shunzhou
    Li, Yugang
    [J]. PATTERN RECOGNITION AND COMPUTER VISION,, PT III, 2021, 13021 : 128 - 139
  • [43] Multi scale-aware attention for pyramid convolution network on finger vein recognition
    Zhang, Huijie
    Sun, Weizhen
    Lv, Ling
    [J]. SCIENTIFIC REPORTS, 2024, 14 (01)
  • [44] An enhanced vision transformer with scale-aware and spatial-aware attention for thighbone fracture detection
    Guan, Bin
    Yao, Jinkun
    Zhang, Guoshan
    [J]. Neural Computing and Applications, 2024, 36 (19) : 11425 - 11438
  • [45] Scale-Aware Spatially Guided Mapping
    Hao, Shijie
    Guo, Yanrong
    Hong, Richang
    Wang, Meng
    [J]. IEEE MULTIMEDIA, 2016, 23 (03) : 34 - 42
  • [46] PROGRESSIVE SCALE-AWARE NETWORK FOR REMOTE SENSING IMAGE CHANGE CAPTIONING
    Liu, Chenyang
    Yang, Jiajun
    Qi, Zipeng
    Zou, Zhengxia
    Shi, Zhenwei
    [J]. IGARSS 2023 - 2023 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM, 2023, : 6668 - 6671
  • [47] Multi scale-aware attention for pyramid convolution network on finger vein recognition
    Huijie Zhang
    Weizhen Sun
    Ling Lv
    [J]. Scientific Reports, 14
  • [48] SAR: Scale-Aware Restoration Learning for 3D Tumor Segmentation
    Zhang, Xiaoman
    Feng, Shixiang
    Zhou, Yuhang
    Zhang, Ya
    Wang, Yanfeng
    [J]. MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION - MICCAI 2021, PT II, 2021, 12902 : 124 - 133
  • [49] Object Counting in Remote Sensing via Triple Attention and Scale-Aware Network
    Guo, Xiangyu
    Anisetti, Marco
    Gao, Mingliang
    Jeon, Gwanggil
    [J]. REMOTE SENSING, 2022, 14 (24)
  • [50] Scale-aware dimension-wise attention network for small ship instance segmentation in synthetic aperture radar images
    Ke, Xiao
    Zhang, Tianwen
    Shao, Zikang
    [J]. JOURNAL OF APPLIED REMOTE SENSING, 2023, 17 (04)