Attention Augmented Convolutional Networks

被引:825
|
作者
Bello, Irwan [1 ]
Zoph, Barret [1 ]
Vaswani, Ashish [1 ]
Shlens, Jonathon [1 ]
Le, Quoc V. [1 ]
机构
[1] Google Brain, Mountain View, CA 94043 USA
关键词
ARCHITECTURES;
D O I
10.1109/ICCV.2019.00338
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Convolutional networks have been the paradigm of choice in many computer vision applications. The convolution operation however has a significant weakness in that it only operates on a local neighborhood, thus missing global information. Self-attention, on the other hand, has emerged as a recent advance to capture long range interactions, but has mostly been applied to sequence modeling and generative modeling tasks. In this paper, we consider the use of self-attention for discriminative visual tasks as an alternative to convolutions. We introduce a novel two-dimensional relative self-attention mechanism that proves competitive in replacing convolutions as a stand-alone computational primitive for image classification. We find in control experiments that the best results are obtained when combining both convolutions and self-attention. We therefore propose to augment convolutional operators with this self-attention mechanism by concatenating convolutional feature maps with a set of feature maps produced via self-attention. Extensive experiments show that Attention Augmentation leads to consistent improvements in image classification on ImageNet and object detection on COCO across many different models and scales, including ResNets and a stateof-the art mobile constrained network, while keeping the number of parameters similar. In particular, our method achieves a 1.3% top-1 accuracy improvement on ImageNet classification over a ResNet50 baseline and outperforms other attention mechanisms for images such as Squeeze-and-Excitation [17]. It also achieves an improvement of 1.4 mAP in COCO Object Detection on top of a RetinaNet baseline.
引用
收藏
页码:3285 / 3294
页数:10
相关论文
共 50 条
  • [31] Attention based convolutional networks for traffic flow prediction
    Lin, Juncong
    Lin, Chengqiao
    Ye, Qi
    MULTIMEDIA TOOLS AND APPLICATIONS, 2024, 83 (03) : 7379 - 7394
  • [32] Quantifying Student Attention using Convolutional Neural Networks
    Coaja, Andreea
    Rusu, Catalin, V
    ICAART: PROCEEDINGS OF THE 14TH INTERNATIONAL CONFERENCE ON AGENTS AND ARTIFICIAL INTELLIGENCE - VOL 3, 2022, : 293 - 299
  • [33] Attention Based Graph Convolutional Networks for Trajectory Prediction
    Chen, Jianxiao
    Chen, Guang
    Li, Zhijun
    Wu, Ya
    Knoll, Alois
    2021 6TH IEEE INTERNATIONAL CONFERENCE ON ADVANCED ROBOTICS AND MECHATRONICS (ICARM 2021), 2021, : 852 - 857
  • [34] GAttANet: Global Attention Agreement for Convolutional Neural Networks
    VanRullen, Rufin
    Alamia, Andrea
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2021, PT I, 2021, 12891 : 281 - 293
  • [35] Neural Architecture Search for Convolutional Neural Networks with Attention
    Nakai, Kohei
    Matsubara, Takashi
    Uehara, Kuniaki
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2021, E104D (02) : 312 - 321
  • [36] Attention based convolutional networks for traffic flow prediction
    Juncong Lin
    Chengqiao Lin
    Qi Ye
    Multimedia Tools and Applications, 2024, 83 : 7379 - 7394
  • [37] Spatial Channel Attention for Deep Convolutional Neural Networks
    Liu, Tonglai
    Luo, Ronghai
    Xu, Longqin
    Feng, Dachun
    Cao, Liang
    Liu, Shuangyin
    Guo, Jianjun
    MATHEMATICS, 2022, 10 (10)
  • [38] Circular Convolutional Neural Networks Based on Triplet Attention
    Wang J.
    Lei J.
    Zhang J.
    Sun S.
    Moshi Shibie yu Rengong Zhineng/Pattern Recognition and Artificial Intelligence, 2022, 35 (02): : 116 - 129
  • [39] Spatial Pyramid Attention for Deep Convolutional Neural Networks
    Ma, Xu
    Guo, Jingda
    Sansom, Andrew
    McGuire, Mara
    Kalaani, Andrew
    Chen, Qi
    Tang, Sihai
    Yang, Qing
    Fu, Song
    IEEE TRANSACTIONS ON MULTIMEDIA, 2021, 23 : 3048 - 3058
  • [40] Attention Augmented Convolutional Neural Network for acoustics based machine state estimation
    Tan, Jiannan
    Oyekan, John
    APPLIED SOFT COMPUTING, 2021, 110