Attention Augmented Convolutional Networks

被引:825
|
作者
Bello, Irwan [1 ]
Zoph, Barret [1 ]
Vaswani, Ashish [1 ]
Shlens, Jonathon [1 ]
Le, Quoc V. [1 ]
机构
[1] Google Brain, Mountain View, CA 94043 USA
关键词
ARCHITECTURES;
D O I
10.1109/ICCV.2019.00338
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Convolutional networks have been the paradigm of choice in many computer vision applications. The convolution operation however has a significant weakness in that it only operates on a local neighborhood, thus missing global information. Self-attention, on the other hand, has emerged as a recent advance to capture long range interactions, but has mostly been applied to sequence modeling and generative modeling tasks. In this paper, we consider the use of self-attention for discriminative visual tasks as an alternative to convolutions. We introduce a novel two-dimensional relative self-attention mechanism that proves competitive in replacing convolutions as a stand-alone computational primitive for image classification. We find in control experiments that the best results are obtained when combining both convolutions and self-attention. We therefore propose to augment convolutional operators with this self-attention mechanism by concatenating convolutional feature maps with a set of feature maps produced via self-attention. Extensive experiments show that Attention Augmentation leads to consistent improvements in image classification on ImageNet and object detection on COCO across many different models and scales, including ResNets and a stateof-the art mobile constrained network, while keeping the number of parameters similar. In particular, our method achieves a 1.3% top-1 accuracy improvement on ImageNet classification over a ResNet50 baseline and outperforms other attention mechanisms for images such as Squeeze-and-Excitation [17]. It also achieves an improvement of 1.4 mAP in COCO Object Detection on top of a RetinaNet baseline.
引用
收藏
页码:3285 / 3294
页数:10
相关论文
共 50 条
  • [41] An Attention-augmented Fully Convolutional Neural Network for Monaural Speech Enhancement
    Xu, Zezheng
    Jiang, Ting
    Li, Chao
    Yu, Jiacheng
    2021 12TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING (ISCSLP), 2021,
  • [42] Prediction of Transcription Factor Binding Sites With an Attention Augmented Convolutional Neural Network
    Jing, Fang
    Zhang, Shao-Wu
    Zhang, Shihua
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2022, 19 (06) : 3614 - 3623
  • [43] Attention Visualization of Gated Convolutional Neural Networks with Self Attention in Sentiment Analysis
    Yanagimto, Hidekazu
    Hashimoto, Kiyota
    Okada, Makoto
    2018 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND DATA ENGINEERING (ICMLDE 2018), 2018, : 77 - 82
  • [44] AAN-Face: Attention Augmented Networks for Face Recognition
    Wang, Qiangchang
    Guo, Guodong
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2021, 30 : 7636 - 7648
  • [45] BCA: Bilinear Convolutional Neural Networks and Attention Networks for legal question answering
    Zhang, Haiguang
    Zhang, Tongyue
    Cao, Faxin
    Wang, Zhizheng
    Zhang, Yuanyu
    Sun, Yuanyuan
    Vicente, Mark Anthony
    AI OPEN, 2022, 3 : 172 - 181
  • [46] Sentiment Lexical-Augmented Convolutional Neural Networks for Sentiment Analysis
    Yin, Rongchao
    Li, Peng
    Wang, Bin
    2017 IEEE SECOND INTERNATIONAL CONFERENCE ON DATA SCIENCE IN CYBERSPACE (DSC), 2017, : 630 - 635
  • [47] Retrieval Augmented Convolutional Encoder-decoder Networks for Video Captioning
    Chen, Jingwen
    Pan, Yingwei
    Li, Yehao
    Yao, Ting
    Chao, Hongyang
    Mei, Tao
    ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2023, 19 (01)
  • [48] Retrieval-Augmented Convolutional Neural Networks against Adversarial Examples
    Zhao , Jake
    Cho, Kyunghyun
    2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 11555 - 11563
  • [49] Context-augmented convolutional neural networks for twitter sarcasm detection
    Ren, Yafeng
    Ji, Donghong
    Ren, Han
    NEUROCOMPUTING, 2018, 308 : 1 - 7
  • [50] Attention-Augmented Convolutional Autoencoder for Radar-Based Human Activity Recognition
    Campbell, Christopher
    Ahmad, Fauzia
    2020 IEEE INTERNATIONAL RADAR CONFERENCE (RADAR), 2020, : 990 - 995