Attention Augmented Convolutional Networks

Cited by: 825
Authors
Bello, Irwan [1]
Zoph, Barret [1]
Vaswani, Ashish [1]
Shlens, Jonathon [1]
Le, Quoc V. [1]
Affiliations
[1] Google Brain, Mountain View, CA 94043 USA
Keywords
ARCHITECTURES;
DOI
10.1109/ICCV.2019.00338
Chinese Library Classification number
TP18 [Artificial Intelligence Theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Convolutional networks have been the paradigm of choice in many computer vision applications. The convolution operation, however, has a significant weakness in that it only operates on a local neighborhood, thus missing global information. Self-attention, on the other hand, has emerged as a recent advance to capture long range interactions, but has mostly been applied to sequence modeling and generative modeling tasks. In this paper, we consider the use of self-attention for discriminative visual tasks as an alternative to convolutions. We introduce a novel two-dimensional relative self-attention mechanism that proves competitive in replacing convolutions as a stand-alone computational primitive for image classification. We find in control experiments that the best results are obtained when combining both convolutions and self-attention. We therefore propose to augment convolutional operators with this self-attention mechanism by concatenating convolutional feature maps with a set of feature maps produced via self-attention. Extensive experiments show that Attention Augmentation leads to consistent improvements in image classification on ImageNet and object detection on COCO across many different models and scales, including ResNets and a state-of-the-art mobile constrained network, while keeping the number of parameters similar. In particular, our method achieves a 1.3% top-1 accuracy improvement on ImageNet classification over a ResNet50 baseline and outperforms other attention mechanisms for images such as Squeeze-and-Excitation [17]. It also achieves an improvement of 1.4 mAP in COCO Object Detection on top of a RetinaNet baseline.
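The concatenation idea in the abstract can be illustrated with a small NumPy sketch. This is not the authors' implementation: it assumes a single attention head, leaves out the two-dimensional relative position terms the abstract mentions, and uses a naive 3x3 convolution. The function names (`conv3x3`, `self_attention_2d`, `augmented_conv`) and all shapes are hypothetical, chosen only for illustration.

```python
import numpy as np


def conv3x3(x, w):
    """Naive 'same' 3x3 convolution (local neighborhood only).

    x: (H, W, C_in) feature map, w: (3, 3, C_in, C_out) kernel.
    """
    H, W, _ = x.shape
    xp = np.pad(x, ((1, 1), (1, 1), (0, 0)))  # zero-pad the spatial dims
    out = np.zeros((H, W, w.shape[-1]))
    for i in range(3):
        for j in range(3):
            # Each kernel tap sees the input shifted by (i - 1, j - 1).
            out += xp[i:i + H, j:j + W, :] @ w[i, j]
    return out


def self_attention_2d(x, wq, wk, wv):
    """Single-head self-attention over all H*W positions (global context).

    Assumption: no relative position terms, no output projection.
    """
    H, W, C = x.shape
    flat = x.reshape(H * W, C)
    q, k, v = flat @ wq, flat @ wk, flat @ wv
    logits = (q @ k.T) / np.sqrt(q.shape[-1])
    logits -= logits.max(axis=-1, keepdims=True)  # numerical stability
    weights = np.exp(logits)
    weights /= weights.sum(axis=-1, keepdims=True)  # softmax over positions
    return (weights @ v).reshape(H, W, -1)


def augmented_conv(x, w_conv, wq, wk, wv):
    """Concatenate convolutional and self-attention feature maps by channel."""
    conv_maps = conv3x3(x, w_conv)
    attn_maps = self_attention_2d(x, wq, wk, wv)
    return np.concatenate([conv_maps, attn_maps], axis=-1)


# Hypothetical sizes: 8x8 map, 16 input channels, 24 conv + 8 attention outputs.
rng = np.random.default_rng(0)
x = rng.normal(size=(8, 8, 16))
w_conv = rng.normal(size=(3, 3, 16, 24)) * 0.1
wq = rng.normal(size=(16, 8)) * 0.1
wk = rng.normal(size=(16, 8)) * 0.1
wv = rng.normal(size=(16, 8)) * 0.1

y = augmented_conv(x, w_conv, wq, wk, wv)
print(y.shape)  # (8, 8, 32)
```

In this sketch the first 24 output channels depend only on each position's 3x3 neighborhood, while the last 8 channels mix information from every position in the map.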
Pages: 3285 - 3294
Page count: 10
Related papers
50 in total
  • [1] Dark web author alignment based on attention augmented convolutional networks
    Yang Y.
    Du Y.
    Liu H.
    Zhao J.
    Shi J.
    Wang X.
    Xi'an Dianzi Keji Daxue Xuebao/Journal of Xidian University, 2023, 50 (04) : 206 - 214
  • [2] A Trimodel SAR Semisupervised Recognition Method Based on Attention-Augmented Convolutional Networks
    Yan, Sifan
    Zhang, Yaotian
    Gao, Fei
    Sun, Jinping
    Hussain, Amir
    Zhou, Huiyu
    IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2022, 15 : 9566 - 9583
  • [3] Building siamese attention-augmented recurrent convolutional neural networks for document similarity scoring
    Han, Sifei
    Shi, Lingyun
    Richie, Russell
    Tsui, Fuchiang R. Rich
    INFORMATION SCIENCES, 2022, 615 : 90 - 102
  • [4] Enhancing Plant Disease Detection Using Attention-Augmented Residual Networks and Faster Region-Convolutional Networks
    Sathya, K.
    Balakrishnan, Arunkumar
    Baskaran, P.
    Ramamoorthy, Arun Kumar
    IEEE ACCESS, 2025, 13 : 48625 - 48642
  • [5] Convolutional Self-Attention Networks
    Yang, Baosong
    Wang, Longyue
    Wong, Derek F.
    Chao, Lidia S.
    Tu, Zhaopeng
    2019 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL HLT 2019), VOL. 1, 2019 : 4040 - 4045
  • [6] An Attention Module for Convolutional Neural Networks
    Zhu, Baozhou
    Hofstee, Peter
    Lee, Jinho
    Al-Ars, Zaid
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2021, PT I, 2021, 12891 : 167 - 178
  • [7] Reparameterized attention for convolutional neural networks
    Wu, Yiming
    Li, Ruixiang
    Yu, Yunlong
    Li, Xi
    PATTERN RECOGNITION LETTERS, 2022, 164 : 89 - 95
  • [8] Attention Augmented Convolutional Transformer for Tabular Time-series
    Shankaranarayana, Sharath M.
    Runje, Davor
    21ST IEEE INTERNATIONAL CONFERENCE ON DATA MINING WORKSHOPS ICDMW 2021, 2021 : 537 - 541
  • [9] Epileptic Seizure Prediction Using Attention Augmented Convolutional Network
    Liu, Dongsheng
    Dong, Xingchen
    Bian, Dong
    Zhou, Weidong
    INTERNATIONAL JOURNAL OF NEURAL SYSTEMS, 2023, 33 (11)
  • [10] Convolutional Attention Networks for Scene Text Recognition
    Xie, Hongtao
    Fang, Shancheng
    Zha, Zheng-Jun
    Yang, Yating
    Li, Yan
    Zhang, Yongdong
    ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2019, 15 (01)