Attention Augmented Convolutional Networks

Cited by: 825
Authors
Bello, Irwan [1]
Zoph, Barret [1]
Vaswani, Ashish [1]
Shlens, Jonathon [1]
Le, Quoc V. [1]
Affiliations
[1] Google Brain, Mountain View, CA 94043 USA
Keywords
ARCHITECTURES;
DOI
10.1109/ICCV.2019.00338
Chinese Library Classification
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Convolutional networks have been the paradigm of choice in many computer vision applications. The convolution operation, however, has a significant weakness in that it only operates on a local neighborhood, thus missing global information. Self-attention, on the other hand, has emerged as a recent advance to capture long-range interactions, but has mostly been applied to sequence modeling and generative modeling tasks. In this paper, we consider the use of self-attention for discriminative visual tasks as an alternative to convolutions. We introduce a novel two-dimensional relative self-attention mechanism that proves competitive in replacing convolutions as a stand-alone computational primitive for image classification. We find in control experiments that the best results are obtained when combining both convolutions and self-attention. We therefore propose to augment convolutional operators with this self-attention mechanism by concatenating convolutional feature maps with a set of feature maps produced via self-attention. Extensive experiments show that Attention Augmentation leads to consistent improvements in image classification on ImageNet and object detection on COCO across many different models and scales, including ResNets and a state-of-the-art mobile constrained network, while keeping the number of parameters similar. In particular, our method achieves a 1.3% top-1 accuracy improvement on ImageNet classification over a ResNet50 baseline and outperforms other attention mechanisms for images such as Squeeze-and-Excitation [17]. It also achieves an improvement of 1.4 mAP in COCO Object Detection on top of a RetinaNet baseline.
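The concatenation idea described in the abstract can be illustrated with a minimal NumPy sketch. This is not the authors' implementation: it assumes a single attention head, omits the two-dimensional relative position terms, and uses hypothetical function names, weight shapes, and channel counts chosen only for illustration.

```python
import numpy as np

def conv3x3(x, w):
    """Same-padded 3x3 convolution. x: (H, W, C_in), w: (3, 3, C_in, C_out)."""
    H, W, _ = x.shape
    xp = np.pad(x, ((1, 1), (1, 1), (0, 0)))  # zero-pad the two spatial axes
    out = np.zeros((H, W, w.shape[-1]))
    for di in range(3):
        for dj in range(3):
            # Each kernel offset contributes one shifted, channel-mixed copy of x.
            out += xp[di:di + H, dj:dj + W, :] @ w[di, dj]
    return out

def self_attention_2d(x, wq, wk, wv):
    """Single-head self-attention over all H*W positions.

    Assumption: no relative position embeddings, unlike the mechanism
    named in the abstract.
    """
    H, W, C = x.shape
    flat = x.reshape(H * W, C)                 # treat each pixel as a token
    q, k, v = flat @ wq, flat @ wk, flat @ wv
    logits = (q @ k.T) / np.sqrt(q.shape[-1])  # scaled dot-product scores
    logits -= logits.max(axis=-1, keepdims=True)  # numerical stability
    weights = np.exp(logits)
    weights /= weights.sum(axis=-1, keepdims=True)  # softmax over positions
    return (weights @ v).reshape(H, W, -1)

def attention_augmented_conv(x, w_conv, wq, wk, wv):
    """Concatenate convolutional and attentional feature maps along channels."""
    return np.concatenate(
        [conv3x3(x, w_conv), self_attention_2d(x, wq, wk, wv)], axis=-1
    )

# Hypothetical sizes: 8x8 input, 16 channels, 24 conv + 8 attention outputs.
rng = np.random.default_rng(0)
x = rng.normal(size=(8, 8, 16))
w_conv = rng.normal(size=(3, 3, 16, 24)) * 0.1
wq = rng.normal(size=(16, 8)) * 0.1
wk = rng.normal(size=(16, 8)) * 0.1
wv = rng.normal(size=(16, 8)) * 0.1

out = attention_augmented_conv(x, w_conv, wq, wk, wv)
print(out.shape)  # (8, 8, 32)
```

The first 24 output channels depend only on a 3x3 neighborhood, while the last 8 can draw on every spatial position, which is the local-plus-global combination the abstract motivates.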
Pages: 3285-3294
Number of pages: 10
Related papers
50 in total
  • [21] Remaining Useful Life Estimation of Aircraft Engines Using Siamese Attention-Augmented Quantum Convolutional Neural Networks
    Ali, Al-Moayed Zaid Abdulrazaq Ali
    Abdulaziz, Al-Qubati Mohammed Ahmed
    Ahmed, Al-Jonaid Amjad Mohammed
    Wang, Cheng Long
    2024 5TH INTERNATIONAL CONFERENCE ON COMPUTER ENGINEERING AND APPLICATION, ICCEA 2024, 2024, : 1366 - 1371
  • [22] Probabilistic Attention Map: A Probabilistic Attention Mechanism for Convolutional Neural Networks
    Liu, Yifeng
    Tian, Jing
    SENSORS, 2024, 24 (24)
  • [23] Augmented Equivariant Attention Networks for Microscopy Image Transformation
    Xie, Yaochen
    Ding, Yu
    Ji, Shuiwang
    IEEE TRANSACTIONS ON MEDICAL IMAGING, 2022, 41 (11) : 3194 - 3206
  • [24] Cross Attention Augmented Transducer Networks for Simultaneous Translation
    Liu, Dan
    Du, Mengge
    Li, Xiaoxi
    Li, Ya
    Chen, Enhong
    2021 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2021), 2021, : 39 - 55
  • [25] Augmented Convolutional Neural Networks with Transformer for Wireless Interference Identification
    Wang, Pengyu
    Cheng, Yufan
    Dong, Binhong
    2021 IEEE GLOBAL COMMUNICATIONS CONFERENCE (GLOBECOM), 2021,
  • [26] Integrated Convolutional and Graph Attention Neural Networks for Electroencephalography
    Kang, Jae-eon
    Lee, Changha
    Lee, Jong-Hwan
    2024 12TH INTERNATIONAL WINTER CONFERENCE ON BRAIN-COMPUTER INTERFACE, BCI 2024, 2024,
  • [27] Attention Guided Graph Convolutional Networks for Relation Extraction
    Guo, Zhijiang
    Zhang, Yan
    Lu, Wei
    57TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2019), 2019, : 241 - 251
  • [28] Evaluating Attention in Convolutional Neural Networks for Blended Images
    Portscher, Andrea
    Stabinger, Sebastian
    Rodriguez-Sanchez, Antonio
    2022 IEEE 5TH INTERNATIONAL CONFERENCE ON IMAGE PROCESSING APPLICATIONS AND SYSTEMS, IPAS, 2022,
  • [29] Graph Convolutional Networks with Motif-based Attention
    Lee, John Boaz
    Rossi, Ryan A.
    Kong, Xiangnan
    Kim, Sungchul
    Koh, Eunyee
    Rao, Anup
    PROCEEDINGS OF THE 28TH ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT (CIKM '19), 2019, : 499 - 508
  • [30] Spatial Decomposition and Aggregation for Attention in Convolutional Neural Networks
    Zhu, Meng
    Min, Weidong
    Xiang, Hongyue
    Zha, Cheng
    Huang, Zheng
    Li, Longfei
    Fu, Qiyan
    INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2024, 38 (01)