Attention Branch Network: Learning of Attention Mechanism for Visual Explanation

被引:319
|
作者
Fukui, Hiroshi [1 ]
Hirakawa, Tsubasa [1 ]
Yamashita, Takayoshi [1 ]
Fujiyoshi, Hironobu [1 ]
机构
[1] Chubu Univ, 1200 Matsumotocho, Kasugai, Aichi, Japan
关键词
D O I
10.1109/CVPR.2019.01096
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Visual explanation enables humans to understand the decision making of deep convolutional neural network (CNN), but it is insufficient to contribute to improving CNN performance. In this paper, we focus on the attention map for visual explanation, which represents a high response value as the attention location in image recognition. This attention region significantly improves the performance of CNN by introducing an attention mechanism that focuses on a specific region in an image. In this work, we propose Attention Branch Network (ABN), which extends a response-based visual explanation model by introducing a branch structure with an attention mechanism. ABN can be applicable to several image recognition tasks by introducing a branch for the attention mechanism and is trainable for visual explanation and image recognition in an end-to-end manner. We evaluate ABN on several image recognition tasks such as image classification, fine-grained recognition, and multiple facial attribute recognition. Experimental results indicate that ABN outperforms the baseline models on these image recognition tasks while generating an attention map for visual explanation. Our code is available(1).
引用
收藏
页码:10697 / 10706
页数:10
相关论文
共 50 条
  • [1] Visual Explanation by Attention Branch Network for End-to-end Learning-based Self-driving
    Mori, Keisuke
    Fukui, Hiroshi
    Murase, Takuya
    Hirakawa, Tsubasa
    Yamashita, Takayoshi
    Fujiyoshi, Hironobu
    2019 30TH IEEE INTELLIGENT VEHICLES SYMPOSIUM (IV19), 2019, : 1577 - 1582
  • [2] MANet: Mixed Attention Network for Visual Explanation
    Bai, Jingjing
    Kawahara, Yoshinobu
    NEW GENERATION COMPUTING, 2024, 42 (04) : 825 - 843
  • [3] Visual Explanation using Attention Mechanism in Actor-Critic-based Deep Reinforcement Learning
    Itaya, Hidenori
    Hirakawa, Tsubasa
    Yamashita, Takayoshi
    Fujiyoshi, Hironobu
    Sugiura, Komei
    2021 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2021,
  • [4] Visual attention network
    Guo, Meng-Hao
    Lu, Cheng-Ze
    Liu, Zheng-Ning
    Cheng, Ming-Ming
    Hu, Shi-Min
    COMPUTATIONAL VISUAL MEDIA, 2023, 9 (04) : 733 - 752
  • [5] Visual attention network
    Meng-Hao Guo
    Cheng-Ze Lu
    Zheng-Ning Liu
    Ming-Ming Cheng
    Shi-Min Hu
    Computational Visual Media, 2023, 9 : 733 - 752
  • [6] Residual Network for Deep Reinforcement Learning with Attention Mechanism
    Zhu, Hanhua
    Kaneko, Tomoyuki
    JOURNAL OF INFORMATION SCIENCE AND ENGINEERING, 2021, 37 (03) : 517 - 533
  • [7] Evota: an enhanced visual object tracking network with attention mechanism
    An Zhao
    Yi Zhang
    Multimedia Tools and Applications, 2024, 83 : 24939 - 24960
  • [8] A novel visual classification framework on panoramic attention mechanism network
    Li, Wenshu
    Li, Shenhao
    Yin, Lingzhi
    Guo, Xiaoying
    Yang, Xu
    IET COMPUTER VISION, 2022, 16 (06) : 479 - 488
  • [9] Evota: an enhanced visual object tracking network with attention mechanism
    Zhao, An
    Zhang, Yi
    MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 83 (8) : 24939 - 24960
  • [10] Visual attention and the mechanism of metacontrast
    Neumann, Odmar
    Scharlau, Ingrid
    PSYCHOLOGICAL RESEARCH-PSYCHOLOGISCHE FORSCHUNG, 2007, 71 (06): : 626 - 633