Attention-Guided Spatial Transformer Networks for Fine-Grained Visual Recognition

被引:2
|
作者
Liu, Dichao [1 ]
Wang, Yu [2 ]
Kato, Jien [2 ]
机构
[1] Nagoya Univ, Grad Sch Informat, Nagoya, Aichi 4648601, Japan
[2] Ritsumeikan Univ, Coll Informat Sci & Engn, Kusatsu 5258577, Japan
关键词
recognition; attention; fine-grained; deep learning;
D O I
10.1587/transinf.2019EDP7045
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The aim of this paper is to propose effective attentional regions for fine-grained visual recognition. Based on the Spatial Transformers' capability of spatial manipulation within networks, we propose an extension model, the Attention-Guided Spatial Transformer Networks (AG-STNs). This model can guide the Spatial Transformers with hard-coded attentional regions at first. Then such guidance can be turned off, and the network model will adjust the region learning in terms of the location and scale. Such adjustment is conditioned to the classification loss so that it is actually optimized for better recognition results. With this model, we are able to successfully capture detailed attentional information. Also, the AG-STNs are able to capture attentional information in multiple levels, and different levels of attentional information are complementary to each other in our experiments. A fusion of them brings better results.
引用
收藏
页码:2577 / 2586
页数:10
相关论文
共 50 条
  • [1] Supervised Spatial Transformer Networks for Attention Learning in Fine-grained Action Recognition
    Liu, Dichao
    Wang, Yu
    Kato, Jien
    [J]. VISAPP: PROCEEDINGS OF THE 14TH INTERNATIONAL JOINT CONFERENCE ON COMPUTER VISION, IMAGING AND COMPUTER GRAPHICS THEORY AND APPLICATIONS, VOL 4, 2019, : 311 - 318
  • [2] Attention-Guided CutMix Data Augmentation Network for Fine-Grained Bird Recognition
    Guo, Wenming
    Wang, Yifei
    Han, Fang
    [J]. PROCEEDINGS OF 2021 2ND INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND INFORMATION SYSTEMS (ICAIIS '21), 2021,
  • [3] Learning attention-guided pyramidal features for few-shot fine-grained recognition
    Tang, Hao
    Yuan, Chengcheng
    Li, Zechao
    Tang, Jinhui
    [J]. Pattern Recognition, 2022, 130
  • [4] Learning attention-guided pyramidal features for few-shot fine-grained recognition
    Tang, Hao
    Yuan, Chengcheng
    Li, Zechao
    Tang, Jinhui
    [J]. PATTERN RECOGNITION, 2022, 130
  • [5] A weakly supervised spatial group attention network for fine-grained visual recognition
    Xie, Jiangjian
    Zhong, Yujie
    Zhang, Junguo
    Zhang, Changchun
    Schuller, Bjoern W.
    [J]. APPLIED INTELLIGENCE, 2023, 53 (20) : 23301 - 23315
  • [6] A weakly supervised spatial group attention network for fine-grained visual recognition
    Jiangjian Xie
    Yujie Zhong
    Junguo Zhang
    Changchun Zhang
    Björn W Schuller
    [J]. Applied Intelligence, 2023, 53 : 23301 - 23315
  • [7] Hierarchical attention vision transformer for fine-grained visual classification
    Hu, Xiaobin
    Zhu, Shining
    Peng, Taile
    [J]. JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2023, 91
  • [8] Attention-guided Fine-grained Feature Learning For Robust Face Forgery Detection
    Zhu, Yizhe
    Gao, Jialin
    Liu, Qiong
    Zhou, Xi
    [J]. 2022 26TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2022, : 1222 - 1228
  • [9] Deep Attention-Based Spatially Recursive Networks for Fine-Grained Visual Recognition
    Wu, Lin
    Wang, Yang
    Li, Xue
    Gao, Junbin
    [J]. IEEE TRANSACTIONS ON CYBERNETICS, 2019, 49 (05) : 1791 - 1802
  • [10] Dual-Dependency Attention Transformer for Fine-Grained Visual Classification
    Cui, Shiyan
    Hui, Bin
    [J]. SENSORS, 2024, 24 (07)