Look Closer to See Better: Recurrent Attention Convolutional Neural Network for Fine-grained Image Recognition

被引:886
|
作者
Fu, Jianlong [1 ]
Zheng, Heliang [2 ]
Mei, Tao [1 ]
机构
[1] Microsoft Res, Beijing, Peoples R China
[2] Univ Sci & Technol China, Hefei, Anhui, Peoples R China
关键词
D O I
10.1109/CVPR.2017.476
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Recognizing fine-grained categories (e.g., bird species) is difficult due to the challenges of discriminative region localization and fine-grained feature learning. Existing approaches predominantly solve these challenges independently, while neglecting the fact that region detection and fine-grained feature learning are mutually correlated and thus can reinforce each other. In this paper, we propose a novel recurrent attention convolutional neural network (RA-CNN) which recursively learns discriminative region attention and region-based feature representation at multiple scales in a mutually reinforced way. The learning at each scale consists of a classification sub-network and an attention proposal sub-network (APN). The APN starts from full images, and iteratively generates region attention from coarse to fine by taking previous predictions as a reference, while a finer scale network takes as input an amplified attended region from previous scales in a recurrent way. The proposed RA-CNN is optimized by an intra-scale classification loss and an inter-scale ranking loss, to mutually learn accurate region attention and fine-grained representation. RA-CNN does not need bounding box/part annotations and can be trained end-to-end. We conduct comprehensive experiments and show that RA-CNN achieves the best performance in three fine-grained tasks, with relative accuracy gains of 3.3%, 3.7%, 3.8%, on CUB Birds, Stanford Dogs and Stanford Cars, respectively.
引用
收藏
页码:4476 / 4484
页数:9
相关论文
共 50 条
  • [21] Summary of Fine-Grained Image Recognition Based on Attention Mechanism
    Yao, Ma
    Min, Zhi
    [J]. THIRTEENTH INTERNATIONAL CONFERENCE ON GRAPHICS AND IMAGE PROCESSING (ICGIP 2021), 2022, 12083
  • [22] Group-Attention Transformer for Fine-Grained Image Recognition
    Yan, Bo
    Wang, Siwei
    Zhu, En
    Liu, Xinwang
    Chen, Wei
    [J]. Communications in Computer and Information Science, 2022, 1587 CCIS : 40 - 54
  • [23] Attention cutting and padding learning for fine-grained image recognition
    Cheng, Zhuo
    Li, Hongjian
    Duan, Xiaolin
    Zeng, Xiangyan
    He, Mingxuan
    Luo, Hao
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2021, 80 (21-23) : 32791 - 32805
  • [24] Two-Level Attentions and Grouping Attention Convolutional Network for Fine-Grained Image Classification
    Yang, Yadong
    Wang, Xiaofeng
    Zhao, Quan
    Sui, Tingting
    [J]. APPLIED SCIENCES-BASEL, 2019, 9 (09):
  • [25] Fine-Grained Sentiment Analysis Based on Convolutional Neural Network
    Li H.
    Chai Y.
    [J]. Data Analysis and Knowledge Discovery, 2019, 3 (01) : 95 - 103
  • [26] CACRNN: A Context-Aware Attention-Based Convolutional Recurrent Neural Network for Fine-Grained Taxi Demand Prediction
    Wu, Wenbin
    Liu, Tong
    Yang, Jiahao
    [J]. ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PAKDD 2020, PT I, 2020, 12084 : 636 - 648
  • [27] Fine-Grained Food Image Recognition: A Study on Optimising Convolutional Neural Networks for Improved Performance
    Boyd, Liam
    Nnamoko, Nonso
    Lopes, Ricardo
    [J]. JOURNAL OF IMAGING, 2024, 10 (06)
  • [28] Pay Attention to the Activations: A Modular Attention Mechanism for Fine-Grained Image Recognition
    Rodriguez, Pau
    Velazquez, Diego
    Cucurull, Guillem
    Gonfaus, Josep M.
    Roca, E. Xavier
    Gonzalez, Jordi
    [J]. IEEE TRANSACTIONS ON MULTIMEDIA, 2020, 22 (02) : 502 - 514
  • [29] IMPROVING DYNAMIC GRAPH CONVOLUTIONAL NETWORK WITH FINE-GRAINED ATTENTION MECHANISM
    Wu, Bo
    Liang, Xun
    Zheng, Xiangping
    Guo, Yuhui
    Tang, Hui
    [J]. 2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 3938 - 3942
  • [30] Subtler mixed attention network on fine-grained image classification
    Liu, Chao
    Huang, Lei
    Wei, Zhiqiang
    Zhang, Wenfeng
    [J]. APPLIED INTELLIGENCE, 2021, 51 (11) : 7903 - 7916