Look Closer to See Better: Recurrent Attention Convolutional Neural Network for Fine-grained Image Recognition

被引:886
|
作者
Fu, Jianlong [1 ]
Zheng, Heliang [2 ]
Mei, Tao [1 ]
机构
[1] Microsoft Res, Beijing, Peoples R China
[2] Univ Sci & Technol China, Hefei, Anhui, Peoples R China
关键词
D O I
10.1109/CVPR.2017.476
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Recognizing fine-grained categories (e.g., bird species) is difficult due to the challenges of discriminative region localization and fine-grained feature learning. Existing approaches predominantly solve these challenges independently, while neglecting the fact that region detection and fine-grained feature learning are mutually correlated and thus can reinforce each other. In this paper, we propose a novel recurrent attention convolutional neural network (RA-CNN) which recursively learns discriminative region attention and region-based feature representation at multiple scales in a mutually reinforced way. The learning at each scale consists of a classification sub-network and an attention proposal sub-network (APN). The APN starts from full images, and iteratively generates region attention from coarse to fine by taking previous predictions as a reference, while a finer scale network takes as input an amplified attended region from previous scales in a recurrent way. The proposed RA-CNN is optimized by an intra-scale classification loss and an inter-scale ranking loss, to mutually learn accurate region attention and fine-grained representation. RA-CNN does not need bounding box/part annotations and can be trained end-to-end. We conduct comprehensive experiments and show that RA-CNN achieves the best performance in three fine-grained tasks, with relative accuracy gains of 3.3%, 3.7%, 3.8%, on CUB Birds, Stanford Dogs and Stanford Cars, respectively.
引用
收藏
页码:4476 / 4484
页数:9
相关论文
共 50 条
  • [1] Multiple Recurrent Attention Convolutional Neural Network For fine-grained image recognition
    Zhu, Xiaotong
    Bian, Hengwei
    [J]. 2022 INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, COMPUTER VISION AND MACHINE LEARNING (ICICML), 2022, : 44 - 48
  • [2] Adaptive Multi-Attention Convolutional Neural Network for Fine-Grained Image Recognition
    Li, Ang
    Chen, Jianxin
    Kang, Bin
    Zhuang, Wenqin
    Zhang, Xuguang
    [J]. 2019 IEEE GLOBECOM WORKSHOPS (GC WKSHPS), 2019,
  • [3] Learning Multi-Attention Convolutional Neural Network for Fine-Grained Image Recognition
    Zheng, Heliang
    Fu, Jianlong
    Mei, Tao
    Luo, Jiebo
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, : 5219 - 5227
  • [4] A Multi-part Convolutional Attention Network for Fine-Grained Image Recognition
    Zhong, Weilin
    Jiang, Linfeng
    Zhang, Tao
    Ji, Jinsheng
    Xiong, Huilin
    [J]. 2018 24TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2018, : 1857 - 1862
  • [5] Multi-branch Recurrent Attention Convolutional Neural Network with Evidence Theory for Fine-Grained Image Classification
    Xu, Zhikang
    Zhang, Bofeng
    Fu, Haijie
    Yue, Xiaodong
    Lv, Ying
    [J]. BELIEF FUNCTIONS: THEORY AND APPLICATIONS (BELIEF 2021), 2021, 12915 : 177 - 184
  • [6] Two-Level Progressive Attention Convolutional Network for Fine-Grained Image Recognition
    Wei, Hua
    Zhu, Ming
    Wang, Bo
    Wang, Jiarong
    Sun, Deyao
    [J]. IEEE ACCESS, 2020, 8 : 104985 - 104995
  • [7] Fine-grained Vehicle Recognition by Deep Convolutional Neural Network
    Huang, Kun
    Zhang, Bailing
    [J]. 2016 9TH INTERNATIONAL CONGRESS ON IMAGE AND SIGNAL PROCESSING, BIOMEDICAL ENGINEERING AND INFORMATICS (CISP-BMEI 2016), 2016, : 465 - 470
  • [8] Fine-grained Image Recognition via Attention Interaction and Counterfactual Attention Network
    Huang, Lei
    An, Chen
    Wang, Xiaodong
    Bullock, Leon Bevan
    Wei, Zhiqiang
    [J]. ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2023, 125
  • [9] Focus Longer to See Better: Recursively Refined Attention for Fine-Grained Image Classification
    Shroff, Prateek
    Chen, Tianlong
    Wei, Yunchao
    Wang, Zhangyang
    [J]. 2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW 2020), 2020, : 3782 - 3789
  • [10] DACBN: Dual attention convolutional broad network for fine-grained visual recognition
    Chen, Tao
    Wang, Lijie
    Liu, Yang
    Yu, Haisheng
    [J]. PATTERN RECOGNITION, 2024, 156