Learning Scale-Consistent Attention Part Network for Fine-Grained Image Recognition

被引:18
|
作者
Liu, Huabin [1 ]
Li, Jianguo [2 ]
Li, Dian [3 ]
See, John [4 ]
Lin, Weiyao [1 ]
机构
[1] Shanghai Jiao Tong Univ, Dept Elect Engn, Shanghai 200240, Peoples R China
[2] Ant Financial Serv Grp, Beijing 101100, Peoples R China
[3] Tencent Technol Beijing Co Ltd, Beijing 100080, Peoples R China
[4] Heriot Watt Univ, Sch Math & Comp Sci, Putrajaya 62200, Malaysia
基金
中国国家自然科学基金;
关键词
Image recognition; Task analysis; Logic gates; Location awareness; Visualization; Training; Object detection; Fine-grained image recognition; scale-consistent; attention part;
D O I
10.1109/TMM.2021.3090274
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Discriminative region localization and feature learning are crucial for fine-grained visual recognition. Existing approaches solve this issue by attention mechanism or part based methods while neglecting consistency between attention and local parts, as well as the rich relation information among parts. This paper proposes a Scale-consistent Attention Part Network (SCAPNet) to address that issue, which seamlessly integrates three novel modules: grid gate attention unit (gGAU), scale-consistent attention part selection (SCAPS), and part relation modeling (PRM). The gGAU module represents the grid region at a certain fine-scale with middle layer CNN features and produces hard attention maps with the lightweight Gumbel-Max based gate. The SCAPS module utilizes attention to guide part selection across multi-scales and keep the selection scale-consistent. The PRM module utilizes the self-attention mechanism to build the relationship among parts based on their appearance and relative geo-positions. SCAPNet can be learned in an end-to-end way and demonstrates state-of-the-art accuracy on several publicly available fine-grained recognition datasets (CUB-200-2011, FGVC-Aircraft, Veg200, and Fru92).
引用
收藏
页码:2902 / 2913
页数:12
相关论文
共 50 条
  • [1] A Multi-part Convolutional Attention Network for Fine-Grained Image Recognition
    Zhong, Weilin
    Jiang, Linfeng
    Zhang, Tao
    Ji, Jinsheng
    Xiong, Huilin
    [J]. 2018 24TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2018, : 1857 - 1862
  • [2] Learning Rich Part Hierarchies With Progressive Attention Networks for Fine-Grained Image Recognition
    Zheng, Heliang
    Fu, Jianlong
    Zha, Zheng-Jun
    Luo, Jiebo
    Mei, Tao
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2020, 29 : 476 - 488
  • [3] Attention cutting and padding learning for fine-grained image recognition
    Zhuo Cheng
    Hongjian Li
    Xiaolin Duan
    Xiangyan Zeng
    Mingxuan He
    Hao Luo
    [J]. Multimedia Tools and Applications, 2021, 80 : 32791 - 32805
  • [4] Attention cutting and padding learning for fine-grained image recognition
    Cheng, Zhuo
    Li, Hongjian
    Duan, Xiaolin
    Zeng, Xiangyan
    He, Mingxuan
    Luo, Hao
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2021, 80 (21-23) : 32791 - 32805
  • [5] Learning Multi-Attention Convolutional Neural Network for Fine-Grained Image Recognition
    Zheng, Heliang
    Fu, Jianlong
    Mei, Tao
    Luo, Jiebo
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, : 5219 - 5227
  • [6] Fine-grained Image Recognition via Attention Interaction and Counterfactual Attention Network
    Huang, Lei
    An, Chen
    Wang, Xiaodong
    Bullock, Leon Bevan
    Wei, Zhiqiang
    [J]. ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2023, 125
  • [7] Looking for the Devil in the Details: Learning Trilinear Attention Sampling Network for Fine-grained Image Recognition
    Zheng, Heliang
    Fu, Jianlong
    Zha, Zheng-Jun
    Luo, Jiebo
    [J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 5007 - 5016
  • [8] Fine-Grained Image Recognition via Multi-Part Learning
    Jiang, Hailang
    Liu, Jianming
    [J]. Jisuanji Fuzhu Sheji Yu Tuxingxue Xuebao/Journal of Computer-Aided Design and Computer Graphics, 2023, 35 (07): : 1032 - 1039
  • [9] Learning to locate for fine-grained image recognition
    Chen, Jiamin
    Hu, Jianguo
    Li, Shiren
    [J]. COMPUTER VISION AND IMAGE UNDERSTANDING, 2021, 206
  • [10] Incremental Learning for Fine-Grained Image Recognition
    Cao, Liangliang
    Hsiao, Jenhao
    de Juan, Paloma
    Li, Yuncheng
    Thomee, Bart
    [J]. ICMR'16: PROCEEDINGS OF THE 2016 ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL, 2016, : 363 - 366