Learn from each other to Classify better: Cross-layer mutual attention learning for fine-grained visual classification

被引:19
|
作者
Liu, Dichao [1 ,3 ]
Zhao, Longjiao [1 ]
Wang, Yu [2 ]
Kato, Jien [2 ]
机构
[1] Nagoya Univ, Grad Sch Informat, Furo Cho,Chikusa Ku, Nagoya, Aichi 4648601, Japan
[2] Ritsumeikan Univ, Coll Informat Sci & Engn, 1 Nojihigashi, Kusatsu, Shiga 5250058, Japan
[3] Navier Inc, Res Team, 9-2 Nibancho,Chiyoda Ku, Tokyo 1020084, Japan
关键词
Fine-grained recognition; Image classification; Deep features; NEURAL-NETWORK;
D O I
10.1016/j.patcog.2023.109550
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Fine-grained visual classification (FGVC) is valuable yet challenging. The difficulty of FGVC mainly lies in its intrinsic inter-class similarity, intra-class variation, and limited training data. Moreover, with the popularity of deep convolutional neural networks, researchers have mainly used deep, abstract, semantic information for FGVC, while shallow, detailed information has been neglected. This work proposes a cross-layer mutual attention learning network (CMAL-Net) to solve the above problems. Specifically, this work views the shallow to deep layers of CNNs as "experts" knowledgeable about different perspectives. We let each expert give a category prediction and an attention region indicating the found clues. Attention regions are treated as information carriers among experts, bringing three benefits: ( i ) helping the model focus on discriminative regions; ( ii ) providing more training data; ( iii ) allowing experts to learn from each other to improve the overall performance. CMAL-Net achieves state-of-the-art performance on three competitive datasets: FGVC-Aircraft, Stanford Cars, and Food-11. The source code is available at https://github.com/Dichao-Liu/CMAL
引用
收藏
页数:12
相关论文
共 50 条
  • [21] Cross-X Learning for Fine-Grained Visual Categorization
    Luo, Wei
    Yang, Xitong
    Mo, Xianjie
    Lu, Yuheng
    Davis, Larry S.
    Li, Jun
    Yang, Jian
    Lim, Ser-Nam
    2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 8241 - 8250
  • [22] Cross-Part Learning for Fine-Grained Image Classification
    Liu, Man
    Zhang, Chunjie
    Bai, Huihui
    Zhang, Riquan
    Zhao, Yao
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2022, 31 : 748 - 758
  • [23] Hierarchical Feature Attention Learning Network for Detecting Object and Discriminative Parts in Fine-Grained Visual Classification
    Han, A. Yeong
    Yi, Kwang Moo
    Kim, Kyeong Tae
    Choi, Jae Young
    IEEE ACCESS, 2025, 13 : 19533 - 19544
  • [24] Where to Focus: Investigating Hierarchical Attention Relationship for Fine-Grained Visual Classification
    School of Computer Science and Engineering, State Key Laboratory of Software Development Environment, Jiangxi Research Institute, Beihang University, Beijing, China
    不详
    不详
    不详
    不详
    Lect. Notes Comput. Sci., (57-73): : 57 - 73
  • [25] Multi-Granularity Part Sampling Attention for Fine-Grained Visual Classification
    Wang, Jiahui
    Xu, Qin
    Jiang, Bo
    Luo, Bin
    Tang, Jinhui
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2024, 33 : 4529 - 4542
  • [26] Where to Focus: Investigating Hierarchical Attention Relationship for Fine-Grained Visual Classification
    Liu, Yang
    Zhou, Lei
    Zhang, Pengcheng
    Bai, Xiao
    Gu, Lin
    Yu, Xiaohan
    Zhou, Jun
    Hancock, Edwin R.
    COMPUTER VISION, ECCV 2022, PT XXIV, 2022, 13684 : 57 - 73
  • [27] Fine-Grained Visual Classification Network Based on Fusion Pooling and Attention Enhancement
    Xiao B.
    Guo J.
    Zhang X.
    Wang M.
    Moshi Shibie yu Rengong Zhineng/Pattern Recognition and Artificial Intelligence, 2023, 36 (07): : 661 - 670
  • [28] Divide&Classify: Fine-Grained Classification for City-Wide Visual Place Recognition
    Trivigno, Gabriele
    Berton, Gabriele
    Aragon, Juan
    Caputo, Barbara
    Masone, Carlo
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 11108 - 11118
  • [29] Fine-Grained Visual Classification via Internal Ensemble Learning Transformer
    Xu, Qin
    Wang, Jiahui
    Jiang, Bo
    Luo, Bin
    IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 25 : 9015 - 9028
  • [30] Dual Cross-Attention Learning for Fine-Grained Visual Categorization and Object Re-Identification
    Zhu, Haowei
    Ke, Wenjing
    Li, Dong
    Liu, Ji
    Tian, Lu
    Shan, Yi
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 4682 - 4692