Learn from each other to Classify better: Cross-layer mutual attention learning for fine-grained visual classification

Cited by: 19
Authors
Liu, Dichao [1, 3]
Zhao, Longjiao [1 ]
Wang, Yu [2 ]
Kato, Jien [2 ]
Affiliations
[1] Nagoya Univ, Grad Sch Informat, Furo Cho,Chikusa Ku, Nagoya, Aichi 4648601, Japan
[2] Ritsumeikan Univ, Coll Informat Sci & Engn, 1 Nojihigashi, Kusatsu, Shiga 5250058, Japan
[3] Navier Inc, Res Team, 9-2 Nibancho,Chiyoda Ku, Tokyo 1020084, Japan
Keywords
Fine-grained recognition; Image classification; Deep features; NEURAL-NETWORK;
DOI
10.1016/j.patcog.2023.109550
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Fine-grained visual classification (FGVC) is valuable yet challenging. The difficulty of FGVC mainly lies in its intrinsic inter-class similarity, intra-class variation, and limited training data. Moreover, with the popularity of deep convolutional neural networks, researchers have mainly used deep, abstract, semantic information for FGVC, while shallow, detailed information has been neglected. This work proposes a cross-layer mutual attention learning network (CMAL-Net) to solve the above problems. Specifically, this work views the shallow to deep layers of CNNs as "experts" knowledgeable about different perspectives. We let each expert give a category prediction and an attention region indicating the found clues. Attention regions are treated as information carriers among experts, bringing three benefits: (i) helping the model focus on discriminative regions; (ii) providing more training data; (iii) allowing experts to learn from each other to improve the overall performance. CMAL-Net achieves state-of-the-art performance on three competitive datasets: FGVC-Aircraft, Stanford Cars, and Food-11. The source code is available at https://github.com/Dichao-Liu/CMAL
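The abstract does not say how an expert turns its activations into an attention region. As a rough illustration only, the sketch below assumes one simple possibility: threshold a 2D activation map at a fraction of its peak and crop the bounding box of the surviving cells. The names `attention_box` and `crop`, the `ratio` parameter, and the toy data are all hypothetical and are not taken from the paper or its repository; the activation map is also assumed to have the same resolution as the image.

```python
from typing import List, Tuple

Grid = List[List[float]]
Box = Tuple[int, int, int, int]


def attention_box(act: Grid, ratio: float = 0.5) -> Box:
    """Return (top, left, bottom, right), inclusive, of cells >= ratio * peak.

    Assumption: thresholding at a fraction of the peak is one simple way to
    pick a region; the paper may use a different rule.
    """
    peak = max(max(row) for row in act)
    thresh = ratio * peak
    rows = [r for r, row in enumerate(act) if any(v >= thresh for v in row)]
    cols = [c for c in range(len(act[0])) if any(row[c] >= thresh for row in act)]
    return rows[0], cols[0], rows[-1], cols[-1]


def crop(image: Grid, box: Box) -> Grid:
    """Cut the boxed region out of an image with the same grid size."""
    top, left, bottom, right = box
    return [row[left:right + 1] for row in image[top:bottom + 1]]


# Toy activation map standing in for one expert's (one layer's) response.
activation = [
    [0.0, 0.1, 0.0, 0.0],
    [0.1, 0.9, 0.8, 0.0],
    [0.0, 0.7, 1.0, 0.1],
    [0.0, 0.0, 0.1, 0.0],
]
# Toy "image" whose pixel values are just their flat indices.
image = [[r * 4 + c for c in range(4)] for r in range(4)]

box = attention_box(activation)
region = crop(image, box)
print(box)     # (1, 1, 2, 2)
print(region)  # [[5, 6], [9, 10]]
```

In the scheme the abstract describes, a crop like `region` would be passed to the other experts as an additional training input, which is how attention regions could act as both extra data and a channel for experts to learn from each other.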
Pages: 12
Related papers
50 in total
  • [41] Fine-grained recognition algorithm of crop pests based on cross-layer bilinear aggregation and multi-task learning
    Ruan, Juquan
    Liu, Shuo
    Mao, Wanjing
    Zeng, Shan
    Zhang, Zhuoyi
    Yin, Guangsun
    JOURNAL OF AGRICULTURAL ENGINEERING, 2024, 55 (03)
  • [42] Feature alignment via mutual mapping for few-shot fine-grained visual classification
    Wu, Qin
    Song, Tingting
    Fan, Shengnan
    Chen, Zeda
    Jin, Kelei
    Zhou, Haojie
    IMAGE AND VISION COMPUTING, 2024, 147
  • [43] Feature Combination with Multi-Kernel Learning for Fine-Grained Visual Classification
    Angelova, Anelia
    Niculescu-Mizil, Alexandru
    2014 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2014, : 241 - 246
  • [44] Consistency-aware Feature Learning for Hierarchical Fine-grained Visual Classification
    Wang, Rui
    Zou, Cong
    Zhang, Weizhong
    Zhu, Zixuan
    Jing, Lihua
    PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023, 2023, : 2326 - 2334
  • [45] ATTENTION-BASED MULTI-TASK LEARNING FOR FINE-GRAINED IMAGE CLASSIFICATION
    Liu, Dichao
    Wang, Yu
    Mase, Kenji
    Kato, Jien
    2021 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2021, : 1499 - 1503
  • [46] Multi-Depth Learning with Multi-Attention for fine-grained image classification
    Dai, Zuhua
    Li, Hongyi
    Li, Kelong
    Zhou, Anwei
    2020 INTERNATIONAL CONFERENCE ON INTELLIGENT COMPUTING AND HUMAN-COMPUTER INTERACTION (ICHCI 2020), 2020, : 206 - 212
  • [47] Learning Attention-Aware Interactive Features for Fine-Grained Vegetable and Fruit Classification
    Wang, Yimin
    Xiao, Zhifeng
    Meng, Lingguo
    APPLIED SCIENCES-BASEL, 2021, 11 (14)
  • [48] To Know and To Learn About the Integration of Knowledge Representation and Deep Learning for Fine-Grained Visual Categorization
    Setti, Francesco
    PROCEEDINGS OF THE 13TH INTERNATIONAL JOINT CONFERENCE ON COMPUTER VISION, IMAGING AND COMPUTER GRAPHICS THEORY AND APPLICATIONS (VISIGRAPP 2018), VOL 5: VISAPP, 2018, : 387 - 392
  • [49] Research on Fine-Grained Visual Classification Method Based on Dual-Attention Feature Complementation
    Huang, Min
    Li, Ke
    Yu, Xiaoyan
    Yang, Chen
    IEEE ACCESS, 2024, 12 : 192209 - 192218
  • [50] Fine-grained Cross-media Representation Learning with Deep Quantization Attention Network
    Liang, Meiyu
    Du, Junping
    Liu, Wu
    Xue, Zhe
    Geng, Yue
    Yang, Congxian
    PROCEEDINGS OF THE 27TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA (MM'19), 2019, : 1313 - 1321