Fine-grained Image Recognition via Attention Interaction and Counterfactual Attention Network

被引:0
|
作者
Huang, Lei [1 ]
An, Chen [1 ]
Wang, Xiaodong [1 ]
Bullock, Leon Bevan [1 ]
Wei, Zhiqiang [1 ]
机构
[1] Ocean Univ China, Fac Informat Sci & Engn, Qingdao 266100, Peoples R China
基金
中国国家自然科学基金;
关键词
Fine-grained image recognition; Counterfactual attention; Attention interaction; Attention mechanism; MODEL;
D O I
10.1016/j.engappai.2023.106735
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Learning subtle and discriminative regions plays an important role in fine-grained image recognition, and attention mechanisms have shown great potential in such tasks. Recent research mainly focuses on employing the attention mechanism to locate key discriminative regions and learn salient features, whilst ignoring imperceptible complementary features and the causal relationship between prediction results and attention. To address the above issues, we propose an Attention Interaction and Counterfactual Attention Network (AICA-Net). Specifically, we propose an Attention Interaction Fusion Module (AIFM) to model the negative correlation between the attention map channels to locate the complementary features, and fuse the complementary features and key discriminative features to generate richer fine-grained features. Simultaneously, an Enhanced Counterfactual Attention Module (ECAM) is proposed to generate a counterfactual attention map. By comparing the impact of the learned attention map and the counterfactual attention map on the final prediction results, quantifying the quality of attention drives the network to learn more effective attention. Extensive experiments on CUB-200-2011, FGVC-Aircraft and Stanford Cars datasets have shown that our AICA-Net can get outstanding results. In particular, it achieves 90.83% and 95.87% accuracy on two open competitive benchmark datasets CUB-200-2011 and Stanford Cars, respectively. Experiments demonstrate that our method outperforms state-of-the-art solutions.
引用
收藏
页数:10
相关论文
共 50 条
  • [1] Pay Attention to the Activations: A Modular Attention Mechanism for Fine-Grained Image Recognition
    Rodriguez, Pau
    Velazquez, Diego
    Cucurull, Guillem
    Gonfaus, Josep M.
    Roca, E. Xavier
    Gonzalez, Jordi
    [J]. IEEE TRANSACTIONS ON MULTIMEDIA, 2020, 22 (02) : 502 - 514
  • [2] Multiple Recurrent Attention Convolutional Neural Network For fine-grained image recognition
    Zhu, Xiaotong
    Bian, Hengwei
    [J]. 2022 INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, COMPUTER VISION AND MACHINE LEARNING (ICICML), 2022, : 44 - 48
  • [3] A Multi-part Convolutional Attention Network for Fine-Grained Image Recognition
    Zhong, Weilin
    Jiang, Linfeng
    Zhang, Tao
    Ji, Jinsheng
    Xiong, Huilin
    [J]. 2018 24TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2018, : 1857 - 1862
  • [4] Channel Attention Multi-Branch Network for Fine-Grained Image Recognition
    Wang Binzhou
    Xiao Zhiyong
    [J]. LASER & OPTOELECTRONICS PROGRESS, 2021, 58 (22)
  • [5] Hierarchical Attention Network for Open-Set Fine-Grained Image Recognition
    Sun, Jiayin
    Wang, Hong
    Dong, Qiulei
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (05) : 3891 - 3904
  • [6] Attention cutting and padding learning for fine-grained image recognition
    Zhuo Cheng
    Hongjian Li
    Xiaolin Duan
    Xiangyan Zeng
    Mingxuan He
    Hao Luo
    [J]. Multimedia Tools and Applications, 2021, 80 : 32791 - 32805
  • [7] Group-Attention Transformer for Fine-Grained Image Recognition
    Yan, Bo
    Wang, Siwei
    Zhu, En
    Liu, Xinwang
    Chen, Wei
    [J]. Communications in Computer and Information Science, 2022, 1587 CCIS : 40 - 54
  • [8] Summary of Fine-Grained Image Recognition Based on Attention Mechanism
    Yao, Ma
    Min, Zhi
    [J]. THIRTEENTH INTERNATIONAL CONFERENCE ON GRAPHICS AND IMAGE PROCESSING (ICGIP 2021), 2022, 12083
  • [9] Attention cutting and padding learning for fine-grained image recognition
    Cheng, Zhuo
    Li, Hongjian
    Duan, Xiaolin
    Zeng, Xiangyan
    He, Mingxuan
    Luo, Hao
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2021, 80 (21-23) : 32791 - 32805
  • [10] Learning Scale-Consistent Attention Part Network for Fine-Grained Image Recognition
    Liu, Huabin
    Li, Jianguo
    Li, Dian
    See, John
    Lin, Weiyao
    [J]. IEEE TRANSACTIONS ON MULTIMEDIA, 2022, 24 : 2902 - 2913