Fine-grained Image Recognition via Attention Interaction and Counterfactual Attention Network

Cited by: 0

Authors:

Huang, Lei [1]

An, Chen [1]

Wang, Xiaodong [1]

Bullock, Leon Bevan [1]

Wei, Zhiqiang [1]

Affiliations:

[1] Ocean University of China, Faculty of Information Science and Engineering, Qingdao 266100, People's Republic of China

Source:

Engineering Applications of Artificial Intelligence | 2023 / Vol. 125

Funding:

National Natural Science Foundation of China;

Keywords:

Fine-grained image recognition; Counterfactual attention; Attention interaction; Attention mechanism; MODEL;

DOI:

10.1016/j.engappai.2023.106735

Chinese Library Classification (CLC) number:

TP [Automation technology, computer technology];

Discipline classification code:

0812;

Abstract:

Learning subtle, discriminative regions is central to fine-grained image recognition, and attention mechanisms have shown great potential for this task. Recent work mainly uses attention to locate key discriminative regions and learn salient features, while ignoring hard-to-perceive complementary features and the causal relationship between attention and prediction results. To address these issues, we propose an Attention Interaction and Counterfactual Attention Network (AICA-Net). Specifically, an Attention Interaction Fusion Module (AIFM) models the negative correlation between attention map channels to locate complementary features, then fuses them with the key discriminative features to generate richer fine-grained features. In parallel, an Enhanced Counterfactual Attention Module (ECAM) generates a counterfactual attention map. Comparing the effect of the learned attention map with that of the counterfactual attention map on the final prediction quantifies the quality of the attention, and this measure drives the network to learn more effective attention. Extensive experiments on the CUB-200-2011, FGVC-Aircraft and Stanford Cars datasets show that AICA-Net achieves strong results. In particular, it reaches 90.83% and 95.87% accuracy on the competitive open benchmarks CUB-200-2011 and Stanford Cars, respectively. The experiments indicate that the method outperforms state-of-the-art solutions.
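
The abstract gives no formulas, so the following is only a toy sketch of the two ideas it names, not the authors' implementation. It assumes that the "impact" of an attention map can be measured as the difference between class scores under the learned attention and under a counterfactual one (taken here to be uniform attention), and that "negative correlation between attention map channels" can be illustrated with a plain Pearson correlation. All function names, the linear classifier, and the toy numbers are hypothetical.

```python
import math


def attention_pool(features, attention):
    """Weighted sum of per-location feature vectors using attention weights."""
    dim = len(features[0])
    pooled = [0.0] * dim
    for feat, weight in zip(features, attention):
        for d in range(dim):
            pooled[d] += weight * feat[d]
    return pooled


def classify(pooled, weights):
    """Hypothetical linear classifier: one row of weights per class."""
    return [sum(w * x for w, x in zip(row, pooled)) for row in weights]


def counterfactual_effect(features, attention, cf_attention, weights):
    """Class scores under learned attention minus scores under counterfactual attention."""
    factual = classify(attention_pool(features, attention), weights)
    counterfactual = classify(attention_pool(features, cf_attention), weights)
    return [f - c for f, c in zip(factual, counterfactual)]


def pearson(x, y):
    """Pearson correlation between two flattened attention channels."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    std_x = math.sqrt(sum((a - mean_x) ** 2 for a in x))
    std_y = math.sqrt(sum((b - mean_y) ** 2 for b in y))
    return cov / (std_x * std_y)


# Toy setup (assumed values): 4 spatial locations, 2-dim features, 2 classes.
features = [[1.0, 0.0], [0.0, 1.0], [0.0, 1.0], [0.0, 1.0]]
weights = [[1.0, 0.0], [0.0, 1.0]]
learned = [0.7, 0.1, 0.1, 0.1]   # attention focused on the first location
uniform = [0.25] * 4             # counterfactual: attention carries no information

effect = counterfactual_effect(features, learned, uniform, weights)

# Two attention channels that respond to different regions.
channel_a = [0.9, 0.1, 0.0, 0.0]
channel_b = [0.0, 0.1, 0.4, 0.5]
correlation = pearson(channel_a, channel_b)
```

In this toy setup the effect on the first class is positive, meaning the learned attention helps that class more than uniform attention does; a training objective could reward such a gap. The correlation between the two channels is negative, which is the kind of signal a module could use to pick out complementary regions.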


Pages: 10

