ALMA: Adjustable Location and Multi-Angle Attention for Fine-Grained Visual Classification

被引:0
|
作者
Ding, Boyu [1 ]
Xu, Xiaofeng [1 ,2 ]
Bao, Xianglin [1 ]
Yan, Nan [1 ,2 ]
Zhang, Ruiheng [3 ]
机构
[1] Anhui Polytech Univ, Sch Comp & Informat, Wuhu 241000, Peoples R China
[2] Anhui Polytech Univ, Ind Innovat Technol Res Co Ltd, Wuhu 241000, Peoples R China
[3] Beijing Inst Technol, Sch Mechatron Engn, Beijing 100081, Peoples R China
关键词
Fine-grained visual classification; Adjustable location; Multi-angle attention; Image cropping; Background masking;
D O I
10.1109/CSCWD61410.2024.10580689
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Fine-grained visual classification (FGVC) is a challenging but realistic problem that recognizes objects from common categories with subtle differences. Most previous work focused on identifying more regional features while neglecting the fact that these regions still contain a large amount of secondary information. To alleviate the interference of the secondary information, in this paper, we propose a novel Adjustable Location and Multi-angle Attention (ALMA) network to solve the FGVC problem. ALMA consists of two branches, i.e. the adjustable location module and the multi-angle attention module. Specifically, in the adjustable localization module, we first locate the interested area of the object and obtain the adjusted cropped area by adjusting the interested area through the background masking. Then, the adjusted regions will be gathered to locate objects with better prediction performance. Furthermore, we design the multi-angle attention module to gradually maximize the difference between the original attention map and the randomly selected attention map. Consequently, the model can focus on the main information which represents the entire object. To evaluate the effectiveness of the proposed model, we conduct extensive experiments on three public fine-grained benchmark datasets. Experimental results demonstrate that the proposed ALMA model has significant superiority over other FGVC methods.
引用
收藏
页码:2967 / 2972
页数:6
相关论文
共 50 条
  • [41] Fine-grained visual classification with multi-scale features based on self-supervised attention filtering mechanism
    Chen, Haiyuan
    Cheng, Lianglun
    Huang, Guoheng
    Zhang, Ganghan
    Lan, Jiaying
    Yu, Zhiwen
    Pun, Chi-Man
    Ling, Wing-Kuen
    APPLIED INTELLIGENCE, 2022, 52 (13) : 15673 - 15689
  • [42] ATTENTION-BASED MULTI-TASK LEARNING FOR FINE-GRAINED IMAGE CLASSIFICATION
    Liu, Dichao
    Wang, Yu
    Mase, Kenji
    Kato, Jien
    2021 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2021, : 1499 - 1503
  • [43] Fine-grained visual classification with multi-scale features based on self-supervised attention filtering mechanism
    Haiyuan Chen
    Lianglun Cheng
    Guoheng Huang
    Ganghan Zhang
    Jiaying Lan
    Zhiwen Yu
    Chi-Man Pun
    Wing-Kuen Ling
    Applied Intelligence, 2022, 52 : 15673 - 15689
  • [44] Multi-Scale Salient Features Bilinear Attention Fine-Grained Classification Method
    Liu G.
    Zhan H.
    Meng Y.
    Wang B.
    Jisuanji Fuzhu Sheji Yu Tuxingxue Xuebao/Journal of Computer-Aided Design and Computer Graphics, 2023, 35 (11): : 1683 - 1691
  • [45] Multi-scale discriminative regions attention network for fine-grained vehicle classification
    Rong, Wen-Zhong
    Han, Jin
    Cai, Ying-Hao
    Liu, Gen
    Han, Jin (shnk123@163.com); Cai, Ying-Hao (yinghao.cai@ia.ac.cn), 1600, Taiwan Ubiquitous Information (06): : 164 - 177
  • [46] Dual attention guided multi-scale CNN for fine-grained image classification
    Liu, Xiaozhang
    Zhang, Lifeng
    Li, Tao
    Wang, Dejian
    Wang, Zhaojie
    INFORMATION SCIENCES, 2021, 573 : 37 - 45
  • [47] Fine-Grained Thyroid Nodule Classification via Multi-Semantic Attention Network
    Li, Shuai
    Guo, Yuting
    Song, Wenfeng
    Pang, Zhennan
    Hao, Aimin
    Zhang, Bo
    Qin, Hong
    2019 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE (BIBM), 2019, : 826 - 833
  • [48] Category attention transfer for efficient fine-grained visual categorization
    Liao, Qiyu
    Wang, Dadong
    Xu, Min
    PATTERN RECOGNITION LETTERS, 2022, 153 : 10 - 15
  • [49] Research on Fine-Grained Visual Classification Method Based on Dual-Attention Feature Complementation
    Huang, Min
    Li, Ke
    Yu, Xiaoyan
    Yang, Chen
    IEEE ACCESS, 2024, 12 : 192209 - 192218
  • [50] Attention-based cropping and erasing learning with coarse-to-fine refinement for fine-grained visual classification
    Chen, Jianpin
    Li, Heng
    Liang, Junlin
    Su, Xiaofan
    Zhai, Zhenzhen
    Chai, Xinyu
    NEUROCOMPUTING, 2022, 501 : 359 - 369