ALMA: Adjustable Location and Multi-Angle Attention for Fine-Grained Visual Classification

被引:0
|
作者
Ding, Boyu [1 ]
Xu, Xiaofeng [1 ,2 ]
Bao, Xianglin [1 ]
Yan, Nan [1 ,2 ]
Zhang, Ruiheng [3 ]
机构
[1] Anhui Polytech Univ, Sch Comp & Informat, Wuhu 241000, Peoples R China
[2] Anhui Polytech Univ, Ind Innovat Technol Res Co Ltd, Wuhu 241000, Peoples R China
[3] Beijing Inst Technol, Sch Mechatron Engn, Beijing 100081, Peoples R China
关键词
Fine-grained visual classification; Adjustable location; Multi-angle attention; Image cropping; Background masking;
D O I
10.1109/CSCWD61410.2024.10580689
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Fine-grained visual classification (FGVC) is a challenging but realistic problem that recognizes objects from common categories with subtle differences. Most previous work focused on identifying more regional features while neglecting the fact that these regions still contain a large amount of secondary information. To alleviate the interference of the secondary information, in this paper, we propose a novel Adjustable Location and Multi-angle Attention (ALMA) network to solve the FGVC problem. ALMA consists of two branches, i.e. the adjustable location module and the multi-angle attention module. Specifically, in the adjustable localization module, we first locate the interested area of the object and obtain the adjusted cropped area by adjusting the interested area through the background masking. Then, the adjusted regions will be gathered to locate objects with better prediction performance. Furthermore, we design the multi-angle attention module to gradually maximize the difference between the original attention map and the randomly selected attention map. Consequently, the model can focus on the main information which represents the entire object. To evaluate the effectiveness of the proposed model, we conduct extensive experiments on three public fine-grained benchmark datasets. Experimental results demonstrate that the proposed ALMA model has significant superiority over other FGVC methods.
引用
收藏
页码:2967 / 2972
页数:6
相关论文
共 50 条
  • [21] Pairwise Confusion for Fine-Grained Visual Classification
    Dubey, Abhimanyu
    Gupta, Otkrist
    Guo, Pei
    Raskar, Ramesh
    Farrell, Ryan
    Naik, Nikhil
    COMPUTER VISION - ECCV 2018, PT XII, 2018, 11216 : 71 - 88
  • [22] Multi-Depth Learning with Multi-Attention for fine-grained image classification
    Dai, Zuhua
    Li, Hongyi
    Li, Kelong
    Zhou, Anwei
    2020 INTERNATIONAL CONFERENCE ON INTELLIGENT COMPUTING AND HUMAN-COMPUTER INTERACTION (ICHCI 2020), 2020, : 206 - 212
  • [23] Complemental Attention Multi-Feature Fusion Network for Fine-Grained Classification
    Miao, Zhuang
    Zhao, Xun
    Wang, Jiabao
    Li, Yang
    Li, Hang
    IEEE SIGNAL PROCESSING LETTERS, 2021, 28 : 1983 - 1987
  • [24] FINE-GRAINED MULTI-INSTANCE CLASSIFICATION IN MICROSCOPY THROUGH DEEP ATTENTION
    Fan, Mengran
    Chakraborti, Tapabrata
    Chang, Eric I-Chao
    Xu, Yan
    Rittscher, Jens
    2020 IEEE 17TH INTERNATIONAL SYMPOSIUM ON BIOMEDICAL IMAGING (ISBI 2020), 2020, : 169 - 173
  • [25] Multi-scale local regional attention fusion using visual transformers for fine-grained image classification
    Li, Yusong
    Xie, Bin
    Li, Yuling
    Zhang, Jiahao
    VISUAL COMPUTER, 2024,
  • [26] Learning Cascade Attention for fine-grained image classification
    Zhu, Youxiang
    Li, Ruochen
    Yang, Yin
    Ye, Ning
    NEURAL NETWORKS, 2020, 122 : 174 - 182
  • [27] Adversarial erasing attention for fine-grained image classification
    Jinsheng Ji
    Linfeng Jiang
    Tao Zhang
    Weilin Zhong
    Huilin Xiong
    Multimedia Tools and Applications, 2021, 80 : 22867 - 22889
  • [28] The Pairs Network of Attention model for Fine-grained Classification
    Wang, Gaihua
    Han, Jingwei
    Zhang, Chuanlei
    Yao, Jingxuan
    Zhu, Bolun
    PROCEEDINGS OF THE 2024 6TH INTERNATIONAL CONFERENCE ON BIG DATA ENGINEERING, BDE 2024, 2024, : 39 - 47
  • [29] Adversarial erasing attention for fine-grained image classification
    Ji, Jinsheng
    Jiang, Linfeng
    Zhang, Tao
    Zhong, Weilin
    Xiong, Huilin
    MULTIMEDIA TOOLS AND APPLICATIONS, 2021, 80 (15) : 22867 - 22889
  • [30] Aggregate attention module for fine-grained image classification
    Xingmei Wang
    Jiahao Shi
    Hamido Fujita
    Yilin Zhao
    Journal of Ambient Intelligence and Humanized Computing, 2023, 14 : 8335 - 8345