ALMA: Adjustable Location and Multi-Angle Attention for Fine-Grained Visual Classification

被引:0
|
作者
Ding, Boyu [1 ]
Xu, Xiaofeng [1 ,2 ]
Bao, Xianglin [1 ]
Yan, Nan [1 ,2 ]
Zhang, Ruiheng [3 ]
机构
[1] Anhui Polytech Univ, Sch Comp & Informat, Wuhu 241000, Peoples R China
[2] Anhui Polytech Univ, Ind Innovat Technol Res Co Ltd, Wuhu 241000, Peoples R China
[3] Beijing Inst Technol, Sch Mechatron Engn, Beijing 100081, Peoples R China
关键词
Fine-grained visual classification; Adjustable location; Multi-angle attention; Image cropping; Background masking;
D O I
10.1109/CSCWD61410.2024.10580689
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Fine-grained visual classification (FGVC) is a challenging but realistic problem that recognizes objects from common categories with subtle differences. Most previous work focused on identifying more regional features while neglecting the fact that these regions still contain a large amount of secondary information. To alleviate the interference of the secondary information, in this paper, we propose a novel Adjustable Location and Multi-angle Attention (ALMA) network to solve the FGVC problem. ALMA consists of two branches, i.e. the adjustable location module and the multi-angle attention module. Specifically, in the adjustable localization module, we first locate the interested area of the object and obtain the adjusted cropped area by adjusting the interested area through the background masking. Then, the adjusted regions will be gathered to locate objects with better prediction performance. Furthermore, we design the multi-angle attention module to gradually maximize the difference between the original attention map and the randomly selected attention map. Consequently, the model can focus on the main information which represents the entire object. To evaluate the effectiveness of the proposed model, we conduct extensive experiments on three public fine-grained benchmark datasets. Experimental results demonstrate that the proposed ALMA model has significant superiority over other FGVC methods.
引用
收藏
页码:2967 / 2972
页数:6
相关论文
共 50 条
  • [31] Aggregate attention module for fine-grained image classification
    Wang, Xingmei
    Shi, Jiahao
    Fujita, Hamido
    Zhao, Yilin
    JOURNAL OF AMBIENT INTELLIGENCE AND HUMANIZED COMPUTING, 2021, 14 (7) : 8335 - 8345
  • [32] Multi-level navigation network: advancing fine-grained visual classification
    Liang, Hong
    Li, Xian
    Shao, Mingwen
    Zhang, Qian
    JOURNAL OF SUPERCOMPUTING, 2025, 81 (02):
  • [33] Fine-grained activity classification in assembly based on multi-visual modalities
    Chen, Haodong
    Zendehdel, Niloofar
    Leu, Ming C.
    Yin, Zhaozheng
    JOURNAL OF INTELLIGENT MANUFACTURING, 2024, 35 (05) : 2215 - 2233
  • [34] Feature Combination with Multi-Kernel Learning for Fine-Grained Visual Classification
    Angelova, Anelia
    Niculescu-Mizil, Alexandru
    2014 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2014, : 241 - 246
  • [35] Efficient Image Embedding for Fine-Grained Visual Classification
    Payatsuporn, Soranan
    Kijsirikul, Boonserm
    2022-14TH INTERNATIONAL CONFERENCE ON KNOWLEDGE AND SMART TECHNOLOGY (KST 2022), 2022, : 40 - 45
  • [36] Adaptive Destruction Learning for Fine-grained Visual Classification
    Zhang, Riheng
    Tan, Min
    Mao, Xiaoyang
    Gao, Zhigang
    Gu, Xiaoling
    2022 IEEE INTL CONF ON DEPENDABLE, AUTONOMIC AND SECURE COMPUTING, INTL CONF ON PERVASIVE INTELLIGENCE AND COMPUTING, INTL CONF ON CLOUD AND BIG DATA COMPUTING, INTL CONF ON CYBER SCIENCE AND TECHNOLOGY CONGRESS (DASC/PICOM/CBDCOM/CYBERSCITECH), 2022, : 946 - 950
  • [37] Exploration of Class Center for Fine-Grained Visual Classification
    Yao, Hang
    Miao, Qiguang
    Zhao, Peipei
    Li, Chaoneng
    Li, Xin
    Feng, Guanwen
    Liu, Ruyi
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (10) : 9954 - 9966
  • [38] A sparse focus framework for visual fine-grained classification
    YongXiong Wang
    Guangjun Li
    Li Ma
    Multimedia Tools and Applications, 2021, 80 : 25271 - 25289
  • [39] A sparse focus framework for visual fine-grained classification
    Wang, YongXiong
    Li, Guangjun
    Ma, Li
    MULTIMEDIA TOOLS AND APPLICATIONS, 2021, 80 (16) : 25271 - 25289
  • [40] CA-PMG: Channel attention and progressive multi-granularity training network for fine-grained visual classification
    Zhao, Peipei
    Miao, Qiguang
    Yao, Hang
    Liu, Xiangzeng
    Liu, Ruyi
    Gong, Maoguo
    IET IMAGE PROCESSING, 2021, 15 (14) : 3718 - 3727