Fine-grained visual classification with multi-scale features based on self-supervised attention filtering mechanism

被引:5
|
作者
Chen, Haiyuan [1 ]
Cheng, Lianglun [1 ]
Huang, Guoheng [1 ]
Zhang, Ganghan [1 ]
Lan, Jiaying [1 ]
Yu, Zhiwen [2 ]
Pun, Chi-Man [3 ]
Ling, Wing-Kuen [4 ]
机构
[1] Guangdong Univ Technol, Sch Comp Sci & Technol, Guangzhou 510006, Peoples R China
[2] South China Univ Technol, Sch Comp Sci & Engn, Guangzhou 510006, Peoples R China
[3] Univ Macau, Dept Comp & Informat Sci, Macau 999078, Peoples R China
[4] Guangdong Univ Technol, Sch Informat Engn, Guangzhou 510006, Peoples R China
关键词
Attention mechanism; Feature filtering; Fine-grained visual classification; Self-supervised learning;
D O I
10.1007/s10489-022-03232-w
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Although the existing Fine-Grained Visual Classification (FGVC) researches has made some progress, there are still some deficiencies need to be refined. Specifically, 1. The feature maps are used directly by most methods after they are extracted from the original images, which lacks further processing of feature maps and may lead irrelevant features to negatively affect network performance; 2. In many methods, the utilize of feature maps is relatively simple, and the relationship between feature maps that helpful for accurate classification is ignored. 3. Due to the high similarity between subcategories as well as the randomness and instability of training, the network prediction results may sometimes not accurate enough. To this end, we propose an efficient Self-supervised Attention Filtering and Multi-scale Features Network (SA-MFN) to improve the accuracy of FGVC, which consists of three modules. The first one is the Self-supervised Attention Map Filter, which is proposed to extract the initial attention maps of subcategories and filter out the most distinguishable and representative local attention maps. The second module is the Multi-scale Attention Map Generator, which extracts a global spatial feature map from the filtered attention maps and then concatenates it with the filtered attention maps. The third module is the Reiterative Prediction, in which the first prediction result of the network is re-utilized by this module to improve the accuracy and stability. Experimental results show that our SA-MFN outperforms the state-of-the-art methods on multiple fine-grained classification datasets, especially on the dataset of Stanford Cars, the proposed network achieves the accuracy of 94.7%.
引用
收藏
页码:15673 / 15689
页数:17
相关论文
共 50 条
  • [21] Multi-Scale Feature Transformer Based Fine-Grained Image Classification Method
    Zhang, Tiankui
    Cai, Changli
    Luo, Xiaoliang
    Zhu, Yutao
    [J]. Beijing Youdian Daxue Xuebao/Journal of Beijing University of Posts and Telecommunications, 2023, 46 (04): : 70 - 75
  • [22] A Streamlined Attention Mechanism for Image Classification and Fine-Grained Visual Recognition
    Dakshayani Himabindu, D.
    Praveen Kumar, S.
    [J]. Dakshayani Himabindu, D. (dakshayanihimabindu_d@vnrvjiet.in), 1600, Brno University of Technology (27): : 59 - 67
  • [23] Fine-grained Face Anti-Spoofing based on Recursive Self-Attention and Multi-Scale Fusion
    Xie, Shichuang
    Wu, Jiasheng
    Chen, Yanli
    Han, Meng
    Wu, Ting
    Qiao, Tong
    [J]. 2023 ASIA PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE, APSIPA ASC, 2023, : 1435 - 1442
  • [24] MSEC: Multi-Scale Erasure and Confusion for fine-grained image classification
    Zhang, Yan
    Sun, Yongsheng
    Wang, Nian
    Gao, Zijian
    Chen, Feng
    Wang, Chenfei
    Tang, Jun
    [J]. NEUROCOMPUTING, 2021, 449 : 1 - 14
  • [25] Attention-based supervised contrastive learning on fine-grained image classification
    Li, Qian
    Wu, Weining
    [J]. PATTERN ANALYSIS AND APPLICATIONS, 2024, 27 (03)
  • [26] FEATURE COMPARISON BASED CHANNEL ATTENTION FOR FINE-GRAINED VISUAL CLASSIFICATION
    Jia, Shukun
    Bai, Yan
    Zhang, Jing
    [J]. 2020 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2020, : 1776 - 1780
  • [27] Multi-Granularity Part Sampling Attention for Fine-Grained Visual Classification
    Wang, Jiahui
    Xu, Qin
    Jiang, Bo
    Luo, Bin
    Tang, Jinhui
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2024, 33 : 4529 - 4542
  • [28] Based on the multi-scale information sharing network of fine-grained attention for agricultural pest detection
    Wang Linfeng
    Liu Yong
    Liu Jiayao
    Wang Yunsheng
    Xu Shipu
    [J]. PLOS ONE, 2023, 18 (10):
  • [29] Learning Common Rationale to Improve Self-Supervised Representation for Fine-Grained Visual Recognition Problems
    Shu, Yangyang
    van den Hengel, Anton
    Liu, Lingqiao
    [J]. 2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 11392 - 11401
  • [30] Fine-Grained Image Classification for Crop Disease Based on Attention Mechanism
    Yang, Guofeng
    He, Yong
    Yang, Yong
    Xu, Beibei
    [J]. FRONTIERS IN PLANT SCIENCE, 2020, 11