Fine-grained visual classification with multi-scale features based on self-supervised attention filtering mechanism

被引:5
|
作者
Chen, Haiyuan [1 ]
Cheng, Lianglun [1 ]
Huang, Guoheng [1 ]
Zhang, Ganghan [1 ]
Lan, Jiaying [1 ]
Yu, Zhiwen [2 ]
Pun, Chi-Man [3 ]
Ling, Wing-Kuen [4 ]
机构
[1] Guangdong Univ Technol, Sch Comp Sci & Technol, Guangzhou 510006, Peoples R China
[2] South China Univ Technol, Sch Comp Sci & Engn, Guangzhou 510006, Peoples R China
[3] Univ Macau, Dept Comp & Informat Sci, Macau 999078, Peoples R China
[4] Guangdong Univ Technol, Sch Informat Engn, Guangzhou 510006, Peoples R China
关键词
Attention mechanism; Feature filtering; Fine-grained visual classification; Self-supervised learning;
D O I
10.1007/s10489-022-03232-w
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Although the existing Fine-Grained Visual Classification (FGVC) researches has made some progress, there are still some deficiencies need to be refined. Specifically, 1. The feature maps are used directly by most methods after they are extracted from the original images, which lacks further processing of feature maps and may lead irrelevant features to negatively affect network performance; 2. In many methods, the utilize of feature maps is relatively simple, and the relationship between feature maps that helpful for accurate classification is ignored. 3. Due to the high similarity between subcategories as well as the randomness and instability of training, the network prediction results may sometimes not accurate enough. To this end, we propose an efficient Self-supervised Attention Filtering and Multi-scale Features Network (SA-MFN) to improve the accuracy of FGVC, which consists of three modules. The first one is the Self-supervised Attention Map Filter, which is proposed to extract the initial attention maps of subcategories and filter out the most distinguishable and representative local attention maps. The second module is the Multi-scale Attention Map Generator, which extracts a global spatial feature map from the filtered attention maps and then concatenates it with the filtered attention maps. The third module is the Reiterative Prediction, in which the first prediction result of the network is re-utilized by this module to improve the accuracy and stability. Experimental results show that our SA-MFN outperforms the state-of-the-art methods on multiple fine-grained classification datasets, especially on the dataset of Stanford Cars, the proposed network achieves the accuracy of 94.7%.
引用
收藏
页码:15673 / 15689
页数:17
相关论文
共 50 条
  • [1] Fine-grained visual classification with multi-scale features based on self-supervised attention filtering mechanism
    Haiyuan Chen
    Lianglun Cheng
    Guoheng Huang
    Ganghan Zhang
    Jiaying Lan
    Zhiwen Yu
    Chi-Man Pun
    Wing-Kuen Ling
    [J]. Applied Intelligence, 2022, 52 : 15673 - 15689
  • [2] Siamese self-supervised learning for fine-grained visual classification
    Ji, Ruyi
    Li, Jiaying
    Zhang, Libo
    [J]. COMPUTER VISION AND IMAGE UNDERSTANDING, 2023, 229
  • [3] Multi-Scale Salient Features Bilinear Attention Fine-Grained Classification Method
    Liu, Guanghui
    Zhan, Hua
    Meng, Yuebo
    Wang, Bo
    [J]. Jisuanji Fuzhu Sheji Yu Tuxingxue Xuebao/Journal of Computer-Aided Design and Computer Graphics, 2023, 35 (11): : 1683 - 1691
  • [4] Multi-scale network via progressive multi-granularity attention for fine-grained visual classification
    An, Chen
    Wang, Xiaodong
    Wei, Zhiqiang
    Zhang, Ke
    Huang, Lei
    [J]. APPLIED SOFT COMPUTING, 2023, 146
  • [5] Dual attention guided multi-scale CNN for fine-grained image classification
    Liu, Xiaozhang
    Zhang, Lifeng
    Li, Tao
    Wang, Dejian
    Wang, Zhaojie
    [J]. INFORMATION SCIENCES, 2021, 573 : 37 - 45
  • [6] Multi-scale discriminative regions attention network for fine-grained vehicle classification
    Rong, Wen-Zhong
    Han, Jin
    Cai, Ying-Hao
    Liu, Gen
    [J]. Han, Jin (shnk123@163.com); Cai, Ying-Hao (yinghao.cai@ia.ac.cn), 1600, Taiwan Ubiquitous Information (06): : 164 - 177
  • [7] Scalenet: A Convolutional Network to Extract Multi-Scale and Fine-Grained Visual Features
    Zhang, Jinpeng
    Zhang, Jinming
    Hu, Guyue
    Chen, Yang
    Yu, Shan
    [J]. IEEE ACCESS, 2019, 7 : 147560 - 147570
  • [8] Multi-scale Sparse Network with Cross-Attention Mechanism for image-based butterflies fine-grained classification
    Li, Maopeng
    Zhou, Guoxiong
    Cai, Weiwei
    Li, Jiayong
    Li, Mingxuan
    He, Mingfang
    Hu, Yahui
    Li, Liujun
    [J]. APPLIED SOFT COMPUTING, 2022, 117
  • [9] Fine-Grained Object Classification via Self-Supervised Pose Alignment
    Yang, Xuhui
    Wang, Yaowei
    Chen, Ke
    Xu, Yong
    Tian, Yonghong
    [J]. 2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2022, : 7389 - 7398
  • [10] Convolutional Fine-Grained Classification With Self-Supervised Target Relation Regularization
    Liu, Kangjun
    Chen, Ke
    Jia, Kui
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2022, 31 : 5570 - 5584