Learning enhanced features and inferring twice for fine-grained image classification

被引:5
|
作者
Nie, Xuan [1 ]
Chai, Bosong [1 ]
Wang, Luyao [1 ]
Liao, Qiyu [2 ]
Xu, Min [2 ]
机构
[1] Northwestern Polytech Univ, Xian, Peoples R China
[2] Univ Technol Sydney Ultimo, Sydney, NSW, Australia
关键词
Fine-grained visual categorization (FGVC); Image classification; Convolutional neural networks (CNN); CNNS;
D O I
10.1007/s11042-022-13619-z
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Fine-Grained Visual Categorization (FGVC) aims to distinguish between extremely similar subordinate-level categories within the same basic-level category. Existing research has proven the great importance of the discriminative features in FGVC but ignored the contributions for correct classification from other features, and the extracted features always contain more information about the obvious regions but less about subtle regions. In this paper, firstly, a novel module named forcing module is proposed to force the network to extract more diverse features for FGVC, which generates a suppression mask based on the class activation maps to suppress the most distinguishable regions, so as to force the network to extract other secondary distinguishable features as the final features. The forcing module consists of the original branch and the forcing branch. The original branch focuses on the primary discriminative regions while the forcing branch focuses on secondary discriminative regions. Secondly, in order to solve the problem that information of small-scale distinguishable features is lost seriously after multi-layer down-sampling, according to the class activation maps of the first prediction, the object is cropped and scaled as the second input. To reduce the prediction error, the first and second prediction probabilities are fused as the final prediction result. Experimental results indicate that the proposed method not only outperforms the baseline model by a large margin (3.7%, 5.9%, 3.1% respectively) on CUB-200-2011, Stanford-Cars, and FGVC-Aircraft, but also achieves state-of-the-art performance on FGVC-Aircraft.
引用
收藏
页码:14799 / 14813
页数:15
相关论文
共 50 条
  • [41] Evaluation of Output Embeddings for Fine-Grained Image Classification
    Akata, Zeynep
    Reed, Scott
    Walter, Daniel
    Lee, Honglak
    Schiele, Bernt
    2015 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2015, : 2927 - 2936
  • [42] Adversarial erasing attention for fine-grained image classification
    Jinsheng Ji
    Linfeng Jiang
    Tao Zhang
    Weilin Zhong
    Huilin Xiong
    Multimedia Tools and Applications, 2021, 80 : 22867 - 22889
  • [43] Adversarial erasing attention for fine-grained image classification
    Ji, Jinsheng
    Jiang, Linfeng
    Zhang, Tao
    Zhong, Weilin
    Xiong, Huilin
    MULTIMEDIA TOOLS AND APPLICATIONS, 2021, 80 (15) : 22867 - 22889
  • [44] Exploiting spatial relation for fine-grained image classification
    Qi, Lei
    Lu, Xiaoqiang
    Li, Xuelong
    PATTERN RECOGNITION, 2019, 91 : 47 - 55
  • [45] Aggregate attention module for fine-grained image classification
    Xingmei Wang
    Jiahao Shi
    Hamido Fujita
    Yilin Zhao
    Journal of Ambient Intelligence and Humanized Computing, 2023, 14 : 8335 - 8345
  • [46] Survey of Vision Transformer in Fine-Grained Image Classification
    Sun, Lulu
    Liu, Jianping
    Wang, Jian
    Xing, Jialu
    Zhang, Yue
    Wang, Chenyang
    Computer Engineering and Applications, 60 (10): : 30 - 46
  • [47] Fine-grained Image Classification and Retrieval by Combining Visual and Locally Pooled Textual Features
    Mafla, Andres
    Dey, Sounak
    Biten, Ali Furkan
    Gomez, Lluis
    Karatzas, Dimosthenis
    2020 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2020, : 2939 - 2948
  • [48] Robust fine-grained image classification with noisy labels
    Tan, Xinxing
    Dong, Zemin
    Zhao, Hualing
    VISUAL COMPUTER, 2022, 39 (11): : 5637 - 5650
  • [49] Application of Image Classification for Fine-Grained Nudity Detection
    Ion, Cristian
    Minea, Cristian
    ADVANCES IN VISUAL COMPUTING, ISVC 2019, PT I, 2020, 11844 : 3 - 15
  • [50] Improving Fine-Grained Image Classification With Multimodal Information
    Xu, Jie
    Zhang, Xiaoqian
    Zhao, Changming
    Geng, Zili
    Feng, Yuren
    Miao, Ke
    Li, Yunji
    IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 : 2082 - 2095