Learning enhanced features and inferring twice for fine-grained image classification

被引:5
|
作者
Nie, Xuan [1 ]
Chai, Bosong [1 ]
Wang, Luyao [1 ]
Liao, Qiyu [2 ]
Xu, Min [2 ]
机构
[1] Northwestern Polytech Univ, Xian, Peoples R China
[2] Univ Technol Sydney Ultimo, Sydney, NSW, Australia
关键词
Fine-grained visual categorization (FGVC); Image classification; Convolutional neural networks (CNN); CNNS;
D O I
10.1007/s11042-022-13619-z
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Fine-Grained Visual Categorization (FGVC) aims to distinguish between extremely similar subordinate-level categories within the same basic-level category. Existing research has proven the great importance of the discriminative features in FGVC but ignored the contributions for correct classification from other features, and the extracted features always contain more information about the obvious regions but less about subtle regions. In this paper, firstly, a novel module named forcing module is proposed to force the network to extract more diverse features for FGVC, which generates a suppression mask based on the class activation maps to suppress the most distinguishable regions, so as to force the network to extract other secondary distinguishable features as the final features. The forcing module consists of the original branch and the forcing branch. The original branch focuses on the primary discriminative regions while the forcing branch focuses on secondary discriminative regions. Secondly, in order to solve the problem that information of small-scale distinguishable features is lost seriously after multi-layer down-sampling, according to the class activation maps of the first prediction, the object is cropped and scaled as the second input. To reduce the prediction error, the first and second prediction probabilities are fused as the final prediction result. Experimental results indicate that the proposed method not only outperforms the baseline model by a large margin (3.7%, 5.9%, 3.1% respectively) on CUB-200-2011, Stanford-Cars, and FGVC-Aircraft, but also achieves state-of-the-art performance on FGVC-Aircraft.
引用
收藏
页码:14799 / 14813
页数:15
相关论文
共 50 条
  • [31] Incremental Learning for Fine-Grained Image Recognition
    Cao, Liangliang
    Hsiao, Jenhao
    de Juan, Paloma
    Li, Yuncheng
    Thomee, Bart
    ICMR'16: PROCEEDINGS OF THE 2016 ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL, 2016, : 363 - 366
  • [32] ADVERSARIAL LEARNING FOR FINE-GRAINED IMAGE SEARCH
    Lin, Kevin
    Yang, Fan
    Wang, Qiaosong
    Piramuthu, Robinson
    2019 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2019, : 490 - 495
  • [33] Graph-based discriminative features learning for fine-grained image retrieval
    Sun, Han
    Lang, Wenxi
    Xu, Can
    Liu, Ningzhong
    Zhou, Huiyu
    SIGNAL PROCESSING-IMAGE COMMUNICATION, 2023, 110
  • [34] Cost-Sensitive Deep Metric Learning for Fine-Grained Image Classification
    Zhao, Junjie
    Peng, Yuxin
    MULTIMEDIA MODELING, MMM 2018, PT I, 2018, 10704 : 130 - 141
  • [35] Accuracy improvement for fine-grained image classification with semi-supervised learning
    Yu, Lei
    Cheng, Le
    Zhang, Jinli
    Zhu, Hongna
    Gao, Xiaorong
    2019 ASIA COMMUNICATIONS AND PHOTONICS CONFERENCE (ACP), 2019,
  • [36] Attention-based supervised contrastive learning on fine-grained image classification
    Li, Qian
    Wu, Weining
    PATTERN ANALYSIS AND APPLICATIONS, 2024, 27 (03)
  • [37] Feature relocation network for fine-grained image classification
    Zhao, Peng
    Li, Yi
    Tang, Baowei
    Liu, Huiting
    Yao, Sheng
    NEURAL NETWORKS, 2023, 161 : 306 - 317
  • [38] Fine-grained Image Classification Combined with Label Description
    Shi, Xiruo
    Xu, Liutong
    Wang, Pengfei
    2019 IEEE 31ST INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI 2019), 2019, : 1057 - 1064
  • [39] Efficient Image Embedding for Fine-Grained Visual Classification
    Payatsuporn, Soranan
    Kijsirikul, Boonserm
    2022-14TH INTERNATIONAL CONFERENCE ON KNOWLEDGE AND SMART TECHNOLOGY (KST 2022), 2022, : 40 - 45
  • [40] Separated smooth sampling for fine-grained image classification
    Rong, Shenghai
    Wang, Zilei
    Wang, Jie
    NEUROCOMPUTING, 2021, 461 : 350 - 359