Learning enhanced features and inferring twice for fine-grained image classification

被引:5
|
作者
Nie, Xuan [1 ]
Chai, Bosong [1 ]
Wang, Luyao [1 ]
Liao, Qiyu [2 ]
Xu, Min [2 ]
机构
[1] Northwestern Polytech Univ, Xian, Peoples R China
[2] Univ Technol Sydney Ultimo, Sydney, NSW, Australia
关键词
Fine-grained visual categorization (FGVC); Image classification; Convolutional neural networks (CNN); CNNS;
D O I
10.1007/s11042-022-13619-z
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Fine-Grained Visual Categorization (FGVC) aims to distinguish between extremely similar subordinate-level categories within the same basic-level category. Existing research has proven the great importance of the discriminative features in FGVC but ignored the contributions for correct classification from other features, and the extracted features always contain more information about the obvious regions but less about subtle regions. In this paper, firstly, a novel module named forcing module is proposed to force the network to extract more diverse features for FGVC, which generates a suppression mask based on the class activation maps to suppress the most distinguishable regions, so as to force the network to extract other secondary distinguishable features as the final features. The forcing module consists of the original branch and the forcing branch. The original branch focuses on the primary discriminative regions while the forcing branch focuses on secondary discriminative regions. Secondly, in order to solve the problem that information of small-scale distinguishable features is lost seriously after multi-layer down-sampling, according to the class activation maps of the first prediction, the object is cropped and scaled as the second input. To reduce the prediction error, the first and second prediction probabilities are fused as the final prediction result. Experimental results indicate that the proposed method not only outperforms the baseline model by a large margin (3.7%, 5.9%, 3.1% respectively) on CUB-200-2011, Stanford-Cars, and FGVC-Aircraft, but also achieves state-of-the-art performance on FGVC-Aircraft.
引用
收藏
页码:14799 / 14813
页数:15
相关论文
共 50 条
  • [21] Image Classification With Tailored Fine-Grained Dictionaries
    Shu, Xiangbo
    Tang, Jinhui
    Qi, Guo-Jun
    Li, Zechao
    Jiang, Yu-Gang
    Yan, Shuicheng
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2018, 28 (02) : 454 - 467
  • [22] FINE-GRAINED IMAGE CLASSIFICATION WITH COARSE AND FINE LABELS ON ONE-SHOT LEARNING
    Jiao, Qihan
    Liu, Zhi
    Li, Gongyang
    Ye, Linwei
    Wang, Yang
    2020 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO WORKSHOPS (ICMEW), 2020,
  • [23] Beyond the Attention: Distinguish the Discriminative and Confusable Features For Fine-grained Image Classification
    Shi, Xiruo
    Xu, Liutong
    Wang, Pengfei
    Gao, Yuanyuan
    Jian, Haifang
    Liu, Wu
    MM '20: PROCEEDINGS OF THE 28TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, 2020, : 601 - 609
  • [24] Towards Fine-Grained Recognition: Joint Learning for Object Detection and Fine-Grained Classification
    Wang, Qiaosong
    Rasmussen, Christopher
    ADVANCES IN VISUAL COMPUTING, ISVC 2019, PT II, 2019, 11845 : 332 - 344
  • [25] Learning Noise-Assisted Robust Image Features for Fine-Grained Image Retrieval
    Kumar V.
    Petwal H.
    Gairola A.K.
    Barmola P.P.
    Computer Systems Science and Engineering, 2023, 46 (03): : 2711 - 2724
  • [26] Multi-modal Knowledge-Enhanced Fine-Grained Image Classification
    Cheng, Suyan
    Zhang, Feifei
    Zhou, Haoliang
    Xu, Changsheng
    PATTERN RECOGNITION AND COMPUTER VISION, PT V, PRCV 2024, 2025, 15035 : 333 - 346
  • [27] Learning Features and Parts for Fine-Grained Recognition
    Krause, Jonathan
    Gebru, Timnit
    Deng, Jia
    Li, Li-Jia
    Li Fei-Fei
    2014 22ND INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2014, : 26 - 33
  • [28] Fine-grained Image Classification Based on the Combination of Artificial Features and Deep Convolutional Activation Features
    Zheng, Qinghe
    Yang, Mingqiang
    Zhang, Qingrui
    Zhang, Xinxin
    2017 IEEE/CIC INTERNATIONAL CONFERENCE ON COMMUNICATIONS IN CHINA (ICCC), 2017, : 77 - 82
  • [29] LEARNING TO FOCUS AND DISCRIMINATE FOR FINE-GRAINED CLASSIFICATION
    Feng, Zhicong
    Fu, Keren
    Zhao, Qijun
    2019 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2019, : 415 - 419
  • [30] Learning to locate for fine-grained image recognition
    Chen, Jiamin
    Hu, Jianguo
    Li, Shiren
    COMPUTER VISION AND IMAGE UNDERSTANDING, 2021, 206