Learning enhanced features and inferring twice for fine-grained image classification

被引:5
|
作者
Nie, Xuan [1 ]
Chai, Bosong [1 ]
Wang, Luyao [1 ]
Liao, Qiyu [2 ]
Xu, Min [2 ]
机构
[1] Northwestern Polytech Univ, Xian, Peoples R China
[2] Univ Technol Sydney Ultimo, Sydney, NSW, Australia
关键词
Fine-grained visual categorization (FGVC); Image classification; Convolutional neural networks (CNN); CNNS;
D O I
10.1007/s11042-022-13619-z
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Fine-Grained Visual Categorization (FGVC) aims to distinguish between extremely similar subordinate-level categories within the same basic-level category. Existing research has proven the great importance of the discriminative features in FGVC but ignored the contributions for correct classification from other features, and the extracted features always contain more information about the obvious regions but less about subtle regions. In this paper, firstly, a novel module named forcing module is proposed to force the network to extract more diverse features for FGVC, which generates a suppression mask based on the class activation maps to suppress the most distinguishable regions, so as to force the network to extract other secondary distinguishable features as the final features. The forcing module consists of the original branch and the forcing branch. The original branch focuses on the primary discriminative regions while the forcing branch focuses on secondary discriminative regions. Secondly, in order to solve the problem that information of small-scale distinguishable features is lost seriously after multi-layer down-sampling, according to the class activation maps of the first prediction, the object is cropped and scaled as the second input. To reduce the prediction error, the first and second prediction probabilities are fused as the final prediction result. Experimental results indicate that the proposed method not only outperforms the baseline model by a large margin (3.7%, 5.9%, 3.1% respectively) on CUB-200-2011, Stanford-Cars, and FGVC-Aircraft, but also achieves state-of-the-art performance on FGVC-Aircraft.
引用
收藏
页码:14799 / 14813
页数:15
相关论文
共 50 条
  • [1] Learning enhanced features and inferring twice for fine-grained image classification
    Xuan Nie
    Bosong Chai
    Luyao Wang
    Qiyu Liao
    Min Xu
    Multimedia Tools and Applications, 2023, 82 : 14799 - 14813
  • [2] Learning Semantically Enhanced Feature for Fine-Grained Image Classification
    Luo, Wei
    Zhang, Hengmin
    Li, Jun
    Wei, Xiu-Shen
    IEEE SIGNAL PROCESSING LETTERS, 2020, 27 (27) : 1545 - 1549
  • [3] Learning Two-level Features for Fine-grained Image Classification
    Ji, Jinsheng
    Jiang, Linfeng
    Lei, Chenxi
    Zhong, Weilin
    Xiong, Huilin
    PROCEEDINGS OF 2018 14TH IEEE INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING (ICSP), 2018, : 544 - 549
  • [4] Learning Cascade Attention for fine-grained image classification
    Zhu, Youxiang
    Li, Ruochen
    Yang, Yin
    Ye, Ning
    NEURAL NETWORKS, 2020, 122 : 174 - 182
  • [5] DEEP DICTIONARY LEARNING FOR FINE-GRAINED IMAGE CLASSIFICATION
    Srinivas, M.
    Lin, Yen-Yu
    Liao, Hong-Yuan Mark
    2017 24TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2017, : 835 - 839
  • [6] Fine-Grained Image Classification Based on Multi-Modal Features and Enhanced Alignment
    Han, Jing
    Zhang, Tianpeng
    Lyu, Xueqiang
    Beijing Youdian Daxue Xuebao/Journal of Beijing University of Posts and Telecommunications, 2024, 47 (04): : 130 - 135
  • [7] Fine-Grained Features for Image Captioning
    Shao, Mengyue
    Feng, Jie
    Wu, Jie
    Zhang, Haixiang
    Zheng, Yayu
    CMC-COMPUTERS MATERIALS & CONTINUA, 2023, 75 (03): : 4697 - 4712
  • [8] An Interactive Deep Learning Method For Fine-grained Image Classification
    Luo, Liumin
    Wang, Mingxia
    Liu, Xiaoqing
    JOURNAL OF APPLIED SCIENCE AND ENGINEERING, 2025, 28 (04): : 701 - 708
  • [9] Cross-Part Learning for Fine-Grained Image Classification
    Liu, Man
    Zhang, Chunjie
    Bai, Huihui
    Zhang, Riquan
    Zhao, Yao
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2022, 31 : 748 - 758
  • [10] Learning to Navigate for Fine-Grained Classification
    Yang, Ze
    Luo, Tiange
    Wang, Dong
    Hu, Zhiqiang
    Gao, Jun
    Wang, Liwei
    COMPUTER VISION - ECCV 2018, PT XIV, 2018, 11218 : 438 - 454