ELoPE: Fine-Grained Visual Classification with Efficient Localization, Pooling and Embedding

Cited by: 0
Authors
Hanselmann, Harald [1, 2]
Ney, Hermann [1, 2]
Affiliations
[1] Rhein Westfal TH Aachen, Comp Sci Dept, Human Language Technol & Pattern Recognit Grp, D-52062 Aachen, Germany
[2] AppTek GmbH, D-52062 Aachen, Germany
DOI
10.1109/wacv45572.2020.9093601
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
The task of fine-grained visual classification (FGVC) deals with classification problems that display a small interclass variance such as distinguishing between different bird species or car models. State-of-the-art approaches typically tackle this problem by integrating an elaborate attention mechanism or (part-) localization method into a standard convolutional neural network (CNN). Also in this work the aim is to enhance the performance of a backbone CNN such as ResNet by including three efficient and lightweight components specifically designed for FGVC. This is achieved by using global k-max pooling, a discriminative embedding layer trained by optimizing class means and an efficient localization module that estimates bounding boxes using only class labels for training. The resulting model achieves state-of-the-art recognition accuracies on multiple FGVC benchmark datasets.
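The abstract names global k-max pooling without defining it. The sketch below is an illustration only, assuming the common reading in which each channel of a feature map is reduced to the mean of its k largest spatial activations. The function name `global_k_max_pool`, the nested-list layout `[channel][row][column]`, and the default `k=4` are hypothetical choices made for this example and are not taken from the paper.

```python
from heapq import nlargest
from typing import List, Sequence


def global_k_max_pool(
    feature_maps: Sequence[Sequence[Sequence[float]]], k: int = 4
) -> List[float]:
    """Reduce each channel to the mean of its k largest activations.

    feature_maps is indexed as [channel][row][column] (assumed layout).
    """
    pooled = []
    for channel in feature_maps:
        # Flatten the spatial grid of this channel into one list.
        activations = [value for row in channel for value in row]
        if not 1 <= k <= len(activations):
            raise ValueError("k must be between 1 and the number of positions")
        # Keep the k strongest responses and average them.
        top_k = nlargest(k, activations)
        pooled.append(sum(top_k) / k)
    return pooled


# Two channels on a 2 x 3 spatial grid.
example = [
    [[0.0, 1.0, 2.0], [3.0, 4.0, 5.0]],
    [[6.0, 7.0, 8.0], [9.0, 10.0, 11.0]],
]
print(global_k_max_pool(example, k=2))
```

Under this reading, k = 1 reduces to global max pooling and k equal to the number of spatial positions reduces to global average pooling, so k interpolates between the two.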
Pages: 1236-1245
Number of pages: 10
Related papers
50 in total
  • [31] Fine-Grained Classification via Hierarchical Bilinear Pooling With Aggregated Slack Mask
    Tan, Min
    Wang, Guijun
    Zhou, Jian
    Peng, Zhiyou
    Zheng, Meilian
    [J]. IEEE ACCESS, 2019, 7 : 117944 - 117953
  • [32] AUGMENTING DESCRIPTORS FOR FINE-GRAINED VISUAL CATEGORIZATION USING POLYNOMIAL EMBEDDING
    Nakayama, Hideki
    [J]. 2013 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME 2013), 2013,
  • [33] Dual Transformer With Multi-Grained Assembly for Fine-Grained Visual Classification
    Ji, Ruyi
    Li, Jiaying
    Zhang, Libo
    Liu, Jing
    Wu, Yanjun
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2023, 33 (09) : 5009 - 5021
  • [34] A Progressive Gated Attention Model for Fine-Grained Visual Classification
    Zhu, Qiangxi
    Li, Zhixin
    [J]. 2023 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, ICME, 2023, : 2063 - 2068
  • [35] Learning Hierarchal Channel Attention for Fine-grained Visual Classification
    Guan, Xiang
    Wang, Guoqing
    Xu, Xing
    Bin, Yi
    [J]. PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2021, 2021, : 5011 - 5019
  • [36] Hierarchical attention vision transformer for fine-grained visual classification
    Hu, Xiaobin
    Zhu, Shining
    Peng, Taile
    [J]. JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2023, 91
  • [37] Using Coarse Label Constraint for Fine-Grained Visual Classification
    Lu, Chaohao
    Zou, Yuexian
    [J]. MULTIMEDIA MODELING, MMM 2019, PT II, 2019, 11296 : 266 - 277
  • [38] Symmetrical irregular local features for fine-grained visual classification
    Yang, Ming
    Xu, Yang
    Wu, Zebin
    Wei, Zhihui
    [J]. NEUROCOMPUTING, 2022, 505 : 304 - 314
  • [39] A collaborative gated attention network for fine-grained visual classification
    Zhu, Qiangxi
    Kuang, Wenlan
    Li, Zhixin
    [J]. DISPLAYS, 2023, 79
  • [40] Visual Analytics for Fine-grained Text Classification Models and Datasets
    Battogtokh, M.
    Xing, Y.
    Davidescu, C.
    Abdul-Rahman, A.
    Luck, M.
    Borgo, R.
    [J]. COMPUTER GRAPHICS FORUM, 2024, 43 (03)