ELoPE: Fine-Grained Visual Classification with Efficient Localization, Pooling and Embedding

被引:0
|
作者
Hanselmann, Harald [1 ,2 ]
Ney, Hermann [1 ,2 ]
机构
[1] Rhein Westfal TH Aachen, Comp Sci Dept, Human Language Technol & Pattern Recognit Grp, D-52062 Aachen, Germany
[2] AppTek GmbH, D-52062 Aachen, Germany
关键词
D O I
10.1109/wacv45572.2020.9093601
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The task of fine-grained visual classification (FGVC) deals with classification problems that display a small interclass variance such as distinguishing between different bird species or car models. State-of-the-art approaches typically tackle this problem by integrating an elaborate attention mechanism or (part-) localization method into a standard convolutional neural network (CNN). Also in this work the aim is to enhance the performance of a backbone CNN such as ResNet by including three efficient and lightweight components specifically designed for FGVC. This is achieved by using global k-max pooling, a discriminative embedding layer trained by optimizing class means and an efficient localization module that estimates bounding boxes using only class labels for training. The resulting model achieves state-of-the-art recognition accuracies on multiple FGVC benchmark datasets.
引用
收藏
页码:1236 / 1245
页数:10
相关论文
共 50 条
  • [1] Efficient Image Embedding for Fine-Grained Visual Classification
    Payatsuporn, Soranan
    Kijsirikul, Boonserm
    [J]. 2022-14TH INTERNATIONAL CONFERENCE ON KNOWLEDGE AND SMART TECHNOLOGY (KST 2022), 2022, : 40 - 45
  • [2] Fine-grained visual classification via multilayer bilinear pooling with object localization
    Li, Ming
    Lei, Lin
    Sun, Hao
    Li, Xiao
    Kuang, Gangyao
    [J]. VISUAL COMPUTER, 2022, 38 (03): : 811 - 820
  • [3] Fine-grained visual classification via multilayer bilinear pooling with object localization
    Ming Li
    Lin Lei
    Hao Sun
    Xiao Li
    Gangyao Kuang
    [J]. The Visual Computer, 2022, 38 : 811 - 820
  • [4] Grassmann Pooling as Compact Homogeneous Bilinear Pooling for Fine-Grained Visual Classification
    Wei, Xing
    Zhang, Yue
    Gong, Yihong
    Zhang, Jiawei
    Zheng, Nanning
    [J]. COMPUTER VISION - ECCV 2018, PT III, 2018, 11207 : 365 - 380
  • [5] Fine-grained Image Classification by Visual-Semantic Embedding
    Xu, Huapeng
    Qi, Guilin
    Li, Jingjing
    Wang, Meng
    Xu, Kang
    Gao, Huan
    [J]. PROCEEDINGS OF THE TWENTY-SEVENTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2018, : 1043 - 1049
  • [6] Attention Bilinear Pooling for Fine-Grained Classification
    Wang, Wenqian
    Zhang, Jun
    Wang, Fenglei
    [J]. SYMMETRY-BASEL, 2019, 11 (08):
  • [7] Context-aware Attentional Pooling (CAP) for Fine-grained Visual Classification
    Behera, Ardhendu
    Wharton, Zachary
    Hewage, Pradeep R. P. G.
    Bera, Asish
    [J]. THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 929 - 937
  • [8] Fine-Grained Visual Classification Network Based on Fusion Pooling and Attention Enhancement
    Xiao, Bin
    Guo, Jingwei
    Zhang, Xingpeng
    Wang, Min
    [J]. Moshi Shibie yu Rengong Zhineng/Pattern Recognition and Artificial Intelligence, 2023, 36 (07): : 661 - 670
  • [9] Leveraging Fine-Grained Labels to Regularize Fine-Grained Visual Classification
    Wu, Junfeng
    Yao, Li
    Liu, Bin
    Ding, Zheyuan
    [J]. PROCEEDINGS OF THE 11TH INTERNATIONAL CONFERENCE ON COMPUTER MODELING AND SIMULATION (ICCMS 2019) AND 8TH INTERNATIONAL CONFERENCE ON INTELLIGENT COMPUTING AND APPLICATIONS (ICICA 2019), 2019, : 133 - 136
  • [10] Squeezed Bilinear Pooling for Fine-Grained Visual Categorization
    Liao, Qiyu
    Wang, Dadong
    Holewa, Hamish
    Xu, Min
    [J]. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW), 2019, : 728 - 732