Fine-grained visual classification via multilayer bilinear pooling with object localization

被引:0
|
作者
Ming Li
Lin Lei
Hao Sun
Xiao Li
Gangyao Kuang
机构
[1] National University of Defense Technology,College of Electronic Science and Technology
来源
The Visual Computer | 2022年 / 38卷
关键词
Fine-grained visual classification; Multilayer bilinear pooling (MLBP); Object localization; Convolutional neural networks (CNNs);
D O I
暂无
中图分类号
学科分类号
摘要
Fine-grained visual classification is a challenging task in the computer vision field. How to explore discriminative features is vital for classification. As one crucial step, exactly object localization is able to eliminate the background noises and highlight interesting objects at the same time. However, some current methods usually use bounding boxes to locate objects, that are not suitable when the poses of objects change. Furthermore, it has been demonstrated that deep features have strong feature representation capability, especially the bilinear pooling features, which achieved superior performance in fine-grained visual classification tasks. However, the bilinear features, which captured only from the last convolutional layer, have limited discriminability, especially when dealing with small-scale objects. In this paper, we propose a multilayer bilinear pooling model combined with object localization. First, a flexible and scalable object localization module is utilized to locate the interesting object in an image instead of using bounding boxes. Then the refined features are obtained by highlighting object region and suppressing background noises. While the multilayer bilinear pooling, which exploits the complementarity between different layers, is used for further extracting more discriminative features. Experiment results on three public datasets show that our proposed method can achieve competitive performance compared with several state-of-the-art methods.
引用
收藏
页码:811 / 820
页数:9
相关论文
共 50 条
  • [1] Fine-grained visual classification via multilayer bilinear pooling with object localization
    Li, Ming
    Lei, Lin
    Sun, Hao
    Li, Xiao
    Kuang, Gangyao
    [J]. VISUAL COMPUTER, 2022, 38 (03): : 811 - 820
  • [2] Grassmann Pooling as Compact Homogeneous Bilinear Pooling for Fine-Grained Visual Classification
    Wei, Xing
    Zhang, Yue
    Gong, Yihong
    Zhang, Jiawei
    Zheng, Nanning
    [J]. COMPUTER VISION - ECCV 2018, PT III, 2018, 11207 : 365 - 380
  • [3] Attention Bilinear Pooling for Fine-Grained Classification
    Wang, Wenqian
    Zhang, Jun
    Wang, Fenglei
    [J]. SYMMETRY-BASEL, 2019, 11 (08):
  • [4] Hierarchical Bilinear Pooling for Fine-Grained Visual Recognition
    Yu, Chaojian
    Zhao, Xinyi
    Zheng, Qi
    Zhang, Peng
    You, Xinge
    [J]. COMPUTER VISION - ECCV 2018, PT XVI, 2018, 11220 : 595 - 610
  • [5] Squeezed Bilinear Pooling for Fine-Grained Visual Categorization
    Liao, Qiyu
    Wang, Dadong
    Holewa, Hamish
    Xu, Min
    [J]. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW), 2019, : 728 - 732
  • [6] Grouping Bilinear Pooling for Fine-Grained Image Classification
    Zeng, Rui
    He, Jingsong
    [J]. APPLIED SCIENCES-BASEL, 2022, 12 (10):
  • [7] ELoPE: Fine-Grained Visual Classification with Efficient Localization, Pooling and Embedding
    Hanselmann, Harald
    Ney, Hermann
    [J]. 2020 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2020, : 1236 - 1245
  • [8] Fine-Grained Classification via Hierarchical Bilinear Pooling With Aggregated Slack Mask
    Tan, Min
    Wang, Guijun
    Zhou, Jian
    Peng, Zhiyou
    Zheng, Meilian
    [J]. IEEE ACCESS, 2019, 7 : 117944 - 117953
  • [9] Saliency Enhanced Hierarchical Bilinear Pooling for Fine-Grained Classification
    Chen, Junying
    Chen, Ying
    [J]. Jisuanji Fuzhu Sheji Yu Tuxingxue Xuebao/Journal of Computer-Aided Design and Computer Graphics, 2021, 33 (02): : 241 - 249
  • [10] Low-rank Bilinear Pooling for Fine-Grained Classification
    Kong, Shu
    Fowlkes, Charless
    [J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 7025 - 7034