Hierarchical Bilinear Pooling for Fine-Grained Visual Recognition

Cited by: 200
Authors
Yu, Chaojian [1]
Zhao, Xinyi [1]
Zheng, Qi [1]
Zhang, Peng [1]
You, Xinge [1]
Affiliation
[1] Huazhong Univ Sci & Technol, Sch Elect Informat & Commun, Wuhan, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
Fine-grained visual recognition; Cross-layer interaction; Hierarchical bilinear pooling;
DOI
10.1007/978-3-030-01270-0_35
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Fine-grained visual recognition is challenging because it highly relies on the modeling of various semantic parts and fine-grained feature learning. Bilinear pooling based models have been shown to be effective at fine-grained recognition, while most previous approaches neglect the fact that inter-layer part feature interaction and fine-grained feature learning are mutually correlated and can reinforce each other. In this paper, we present a novel model to address these issues. First, a cross-layer bilinear pooling approach is proposed to capture the inter-layer part feature relations, which results in superior performance compared with other bilinear pooling based approaches. Second, we propose a novel hierarchical bilinear pooling framework to integrate multiple cross-layer bilinear features to enhance their representation capability. Our formulation is intuitive, efficient and achieves state-of-the-art results on the widely used fine-grained recognition datasets.
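The abstract gives no formulas, so the following is only a minimal sketch of the two ideas it names, under stated assumptions. It assumes a factorized form of bilinear pooling in which features from two layers are projected into a shared space and multiplied element-wise, and it assumes the hierarchical step simply concatenates the descriptors of every layer pair. The function names, the projection matrices `U` and `V`, and the toy sizes are hypothetical and are not taken from the paper.

```python
import numpy as np


def cross_layer_bilinear(x, y, U, V):
    """Pool the interaction between feature maps from two layers.

    x, y: arrays of shape (N, C): N spatial locations, C channels each,
          taken from two different layers (same spatial size assumed).
    U, V: projection matrices of shape (C, D); hypothetical parameters.
    Returns a D-dimensional descriptor.
    """
    # Project both layers to D dims, multiply element-wise at each
    # location, then sum-pool over all locations.
    z = ((x @ U) * (y @ V)).sum(axis=0)
    # Signed square root and L2 normalisation, a common post-processing
    # step for bilinear features (an assumption here).
    z = np.sign(z) * np.sqrt(np.abs(z))
    return z / (np.linalg.norm(z) + 1e-12)


def hierarchical_bilinear(layers, projections):
    """Concatenate cross-layer descriptors over every pair of layers."""
    parts = []
    for i in range(len(layers)):
        for j in range(i + 1, len(layers)):
            parts.append(
                cross_layer_bilinear(
                    layers[i], layers[j], projections[i], projections[j]
                )
            )
    return np.concatenate(parts)


# Toy data: 3 layers, 49 locations, 8 channels, 16-dim projections.
rng = np.random.default_rng(0)
N, C, D = 49, 8, 16
layers = [rng.standard_normal((N, C)) for _ in range(3)]
projections = [rng.standard_normal((C, D)) for _ in range(3)]
feature = hierarchical_bilinear(layers, projections)
print(feature.shape)  # (48,): 3 layer pairs x 16 dims
```

In a trained network the projections would be learned and the layer inputs would be convolutional activations; random arrays are used here only to show the shapes involved.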
Pages: 595-610
Number of pages: 16
Related papers
50 in total
  • [21] Low-rank Bilinear Pooling for Fine-Grained Classification
    Kong, Shu
    Fowlkes, Charless
    [J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017 : 7025 - 7034
  • [22] Hyperlayer Bilinear Pooling with application to fine-grained categorization and image retrieval
    Sun, Qiule
    Wang, Qilong
    Zhang, Jianxin
    Li, Peihua
    [J]. NEUROCOMPUTING, 2018, 282 : 174 - 183
  • [23] Hierarchical Spatial Pyramid Pooling for Fine-Grained Vehicle Classification
    Rachmadi, Reza Fuad
    Uchimura, Keiichi
    Koutaki, Gou
    Ogata, Kohichi
    [J]. 2018 INTERNATIONAL WORKSHOP ON BIG DATA AND INFORMATION SECURITY (IWBIS), 2018 : 19 - 24
  • [24] GBP: Graph convolutional network embedded in bilinear pooling for fine-grained encoding
    Du, Yinan
    Tang, Jian
    Rui, Ting
    Li, Xinxin
    Yang, Chengsong
    [J]. COMPUTERS & ELECTRICAL ENGINEERING, 2024, 116
  • [25] Multi-Scale Feature Fusion of Covariance Pooling Networks for Fine-Grained Visual Recognition
    Qian, Lulu
    Yu, Tan
    Yang, Jianyu
    [J]. SENSORS, 2023, 23 (08)
  • [26] Fine-Grained Crowdsourcing for Fine-Grained Recognition
    Jia Deng
    Krause, Jonathan
    Li Fei-Fei
    [J]. 2013 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2013 : 580 - 587
  • [27] Fine-grained image analysis for facial expression recognition using deep convolutional neural networks with bilinear pooling
    Hossain, Sanoar
    Umer, Saiyed
    Rout, Ranjeet Kr.
    Tanveer, M.
    [J]. APPLIED SOFT COMPUTING, 2023, 134
  • [28] Annotation modification for fine-grained visual recognition
    Luo, Changzhi
    Meng, Zhijun
    Feng, Jiashi
    Ni, Bingbing
    Wang, Meng
    [J]. NEUROCOMPUTING, 2018, 274 : 58 - 65
  • [29] Hierarchical Part Matching for Fine-Grained Visual Categorization
    Xie, Lingxi
    Tian, Qi
    Hong, Richang
    Yan, Shuicheng
    Zhang, Bo
    [J]. 2013 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2013 : 1641 - 1648
  • [30] ELoPE: Fine-Grained Visual Classification with Efficient Localization, Pooling and Embedding
    Hanselmann, Harald
    Ney, Hermann
    [J]. 2020 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2020 : 1236 - 1245