Bilinear Convolutional Neural Networks for Fine-Grained Visual Recognition

Cited by: 226
Authors
Lin, Tsung-Yu [1 ]
RoyChowdhury, Aruni [1 ]
Maji, Subhransu [1 ]
Affiliations
[1] Univ Massachusetts, Coll Informat & Comp Sci, Amherst, MA 01003 USA
Funding
U.S. National Science Foundation;
Keywords
Fine-grained recognition; texture representations; second order pooling; bilinear models; convolutional networks;
DOI
10.1109/TPAMI.2017.2723400
Chinese Library Classification
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
We present a simple and effective architecture for fine-grained recognition called Bilinear Convolutional Neural Networks (B-CNNs). These networks represent an image as a pooled outer product of features derived from two CNNs and capture localized feature interactions in a translationally invariant manner. B-CNNs are related to orderless texture representations built on deep features but can be trained in an end-to-end manner. Our most accurate model obtains 84.1, 79.4, 84.5 and 91.3 percent per-image accuracy on the Caltech-UCSD birds [1], NABirds [2], FGVC aircraft [3], and Stanford cars [4] datasets, respectively, and runs at 30 frames-per-second on an NVIDIA Titan X GPU. We then present a systematic analysis of these networks and show that the bilinear features (1) are highly redundant and can be reduced by an order of magnitude in size without significant loss in accuracy, (2) are also effective for other image classification tasks such as texture and scene recognition, and (3) can be trained from scratch on the ImageNet dataset, offering consistent improvements over the baseline architecture. Finally, we present visualizations of these models on various datasets using top activations of neural units and gradient-based inversion techniques. The source code for the complete system is available at http://vis-www.cs.umass.edu/bcnn.
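The "pooled outer product of features derived from two CNNs" mentioned in the abstract can be illustrated with a minimal NumPy sketch. It assumes the two feature maps have already been extracted and flattened to arrays of shape (locations, channels). The function name `bilinear_pool` and the random input arrays are hypothetical stand-ins, not part of the authors' released code, and any normalization or classifier stages are left out.

```python
import numpy as np


def bilinear_pool(feat_a, feat_b):
    """Sum-pool the outer products of two feature sets over image locations.

    feat_a: array of shape (L, Da), features from the first network
    feat_b: array of shape (L, Db), features from the second network
    Returns a flat descriptor of length Da * Db.
    (Hypothetical helper for illustration only.)
    """
    if feat_a.shape[0] != feat_b.shape[0]:
        raise ValueError("both feature sets must cover the same locations")
    # feat_a.T @ feat_b equals the sum over locations l of outer(a_l, b_l).
    pooled = feat_a.T @ feat_b
    return pooled.reshape(-1)


# Random stand-ins for CNN activations (assumed sizes: a 7x7 grid of locations).
rng = np.random.default_rng(0)
num_locations, dim_a, dim_b = 49, 8, 6
feat_a = rng.standard_normal((num_locations, dim_a))
feat_b = rng.standard_normal((num_locations, dim_b))

descriptor = bilinear_pool(feat_a, feat_b)

# Shuffling the locations (consistently in both streams) leaves the
# descriptor unchanged, which is the "orderless" property of sum pooling.
perm = rng.permutation(num_locations)
descriptor_shuffled = bilinear_pool(feat_a[perm], feat_b[perm])
```

Because the descriptor length is the product of the two channel counts, it grows quickly with feature dimension, which is consistent with the abstract's remark that the bilinear features are highly redundant and can be reduced in size.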
Pages: 1309 - 1322
Page count: 14
Related papers
50 in total
  • [1] On fine-grained visual explanation in convolutional neural networks
    Lei, Xia
    Fan, Yongkai
    Luo, Xiong-Lin
    [J]. DIGITAL COMMUNICATIONS AND NETWORKS, 2023, 9 (05) : 1141 - 1147
  • [2] Fine-Grained Visual Classification Based on Sparse Bilinear Convolutional Neural Network
    Ma, Li
    Wang, Yongxiong
    [J]. Moshi Shibie yu Rengong Zhineng/Pattern Recognition and Artificial Intelligence, 2019, 32 (04) : 336 - 344
  • [3] Interpretable and Fine-Grained Visual Explanations for Convolutional Neural Networks
    Wagner, Joerg
    Koehler, Jan Mathias
    Gindele, Tobias
    Hetzel, Leon
    Wiedemer, Jakob Thaddaeus
    Behnke, Sven
    [J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 9089 - 9099
  • [4] Fine-Grained Breast Cancer Classification With Bilinear Convolutional Neural Networks (BCNNs)
    Liu, Weihuang
    Juhas, Mario
    Zhang, Yang
    [J]. FRONTIERS IN GENETICS, 2020, 11
  • [5] Hierarchical Bilinear Pooling for Fine-Grained Visual Recognition
    Yu, Chaojian
    Zhao, Xinyi
    Zheng, Qi
    Zhang, Peng
    You, Xinge
    [J]. COMPUTER VISION - ECCV 2018, PT XVI, 2018, 11220 : 595 - 610
  • [6] Bilinear CNN Models for Fine-grained Visual Recognition
    Lin, Tsung-Yu
    RoyChowdhury, Aruni
    Maji, Subhransu
    [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, : 1449 - 1457
  • [7] Fine-grained Cars Recognition using Deep Convolutional Neural Networks
    Oliveira, Franklin
    Macena, Arianne
    Kamel, Otavio
    Souza, Wesley
    Freitas, Nicksson
    Vinuto, Tiago
    [J]. 2022 35TH SIBGRAPI CONFERENCE ON GRAPHICS, PATTERNS AND IMAGES (SIBGRAPI 2022), 2022, : 240 - 245
  • [8] INCREASINGLY SPECIALIZED ENSEMBLE OF CONVOLUTIONAL NEURAL NETWORKS FOR FINE-GRAINED RECOGNITION
    Simonelli, Andrea
    Messelodi, Stefano
    De Natale, Francesco
    Bulo, Samuel Rota
    [J]. 2018 25TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2018, : 594 - 598
  • [9] Fine-grained image analysis for facial expression recognition using deep convolutional neural networks with bilinear pooling
    Hossain, Sanoar
    Umer, Saiyed
    Rout, Ranjeet Kr.
    Tanveer, M.
    [J]. APPLIED SOFT COMPUTING, 2023, 134
  • [10] Kernelized Bilinear CNN Models for Fine-Grained Visual Recognition
    Ge, Shu-Yu
    Gao, Zi-Lin
    Zhang, Bing-Bing
    Li, Pei-Hua
    [J]. Tien Tzu Hsueh Pao/Acta Electronica Sinica, 2019, 47 (10) : 2134 - 2141