Deep CNNs With Spatially Weighted Pooling for Fine-Grained Car Recognition

Cited by: 76
Authors
Hu, Qichang [1]
Wang, Huibing [3, 4]
Li, Teng [1]
Shen, Chunhua [2]
Affiliations
[1] Univ Adelaide, Australian Ctr Visual Technol, Adelaide, SA 5005, Australia
[2] Univ Adelaide, Sch Comp Sci, Adelaide, SA 5005, Australia
[3] Dalian Univ Technol, Sch Comp Sci & Technol, Fac Elect Informat & Elect Engn, Dalian 116024, Peoples R China
[4] Dalian Univ Technol, Sch Comp Sci & Technol, Fac Elect Informat & Elect Engn, Dalian 116024, Peoples R China
Keywords
Deep learning; fine-grained recognition; car model classification; spatially weighted pooling
DOI
10.1109/TITS.2017.2679114
Chinese Library Classification
TU [Building Science]
Subject classification code
0813
Abstract
Fine-grained car recognition aims to recognize the category information of a car, such as its make, model, or even year of manufacture. A number of recent studies have shown that a deep convolutional neural network (DCNN) trained on a large-scale data set can achieve impressive results on a range of generic object classification tasks. In this paper, we propose a spatially weighted pooling (SWP) strategy, which considerably improves the robustness and effectiveness of the feature representation of most dominant DCNNs. More specifically, the SWP is a novel pooling layer that contains a predefined number of spatially weighted masks, or pooling channels. The SWP pools the extracted features of DCNNs under the guidance of its learned masks, which measure the importance of the spatial units in terms of discriminative power. Like existing methods that apply uniform grid pooling to the convolutional feature maps of DCNNs, the proposed method can extract the convolutional features and generate the pooling channels from a single DCNN, so minimal modification is needed in terms of implementation. Moreover, the parameters of the SWP layer can be learned in the end-to-end training process of the DCNN. By applying our method to several fine-grained car recognition data sets, we demonstrate that the proposed method can achieve better performance than recent approaches in the literature. We advance the state-of-the-art results by improving the accuracy from 92.6% to 93.1% on the Stanford Cars-196 data set and from 91.2% to 97.6% on the recent CompCars data set. We have also tested the proposed method on two additional large-scale data sets, where it likewise performed well.
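The pooling step described in the abstract can be illustrated with a minimal NumPy sketch. It is not the authors' implementation: the function name `spatially_weighted_pool`, the tensor shapes, the number of masks, and the choice to concatenate the per-mask outputs are all assumptions made for illustration, and the masks here are fixed random arrays, whereas in the paper they are parameters learned end to end with the DCNN.

```python
import numpy as np

def spatially_weighted_pool(features, masks):
    """Pool a feature map with K spatial masks (illustrative sketch only).

    features: array of shape (C, H, W), the last convolutional feature map.
    masks:    array of shape (K, H, W), one weight per spatial unit per mask.
    Returns a vector of length K * C: for each mask, the mask-weighted sum
    of every channel over the H x W grid.
    """
    if features.shape[1:] != masks.shape[1:]:
        raise ValueError("features and masks must share the same H x W grid")
    # pooled[k, c] = sum over (h, w) of masks[k, h, w] * features[c, h, w]
    pooled = np.einsum("khw,chw->kc", masks, features)
    return pooled.reshape(-1)

# Hypothetical sizes: 512 channels on a 7 x 7 grid, 9 pooling channels.
rng = np.random.default_rng(0)
features = rng.standard_normal((512, 7, 7))
masks = rng.standard_normal((9, 7, 7))  # stand-in for learned masks
descriptor = spatially_weighted_pool(features, masks)
print(descriptor.shape)  # (4608,)
```

With a single mask whose entries all equal 1 / (H * W), this sketch reduces to global average pooling, which gives one way to see the SWP layer as a learned generalization of uniform pooling.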
Pages: 3147-3156
Number of pages: 10
Related papers
50 in total
  • [1] Multi-Path Deep CNNs for Fine-Grained Car Recognition
    Wang, Huibing
    Peng, Jinjia
    Zhao, Yanzhu
    Fu, Xianping
    IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2020, 69 (10) : 10484 - 10493
  • [2] Selective Pooling Vector for Fine-grained Recognition
    Chen, Guang
    Yang, Jianchao
    Jin, Hailin
    Shechtman, Eli
    Brandt, Jonathan
    Han, Tony X.
    2015 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2015, : 860 - 867
  • [3] Semantic bilinear pooling for fine-grained recognition
    School of Computer and Communication Engineering, University of Science and Technology Beijing, Beijing, China
    Proc. Int. Conf. Pattern Recognit., (3660-3666):
  • [4] Semantic Bilinear Pooling for Fine-Grained Recognition
    Li, Xinjie
    Yang, Chun
    Chen, Song-Lu
    Zhu, Chao
    Yin, Xu-Cheng
    2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 3660 - 3666
  • [5] Fine-Grained Vehicle Classification With Channel Max Pooling Modified CNNs
    Ma, Zhanyu
    Chang, Dongliang
    Xie, Jiyang
    Ding, Yifeng
    Wen, Shaoguo
    Li, Xiaoxu
    Si, Zhongwei
    Guo, Jun
    IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2019, 68 (04) : 3224 - 3233
  • [6] Hierarchical Bilinear Pooling for Fine-Grained Visual Recognition
    Yu, Chaojian
    Zhao, Xinyi
    Zheng, Qi
    Zhang, Peng
    You, Xinge
    COMPUTER VISION - ECCV 2018, PT XVI, 2018, 11220 : 595 - 610
  • [7] Deep LSAC for Fine-Grained Recognition
    Lin, Di
    Wang, Yi
    Liang, Lingyu
    Li, Ping
    Chen, C. L. Philip
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022, 33 (01) : 200 - 214
  • [8] Attention Bilinear Pooling for Fine-Grained Facial Expression Recognition
    Liu, Liyuan
    Zhang, Lifeng
    Jia, Shixiang
    CYBERSPACE SAFETY AND SECURITY, PT II, 2019, 11983 : 535 - 542
  • [9] On the Eigenvalues of Global Covariance Pooling for Fine-Grained Visual Recognition
    Song, Yue
    Sebe, Nicu
    Wang, Wei
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (03) : 3554 - 3566
  • [10] Fine-Grained Crowdsourcing for Fine-Grained Recognition
    Jia Deng
    Krause, Jonathan
    Li Fei-Fei
    2013 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2013, : 580 - 587