Deep CNNs With Spatially Weighted Pooling for Fine-Grained Car Recognition

被引:76
|
作者
Hu, Qichang [1 ]
Wang, Huibing [3 ,4 ]
Li, Teng [1 ]
Shen, Chunhua [2 ]
机构
[1] Univ Adelaide, Australian Ctr Visual Technol, Adelaide, SA 5005, Australia
[2] Univ Adelaide, Sch Comp Sci, Adelaide, SA 5005, Australia
[3] Dalian Univ Technol, Sch Comp Sci & Technol, Fac Elect Informat & Elect Engn, Dalian 116024, Peoples R China
[4] Dalian Univ Technol, Sch Comp Sci & Technol, Fac Elect Informat & Elect Engn, Dalian 116024, Peoples R China
关键词
Deep learning; fine-grained recognition; car model classification; spatially weighted pooling;
D O I
10.1109/TITS.2017.2679114
中图分类号
TU [建筑科学];
学科分类号
0813 ;
摘要
Fine-grained car recognition aims to recognize the category information of a car, such as car make, car model, or even the year of manufacture. A number of recent studies have shown that a deep convolutional neural network (DCNN) trained on a large-scale data set can achieve impressive results at a range of generic object classification tasks. In this paper, we propose a spatially weighted pooling (SWP) strategy, which considerably improves the robustness and effectiveness of the feature representation of most dominant DCNNs. More specifically, the SWP is a novel pooling layer, which contains a predefined number of spatially weighted masks or pooling channels. The SWP pools the extracted features of DCNNs with the guidance of its learnt masks, which measures the importance of the spatial units in terms of discriminative power. As the existing methods that apply uniform grid pooling on the convolutional feature maps of DCNNs, the proposed method can extract the convolutional features and generate the pooling channels from a single DCNN. Thus minimal modification is needed in terms of implementation. Moreover, the parameters of the SWP layer can be learned in the end-to-end training process of the DCNN. By applying our method to several fine-grained car recognition data sets, we demonstrate that the proposed method can achieve better performance than recent approaches in the literature. We advance the state-of-the-art results by improving the accuracy from 92.6% to 93.1% on the Stanford Cars-196 data set and 91.2% to 97.6% on the recent CompCars data set. We have also tested the proposed method on two additional large-scale data sets with impressive results observed.
引用
收藏
页码:3147 / 3156
页数:10
相关论文
共 50 条
  • [21] Fine-grained Early Frequency Attention for Deep Speaker Recognition
    Hajavi, Amirhossein
    Etemad, Ali
    2022 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2022,
  • [22] Squeezed Bilinear Pooling for Fine-Grained Visual Categorization
    Liao, Qiyu
    Wang, Dadong
    Holewa, Hamish
    Xu, Min
    2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW), 2019, : 728 - 732
  • [23] Grouping Bilinear Pooling for Fine-Grained Image Classification
    Zeng, Rui
    He, Jingsong
    APPLIED SCIENCES-BASEL, 2022, 12 (10):
  • [24] Fine-Grained Car Recognition Model Based on Semantic DCNN Features Fusion
    Yang J.
    Cao H.
    Wang R.
    Xue L.
    Jisuanji Fuzhu Sheji Yu Tuxingxue Xuebao/Journal of Computer-Aided Design and Computer Graphics, 2019, 31 (01): : 141 - 157
  • [25] A Heterogeneous Framework to Accelerate CNNs with Fine-Grained FPGA Management
    Guo K.-C.
    Wu C.-G.
    Zhang W.-F.
    Qi Z.-W.
    Guan H.-B.
    1600, Science Press (44): : 2529 - 2541
  • [26] Grassmann Pooling as Compact Homogeneous Bilinear Pooling for Fine-Grained Visual Classification
    Wei, Xing
    Zhang, Yue
    Gong, Yihong
    Zhang, Jiawei
    Zheng, Nanning
    COMPUTER VISION - ECCV 2018, PT III, 2018, 11207 : 365 - 380
  • [27] Bilinear CNNs for Blind Quality Assessment of Fine-Grained Images
    Liu, Lixia
    Yan, Jiebin
    Fang, Yuming
    Jiang, Wenhui
    2022 IEEE 24TH INTERNATIONAL WORKSHOP ON MULTIMEDIA SIGNAL PROCESSING (MMSP), 2022,
  • [28] Fine-grained Cars Recognition using Deep Convolutional Neural Networks
    Oliveira, Franklin
    Macena, Arianne
    Kamel, Otavio
    Souza, Wesley
    Freitas, Nicksson
    Vinuto, Tiago
    2022 35TH SIBGRAPI CONFERENCE ON GRAPHICS, PATTERNS AND IMAGES (SIBGRAPI 2022), 2022, : 240 - 245
  • [29] DEEP MULTI-CONTEXT NETWORK FOR FINE-GRAINED VISUAL RECOGNITION
    Ou, Xinyu
    Wei, Zhen
    Ling, Hefei
    Liu, Si
    Cao, Xiaochun
    2016 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA & EXPO WORKSHOPS (ICMEW), 2016,
  • [30] Human Action Recognition Using Deep Data: A Fine-Grained Study
    Rao, D. Surendra
    Potturu, Sudharsana Rao
    Bhagyaraju, V
    INTERNATIONAL JOURNAL OF COMPUTER SCIENCE AND NETWORK SECURITY, 2022, 22 (06): : 97 - 108