Deep CNNs With Spatially Weighted Pooling for Fine-Grained Car Recognition

Cited by: 76

Authors:

Hu, Qichang [1]

Wang, Huibing [3,4]

Li, Teng [1]

Shen, Chunhua [2]

Affiliations:

[1] Univ Adelaide, Australian Ctr Visual Technol, Adelaide, SA 5005, Australia

[2] Univ Adelaide, Sch Comp Sci, Adelaide, SA 5005, Australia

[3] Dalian Univ Technol, Sch Comp Sci & Technol, Fac Elect Informat & Elect Engn, Dalian 116024, Peoples R China

[4] Dalian Univ Technol, Sch Comp Sci & Technol, Fac Elect Informat & Elect Engn, Dalian 116024, Peoples R China

Source:

IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS | 2017, Vol. 18, No. 11

Keywords:

Deep learning; fine-grained recognition; car model classification; spatially weighted pooling

DOI:

10.1109/TITS.2017.2679114

Chinese Library Classification number:

TU [Architectural Science]

Discipline classification code:

0813

Abstract:

Fine-grained car recognition aims to recognize the category information of a car, such as car make, car model, or even the year of manufacture. A number of recent studies have shown that a deep convolutional neural network (DCNN) trained on a large-scale data set can achieve impressive results on a range of generic object classification tasks. In this paper, we propose a spatially weighted pooling (SWP) strategy, which considerably improves the robustness and effectiveness of the feature representation of most dominant DCNNs. More specifically, the SWP is a novel pooling layer that contains a predefined number of spatially weighted masks, or pooling channels. The SWP pools the extracted features of DCNNs under the guidance of its learned masks, which measure the importance of the spatial units in terms of discriminative power. Like existing methods that apply uniform grid pooling to the convolutional feature maps of DCNNs, the proposed method can extract the convolutional features and generate the pooling channels from a single DCNN. Thus, minimal modification is needed in terms of implementation. Moreover, the parameters of the SWP layer can be learned in the end-to-end training process of the DCNN. By applying our method to several fine-grained car recognition data sets, we demonstrate that the proposed method can achieve better performance than recent approaches in the literature. We advance the state-of-the-art results by improving the accuracy from 92.6% to 93.1% on the Stanford Cars-196 data set and from 91.2% to 97.6% on the recent CompCars data set. We have also tested the proposed method on two additional large-scale data sets, with impressive results observed.
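The pooling step described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes each mask weights every feature channel element-wise and that the weighted responses are summed over the spatial grid (the abstract does not state the reduction). The function name `spatially_weighted_pool` and the toy values are hypothetical, and the masks are fixed here, whereas in the paper they are learned end to end.

```python
# Hypothetical sketch of spatially weighted pooling (SWP) in plain Python.
# Assumption: each mask weights every feature channel element-wise, and the
# weighted responses are summed over the spatial grid.

def spatially_weighted_pool(features, masks):
    """Pool C feature maps (each H x W) with K masks (each H x W).

    Returns a flat list of K * C values, ordered mask by mask.
    """
    pooled = []
    for mask in masks:
        for fmap in features:
            total = 0.0
            for mask_row, feat_row in zip(mask, fmap):
                for m, f in zip(mask_row, feat_row):
                    total += m * f
            pooled.append(total)
    return pooled


# Two toy 2x2 feature channels.
features = [
    [[1.0, 2.0], [3.0, 4.0]],
    [[0.0, 1.0], [1.0, 0.0]],
]
# A uniform mask reproduces average pooling; the second mask attends
# only to the top-left spatial unit.
masks = [
    [[0.25, 0.25], [0.25, 0.25]],
    [[1.0, 0.0], [0.0, 0.0]],
]
descriptor = spatially_weighted_pool(features, masks)
print(descriptor)  # [2.5, 0.5, 1.0, 0.0]
```

With a uniform mask the layer reduces to ordinary average pooling, which is why SWP can be read as a generalization of uniform grid pooling: non-uniform masks let the descriptor emphasize the spatial units that matter most for discrimination.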


Pages: 3147-3156

Page count: 10

50 items in total

[1] Multi-Path Deep CNNs for Fine-Grained Car Recognition
Wang, Huibing
Peng, Jinjia
Zhao, Yanzhu
Fu, Xianping
IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2020, 69 (10) : 10484 - 10493
[2] Selective Pooling Vector for Fine-grained Recognition
Chen, Guang
Yang, Jianchao
Jin, Hailin
Shechtman, Eli
Brandt, Jonathan
Han, Tony X.
2015 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2015, : 860 - 867
[3] Semantic bilinear pooling for fine-grained recognition
School of Computer and Communication Engineering, University of Science and Technology Beijing, Beijing, China
Proc. Int. Conf. Pattern Recognit., : 3660 - 3666
[4] Semantic Bilinear Pooling for Fine-Grained Recognition
Li, Xinjie
Yang, Chun
Chen, Song-Lu
Zhu, Chao
Yin, Xu-Cheng
2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 3660 - 3666
[5] Fine-Grained Vehicle Classification With Channel Max Pooling Modified CNNs
Ma, Zhanyu
Chang, Dongliang
Xie, Jiyang
Ding, Yifeng
Wen, Shaoguo
Li, Xiaoxu
Si, Zhongwei
Guo, Jun
IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2019, 68 (04) : 3224 - 3233
[6] Hierarchical Bilinear Pooling for Fine-Grained Visual Recognition
Yu, Chaojian
Zhao, Xinyi
Zheng, Qi
Zhang, Peng
You, Xinge
COMPUTER VISION - ECCV 2018, PT XVI, 2018, 11220 : 595 - 610
[7] Deep LSAC for Fine-Grained Recognition
Lin, Di
Wang, Yi
Liang, Lingyu
Li, Ping
Chen, C. L. Philip
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022, 33 (01) : 200 - 214
[8] Attention Bilinear Pooling for Fine-Grained Facial Expression Recognition
Liu, Liyuan
Zhang, Lifeng
Jia, Shixiang
CYBERSPACE SAFETY AND SECURITY, PT II, 2019, 11983 : 535 - 542
[9] On the Eigenvalues of Global Covariance Pooling for Fine-Grained Visual Recognition
Song, Yue
Sebe, Nicu
Wang, Wei
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (03) : 3554 - 3566
[10] Fine-Grained Crowdsourcing for Fine-Grained Recognition
Jia Deng
Krause, Jonathan
Li Fei-Fei
2013 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2013, : 580 - 587
