Deep CNNs With Spatially Weighted Pooling for Fine-Grained Car Recognition

被引：76

作者：

Hu, Qichang ^{[1
]}

Wang, Huibing ^{[3
,4
]}

Li, Teng ^{[1
]}

Shen, Chunhua ^{[2
]}

机构：

[1] Univ Adelaide, Australian Ctr Visual Technol, Adelaide, SA 5005, Australia

[2] Univ Adelaide, Sch Comp Sci, Adelaide, SA 5005, Australia

[3] Dalian Univ Technol, Sch Comp Sci & Technol, Fac Elect Informat & Elect Engn, Dalian 116024, Peoples R China

[4] Dalian Univ Technol, Sch Comp Sci & Technol, Fac Elect Informat & Elect Engn, Dalian 116024, Peoples R China

来源：

IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS | 2017年 / 18卷 / 11期

关键词：

Deep learning; fine-grained recognition; car model classification; spatially weighted pooling;

D O I：

10.1109/TITS.2017.2679114

中图分类号：

TU [建筑科学];

学科分类号：

0813 ;

摘要：

Fine-grained car recognition aims to recognize the category information of a car, such as car make, car model, or even the year of manufacture. A number of recent studies have shown that a deep convolutional neural network (DCNN) trained on a large-scale data set can achieve impressive results at a range of generic object classification tasks. In this paper, we propose a spatially weighted pooling (SWP) strategy, which considerably improves the robustness and effectiveness of the feature representation of most dominant DCNNs. More specifically, the SWP is a novel pooling layer, which contains a predefined number of spatially weighted masks or pooling channels. The SWP pools the extracted features of DCNNs with the guidance of its learnt masks, which measures the importance of the spatial units in terms of discriminative power. As the existing methods that apply uniform grid pooling on the convolutional feature maps of DCNNs, the proposed method can extract the convolutional features and generate the pooling channels from a single DCNN. Thus minimal modification is needed in terms of implementation. Moreover, the parameters of the SWP layer can be learned in the end-to-end training process of the DCNN. By applying our method to several fine-grained car recognition data sets, we demonstrate that the proposed method can achieve better performance than recent approaches in the literature. We advance the state-of-the-art results by improving the accuracy from 92.6% to 93.1% on the Stanford Cars-196 data set and 91.2% to 97.6% on the recent CompCars data set. We have also tested the proposed method on two additional large-scale data sets with impressive results observed.

引用

页码：3147 / 3156

页数：10

共 50 条

[21] Fine-grained Early Frequency Attention for Deep Speaker Recognition
Hajavi, Amirhossein
Etemad, Ali
2022 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2022,
[22] Squeezed Bilinear Pooling for Fine-Grained Visual Categorization
Liao, Qiyu
Wang, Dadong
Holewa, Hamish
Xu, Min
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW), 2019, : 728 - 732
[23] Grouping Bilinear Pooling for Fine-Grained Image Classification
Zeng, Rui
He, Jingsong
APPLIED SCIENCES-BASEL, 2022, 12 (10):
[24] Fine-Grained Car Recognition Model Based on Semantic DCNN Features Fusion
Yang J.
Cao H.
Wang R.
Xue L.
Jisuanji Fuzhu Sheji Yu Tuxingxue Xuebao/Journal of Computer-Aided Design and Computer Graphics, 2019, 31 (01): : 141 - 157
[25] A Heterogeneous Framework to Accelerate CNNs with Fine-Grained FPGA Management
Guo K.-C.
Wu C.-G.
Zhang W.-F.
Qi Z.-W.
Guan H.-B.
1600, Science Press (44): : 2529 - 2541
[26] Grassmann Pooling as Compact Homogeneous Bilinear Pooling for Fine-Grained Visual Classification
Wei, Xing
Zhang, Yue
Gong, Yihong
Zhang, Jiawei
Zheng, Nanning
COMPUTER VISION - ECCV 2018, PT III, 2018, 11207 : 365 - 380
[27] Bilinear CNNs for Blind Quality Assessment of Fine-Grained Images
Liu, Lixia
Yan, Jiebin
Fang, Yuming
Jiang, Wenhui
2022 IEEE 24TH INTERNATIONAL WORKSHOP ON MULTIMEDIA SIGNAL PROCESSING (MMSP), 2022,
[28] Fine-grained Cars Recognition using Deep Convolutional Neural Networks
Oliveira, Franklin
Macena, Arianne
Kamel, Otavio
Souza, Wesley
Freitas, Nicksson
Vinuto, Tiago
2022 35TH SIBGRAPI CONFERENCE ON GRAPHICS, PATTERNS AND IMAGES (SIBGRAPI 2022), 2022, : 240 - 245
[29] DEEP MULTI-CONTEXT NETWORK FOR FINE-GRAINED VISUAL RECOGNITION
Ou, Xinyu
Wei, Zhen
Ling, Hefei
Liu, Si
Cao, Xiaochun
2016 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA & EXPO WORKSHOPS (ICMEW), 2016,
[30] Human Action Recognition Using Deep Data: A Fine-Grained Study
Rao, D. Surendra
Potturu, Sudharsana Rao
Bhagyaraju, V
INTERNATIONAL JOURNAL OF COMPUTER SCIENCE AND NETWORK SECURITY, 2022, 22 (06): : 97 - 108

← 1 2 3 4 5 →