Multi-stream Deep Networks for Vehicle Make and Model Recognition

被引:1
|
作者
Besbes, Mohamed Dhia Elhak [1 ]
Kessentini, Yousri [2 ]
Tabia, Hedi [1 ]
机构
[1] Univ Evry, Univ Paris Saclay, IBISC, F-91025 Evry, France
[2] Digital Res Ctr Sfax, Sfax, Tunisia
关键词
Fine-grained Vehicle Recognition; Convolutional Neural Network; Multi-stream Fusion;
D O I
10.5220/0008975404130419
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Vehicle recognition generally aims to classify vehicles based on make, model and year of manufacture. It is a particularly hard problem due to the large number of classes and small inter-class variations. To handle this problem recent state of the art methods use Convolutional Neural Network (CNN). These methods have however several limitations since they extract unstructured vehicle features used for the recognition task. In this paper, we propose more structured feature extraction method by leveraging robust multi-stream deep networks architecture. We employ a novel dynamic combination technique to aggregate different vehicle part features with the entire image. This allows combining global representation with local features. Our system which has been evaluated on publicly available datasets is able to learn highly discriminant representation and achieves state-of-the-art result.
引用
收藏
页码:413 / 419
页数:7
相关论文
共 50 条
  • [1] Fusing multi-stream deep neural networks for facial expression recognition
    Fatima Zahra Salmam
    Abdellah Madani
    Mohamed Kissi
    [J]. Signal, Image and Video Processing, 2019, 13 : 609 - 616
  • [2] Fusing multi-stream deep neural networks for facial expression recognition
    Zahra Salmam, Fatima
    Madani, Abdellah
    Kissi, Mohamed
    [J]. SIGNAL IMAGE AND VIDEO PROCESSING, 2019, 13 (03) : 609 - 616
  • [3] Multi-stream with Deep Convolutional Neural Networks for Human Action Recognition in Videos
    Liu, Xiao
    Yang, Xudong
    [J]. NEURAL INFORMATION PROCESSING (ICONIP 2018), PT I, 2018, 11301 : 251 - 262
  • [4] Multi-stream Convolutional Networks for Indoor Scene Recognition
    Anwer, Rao Muhammad
    Khan, Fahad Shahbaz
    Laaksonen, Jorma
    Zaki, Nazar
    [J]. COMPUTER ANALYSIS OF IMAGES AND PATTERNS, CAIP 2019, PT I, 2019, 11678 : 196 - 208
  • [5] Local Tiled Deep Networks for Recognition of Vehicle Make and Model
    Gao, Yongbin
    Lee, Hyo Jong
    [J]. SENSORS, 2016, 16 (02)
  • [6] Multi-Stream Interaction Networks for Human Action Recognition
    Wang, Haoran
    Yu, Baosheng
    Li, Jiaqi
    Zhang, Linlin
    Chen, Dongyue
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (05) : 3050 - 3060
  • [7] Multi-Stream Deep Neural Networks for RGB-D Egocentric Action Recognition
    Tang, Yansong
    Wang, Zian
    Lu, Jiwen
    Feng, Jianjiang
    Zhou, Jie
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2019, 29 (10) : 3001 - 3015
  • [8] Multimodal Multi-stream Deep Learning for Egocentric Activity Recognition
    Song, Sibo
    Chandrasekhar, Vijay
    Mandal, Bappaditya
    Li, Liyuan
    Lim, Joo-Hwee
    Babu, Giduthuri Sateesh
    San, Phyo Phyo
    Cheung, Ngai-Man
    [J]. PROCEEDINGS OF 29TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, (CVPRW 2016), 2016, : 378 - 385
  • [9] Multi-Stream Deep Similarity Learning Networks for Visual Tracking
    Li, Kunpeng
    Kong, Yu
    Fu, Yun
    [J]. PROCEEDINGS OF THE TWENTY-SIXTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2017, : 2166 - 2172
  • [10] Part-based Multi-stream Model for Vehicle Searching
    Sun, Ya
    Li, Minxian
    Lu, Jianfeng
    [J]. 2018 24TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2018, : 1372 - 1377