Vehicle color recognition based on smooth modulation neural network with multi-scale feature fusion

Cited by: 6
Authors
Hu, Mingdi [1, 2]
Bai, Long [1, 2]
Fan, Jiulun [1, 2]
Zhao, Sirui [3 ]
Chen, Enhong [3 ]
Affiliations
[1] Xian Univ Posts & Telecommun, Sch Commun & Informat Engn, Xian 710121, Peoples R China
[2] Xian Univ Posts & Telecommun, Sch Artificial Intelligence, Xian 710121, Peoples R China
[3] Univ Sci & Technol China, Sch Comp Sci & Technol, Hefei 230026, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
vehicle color recognition; benchmark dataset; multi-scale feature fusion; long-tail distribution; improved smooth L1 loss; classification
DOI
10.1007/s11704-022-1389-x
Chinese Library Classification
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
Vehicle Color Recognition (VCR) plays a vital role in intelligent traffic management and criminal investigation assistance. However, existing vehicle color datasets cover only 13 classes, which cannot meet current practical demand. Moreover, although much effort has been devoted to VCR, existing methods suffer from class imbalance in the datasets. To address these challenges, we propose a novel VCR method based on a Smooth Modulation Neural Network with Multi-Scale Feature Fusion (SMNN-MSFF). Specifically, to provide a benchmark for model training and evaluation, we first present a new VCR dataset with 24 vehicle color classes, Vehicle Color-24, consisting of 10091 vehicle images taken from 100 hours of urban road surveillance video. Then, to tackle the long-tail distribution and improve recognition performance, we propose the SMNN-MSFF model, which combines multi-scale feature fusion with smooth modulation. The former extracts feature information from local to global scales, and the latter increases the loss of tail-class instances when training on class-imbalanced data. Finally, comprehensive experimental evaluation on Vehicle Color-24 and three previous representative datasets demonstrates that the proposed SMNN-MSFF outperforms state-of-the-art VCR methods. Extensive ablation studies also show that each module of our method is effective; in particular, smooth modulation helps feature learning for the minority, or tail, classes. Vehicle Color-24 and the code of SMNN-MSFF are publicly available and can be obtained by contacting the authors.
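The abstract states that smooth modulation increases the loss of tail-class instances but does not give its formula. The sketch below is therefore not the authors' method. It illustrates the general idea with a plainly different, standard technique: inverse-frequency class weighting applied to cross-entropy. The function names, the two color labels, and the class counts are hypothetical and chosen only for illustration.

```python
import math
from collections import Counter


def inverse_frequency_weights(labels):
    """Weight each class by n_samples / (n_classes * class_count).

    Rare (tail) classes get weights above 1, frequent (head) classes
    get weights below 1, and the mean weight over all samples is 1.
    """
    counts = Counter(labels)
    n_samples = len(labels)
    n_classes = len(counts)
    return {c: n_samples / (n_classes * k) for c, k in counts.items()}


def weighted_cross_entropy(probs, label, weights):
    """Cross-entropy of one sample, scaled by the weight of its true class."""
    return -weights[label] * math.log(probs[label])


# Hypothetical long-tailed label set: 90 "white" vehicles, 10 "purple".
labels = ["white"] * 90 + ["purple"] * 10
weights = inverse_frequency_weights(labels)

# Both samples are predicted with the same confidence (0.6) in the true class,
# so the unweighted losses would be identical.
head_loss = weighted_cross_entropy({"white": 0.6, "purple": 0.4}, "white", weights)
tail_loss = weighted_cross_entropy({"white": 0.4, "purple": 0.6}, "purple", weights)

print(f"head-class loss: {head_loss:.4f}")
print(f"tail-class loss: {tail_loss:.4f}")
```

With equal prediction confidence, the tail-class sample contributes a larger loss than the head-class sample, which is the effect the abstract attributes to smooth modulation. How the paper achieves this with its improved smooth L1 loss is not specified in this record.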
Pages: 12
Related papers
50 in total
  • [21] Vehicle-logo Recognition Based on Convolutional Neural Network with Multi-scale Parallel Layers
    Zhang, Su-wen
    Zhang, Yong-hui
    Yang, Jie
    Li, Song-bin
    INTERNATIONAL CONFERENCE ON COMPUTER, MECHATRONICS AND ELECTRONIC ENGINEERING (CMEE 2016), 2016,
  • [22] A Multi-Scale Feature Fusion Based Lightweight Vehicle Target Detection Network on Aerial Optical Images
    Yu, Chengrui
    Jiang, Xiaonan
    Wu, Fanlu
    Fu, Yao
    Pei, Junyan
    Zhang, Yu
    Li, Xiangzhi
    Fu, Tianjiao
    Remote Sensing, 2024, 16 (19)
  • [23] Automatic Modulation Recognition Based on Single-channel Multi-scale Graph Neural Network
    Guo, Qiang
    Nie, Mengyun
    Qi, Liangang
    Kaliuzhnyi, Mykola
    JOURNAL OF ELECTRONICS & INFORMATION TECHNOLOGY, 2023, 45 (05) : 1575 - 1584
  • [24] A Robust Vehicle Detection Model Based on Attention and Multi-scale Feature Fusion
    Zhu, Yuxin
    Liu, Wenbo
    Yan, Fei
    Li, Jun
    2022 14TH INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS AND SIGNAL PROCESSING, WCSP, 2022, : 143 - 148
  • [25] Iris recognition based on local circular Gabor filters and multi-scale convolution feature fusion network
    Sun, Jie
    Zhao, Shipeng
    Yu, Yanan
    Wang, Xuan
    Zhou, Lijian
    MULTIMEDIA TOOLS AND APPLICATIONS, 2022, 81 (23) : 33051 - 33065
  • [26] AM-MSFF: A Pest Recognition Network Based on Attention Mechanism and Multi-Scale Feature Fusion
    Zhang, Meng
    Yang, Wenzhong
    Chen, Danny
    Fu, Chenghao
    Wei, Fuyuan
    ENTROPY, 2024, 26 (05)
  • [27] Iris recognition based on local circular Gabor filters and multi-scale convolution feature fusion network
    Jie Sun
    Shipeng Zhao
    Yanan Yu
    Xuan Wang
    Lijian Zhou
    Multimedia Tools and Applications, 2022, 81 : 33051 - 33065
  • [28] Speech emotion recognition based on multi-dimensional feature extraction and multi-scale feature fusion
    Yu, Lingli
    Xu, Fengjun
    Qu, Yundong
    Zhou, Kaijun
    APPLIED ACOUSTICS, 2024, 216
  • [29] Recognition of abnormal car door noise based on multi-scale feature fusion
    Wang, Xiaolan
    Song, Yongchao
    Su, Lili
    Wang, Yansong
    Pan, Zuofeng
    PROCEEDINGS OF THE INSTITUTION OF MECHANICAL ENGINEERS PART D-JOURNAL OF AUTOMOBILE ENGINEERING, 2023, 237 (06) : 1353 - 1364
  • [30] Multi-scale fusion public gathering recognition based on residual network
    Liu, Yicheng
    Hu, Zewei
    Nie, Haiwen
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2024, 46 (02) : 3881 - 3893