Vehicle color recognition based on smooth modulation neural network with multi-scale feature fusion

被引:6
|
作者
Hu, Mingdi [1 ,2 ]
Bai, Long [1 ,2 ]
Fan, Jiulun [1 ,2 ]
Zhao, Sirui [3 ]
Chen, Enhong [3 ]
机构
[1] Xian Univ Posts & Telecommun, Sch Commun & Informat Engn, Xian 710121, Peoples R China
[2] Xian Univ Posts & Telecommun, Sch Artificial Intelligence, Xian 710121, Peoples R China
[3] Univ Sci & Technol China, Sch Comp Sci & Technol, Hefei 230026, Peoples R China
基金
中国国家自然科学基金;
关键词
vehicle color recognition; benchmark dataset; multi-scale feature fusion; long-tail distribution; improved smooth l1 loss; CLASSIFICATION;
D O I
10.1007/s11704-022-1389-x
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Vehicle Color Recognition (VCR) plays a vital role in intelligent traffic management and criminal investigation assistance. However, the existing vehicle color datasets only cover 13 classes, which can not meet the current actual demand. Besides, although lots of efforts are devoted to VCR, they suffer from the problem of class imbalance in datasets. To address these challenges, in this paper, we propose a novel VCR method based on Smooth Modulation Neural Network with Multi-Scale Feature Fusion (SMNN-MSFF). Specifically, to construct the benchmark of model training and evaluation, we first present a new VCR dataset with 24 vehicle classes, Vehicle Color-24, consisting of 10091 vehicle images from a 100-hour urban road surveillance video. Then, to tackle the problem of long-tail distribution and improve the recognition performance, we propose the SMNN-MSFF model with multi-scale feature fusion and smooth modulation. The former aims to extract feature information from local to global, and the latter could increase the loss of the images of tail class instances for training with class-imbalance. Finally, comprehensive experimental evaluation on Vehicle Color-24 and previously three representative datasets demonstrate that our proposed SMNN-MSFF outperformed state-of-the-art VCR methods. And extensive ablation studies also demonstrate that each module of our method is effective, especially, the smooth modulation efficiently help feature learning of the minority or tail classes. Vehicle Color-24 and the code of SMNN-MSFF are publicly available and can contact the author to obtain.
引用
下载
收藏
页数:12
相关论文
共 50 条
  • [1] Vehicle color recognition based on smooth modulation neural network with multi-scale feature fusion
    Mingdi HU
    Long BAI
    Jiulun FAN
    Sirui ZHAO
    Enhong CHEN
    Frontiers of Computer Science, 2023, 17 (03) : 95 - 106
  • [2] A multi-scale feature fusion convolutional neural network for facial expression recognition
    Zhang, Xiufeng
    Fu, Xingkui
    Qi, Guobin
    Zhang, Ning
    EXPERT SYSTEMS, 2024, 41 (04)
  • [3] Vehicle detection method based on adaptive multi-scale feature fusion network
    Shen, Xuanjing
    Li, Hanyu
    Huang, Yongping
    Wang, Yu
    JOURNAL OF ELECTRONIC IMAGING, 2022, 31 (04)
  • [4] A Convolutional Neural Network With Multi-scale Kernel and Feature Fusion for sEMG-based Gesture Recognition
    Han, Lijun
    Zou, Yongxiang
    Cheng, Long
    2021 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND BIOMIMETICS (IEEE-ROBIO 2021), 2021, : 774 - 779
  • [5] Facial Expression Recognition Based on Multi-scale Feature Fusion Convolutional Neural Network and Attention Mechanism
    Wu, Yana
    Jia, Kebin
    Sun, Zhonghua
    PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2021, PT II, 2021, 13020 : 324 - 335
  • [6] A Vehicle Classification Model Based on Multi-scale Feature Fusion
    Wang, Xuanhong
    Yang, Shiyu
    Sun, Zengguo
    Li, Xiaojun
    Xiao, Yun
    2022 41ST CHINESE CONTROL CONFERENCE (CCC), 2022, : 7180 - 7185
  • [7] Road Recognition Based on Multi-scale Convolutional Network with Multi-level Feature Fusion
    Li, Ye
    Guo, Lili
    Xu, Lele
    Wang, Xianfeng
    Jin, Shan
    TENTH INTERNATIONAL CONFERENCE ON GRAPHICS AND IMAGE PROCESSING (ICGIP 2018), 2019, 11069
  • [8] Multi-scale Feature Fusion Based Dongba Character Recognition
    Luo, Haini
    Xu, Dan
    Yang, Bing
    Zhang, Haoyuan
    2020 5TH INTERNATIONAL CONFERENCE ON MECHANICAL, CONTROL AND COMPUTER ENGINEERING (ICMCCE 2020), 2020, : 1571 - 1575
  • [9] Lightweight silkworm recognition based on Multi-scale feature fusion
    Wen, Chunming
    Wen, Jie
    Li, Jianheng
    Luo, Yunyun
    Chen, Minbo
    Xiao, Zhanpeng
    Xu, Qing
    Liang, Xiang
    An, Hui
    COMPUTERS AND ELECTRONICS IN AGRICULTURE, 2022, 200
  • [10] QoS Prediction via Multi-scale Feature Fusion Based on Convolutional Neural Network
    Xu, Hanzhi
    Shu, Yanjun
    Zhang, Zhan
    Zuo, Decheng
    SERVICE-ORIENTED COMPUTING, ICSOC 2023, PT I, 2023, 14419 : 119 - 134