Vehicle color recognition based on smooth modulation neural network with multi-scale feature fusion

被引:6
|
作者
Hu, Mingdi [1 ,2 ]
Bai, Long [1 ,2 ]
Fan, Jiulun [1 ,2 ]
Zhao, Sirui [3 ]
Chen, Enhong [3 ]
机构
[1] Xian Univ Posts & Telecommun, Sch Commun & Informat Engn, Xian 710121, Peoples R China
[2] Xian Univ Posts & Telecommun, Sch Artificial Intelligence, Xian 710121, Peoples R China
[3] Univ Sci & Technol China, Sch Comp Sci & Technol, Hefei 230026, Peoples R China
基金
中国国家自然科学基金;
关键词
vehicle color recognition; benchmark dataset; multi-scale feature fusion; long-tail distribution; improved smooth l1 loss; CLASSIFICATION;
D O I
10.1007/s11704-022-1389-x
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Vehicle Color Recognition (VCR) plays a vital role in intelligent traffic management and criminal investigation assistance. However, the existing vehicle color datasets only cover 13 classes, which can not meet the current actual demand. Besides, although lots of efforts are devoted to VCR, they suffer from the problem of class imbalance in datasets. To address these challenges, in this paper, we propose a novel VCR method based on Smooth Modulation Neural Network with Multi-Scale Feature Fusion (SMNN-MSFF). Specifically, to construct the benchmark of model training and evaluation, we first present a new VCR dataset with 24 vehicle classes, Vehicle Color-24, consisting of 10091 vehicle images from a 100-hour urban road surveillance video. Then, to tackle the problem of long-tail distribution and improve the recognition performance, we propose the SMNN-MSFF model with multi-scale feature fusion and smooth modulation. The former aims to extract feature information from local to global, and the latter could increase the loss of the images of tail class instances for training with class-imbalance. Finally, comprehensive experimental evaluation on Vehicle Color-24 and previously three representative datasets demonstrate that our proposed SMNN-MSFF outperformed state-of-the-art VCR methods. And extensive ablation studies also demonstrate that each module of our method is effective, especially, the smooth modulation efficiently help feature learning of the minority or tail classes. Vehicle Color-24 and the code of SMNN-MSFF are publicly available and can contact the author to obtain.
引用
下载
收藏
页数:12
相关论文
共 50 条
  • [31] Multi-Scale Feature Fusion Convolutional Neural Network for Indoor Small Target Detection
    Huang, Li
    Chen, Cheng
    Yun, Juntong
    Sun, Ying
    Tian, Jinrong
    Hao, Zhiqiang
    Yu, Hui
    Ma, Hongjie
    FRONTIERS IN NEUROROBOTICS, 2022, 16
  • [32] A convolutional neural network model of multi-scale feature fusion: MFF-Net
    Yi, Yunyun
    Wang, Jinbao
    Ding, Xingtao
    Li, Chenlong
    JOURNAL OF COMPUTATIONAL METHODS IN SCIENCES AND ENGINEERING, 2022, 22 (06) : 2217 - 2225
  • [33] Colon polyp detection based on multi-scale and multi-level feature fusion and lightweight convolutional neural network
    Li, Yiyang
    Zhao, Jiayi
    Yu, Ruoyi
    Liu, Huixiang
    Liang, Shuang
    Gu, Yu
    Shengwu Yixue Gongchengxue Zazhi/Journal of Biomedical Engineering, 2024, 41 (05): : 911 - 918
  • [34] Multi-Scale Feature For Recognition
    Lei, Songze
    Hao, Chongyang
    Qi, Min
    ICECT: 2009 INTERNATIONAL CONFERENCE ON ELECTRONIC COMPUTER TECHNOLOGY, PROCEEDINGS, 2009, : 277 - 280
  • [35] MFFLNet: lightweight semantic segmentation network based on multi-scale feature fusion
    Wei Depeng
    Wang Huabin
    Multimedia Tools and Applications, 2024, 83 : 30073 - 30093
  • [36] MFFLNet: lightweight semantic segmentation network based on multi-scale feature fusion
    Wei Depeng
    Wang Huabin
    MULTIMEDIA TOOLS AND APPLICATIONS, 2024, 83 (10) : 30073 - 30093
  • [37] Liver segmentation network based on detail enhancement and multi-scale feature fusion
    Lu Tinglan
    Qin Jun
    Qin Guihe
    Shi Weili
    Zhang Wentao
    Scientific Reports, 15 (1)
  • [38] Underwater Image Enhancement Based on Multi-Scale Feature Fusion and Attention Network
    Liu Y.
    Liu M.
    Lin S.
    Tao Z.
    Jisuanji Fuzhu Sheji Yu Tuxingxue Xuebao/Journal of Computer-Aided Design and Computer Graphics, 2023, 35 (05): : 685 - 695
  • [39] MCANet: multi-scale contextual feature fusion network based on Atrous convolution
    Ke Li
    ZhanDong Liu
    Multimedia Tools and Applications, 2023, 82 : 34679 - 34702
  • [40] Neural Path Planning With Multi-Scale Feature Fusion Networks
    Jin, Xiang
    Lan, Wei
    Chang, Xin
    IEEE ACCESS, 2022, 10 : 118176 - 118186