Vehicle color recognition based on smooth modulation neural network with multi-scale feature fusion

被引：6

作者：

Hu, Mingdi ^{[1
,2
]}

Bai, Long ^{[1
,2
]}

Fan, Jiulun ^{[1
,2
]}

Zhao, Sirui ^{[3
]}

Chen, Enhong ^{[3
]}

机构：

[1] Xian Univ Posts & Telecommun, Sch Commun & Informat Engn, Xian 710121, Peoples R China

[2] Xian Univ Posts & Telecommun, Sch Artificial Intelligence, Xian 710121, Peoples R China

[3] Univ Sci & Technol China, Sch Comp Sci & Technol, Hefei 230026, Peoples R China

来源：

FRONTIERS OF COMPUTER SCIENCE | 2023年 / 17卷 / 03期

基金：

中国国家自然科学基金;

关键词：

vehicle color recognition; benchmark dataset; multi-scale feature fusion; long-tail distribution; improved smooth l1 loss; CLASSIFICATION;

D O I：

10.1007/s11704-022-1389-x

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Vehicle Color Recognition (VCR) plays a vital role in intelligent traffic management and criminal investigation assistance. However, the existing vehicle color datasets only cover 13 classes, which can not meet the current actual demand. Besides, although lots of efforts are devoted to VCR, they suffer from the problem of class imbalance in datasets. To address these challenges, in this paper, we propose a novel VCR method based on Smooth Modulation Neural Network with Multi-Scale Feature Fusion (SMNN-MSFF). Specifically, to construct the benchmark of model training and evaluation, we first present a new VCR dataset with 24 vehicle classes, Vehicle Color-24, consisting of 10091 vehicle images from a 100-hour urban road surveillance video. Then, to tackle the problem of long-tail distribution and improve the recognition performance, we propose the SMNN-MSFF model with multi-scale feature fusion and smooth modulation. The former aims to extract feature information from local to global, and the latter could increase the loss of the images of tail class instances for training with class-imbalance. Finally, comprehensive experimental evaluation on Vehicle Color-24 and previously three representative datasets demonstrate that our proposed SMNN-MSFF outperformed state-of-the-art VCR methods. And extensive ablation studies also demonstrate that each module of our method is effective, especially, the smooth modulation efficiently help feature learning of the minority or tail classes. Vehicle Color-24 and the code of SMNN-MSFF are publicly available and can contact the author to obtain.

引用

下载

页数：12

共 50 条

[1] Vehicle color recognition based on smooth modulation neural network with multi-scale feature fusion
Mingdi HU
Long BAI
Jiulun FAN
Sirui ZHAO
Enhong CHEN
Frontiers of Computer Science, 2023, 17 (03) : 95 - 106
[2] A multi-scale feature fusion convolutional neural network for facial expression recognition
Zhang, Xiufeng
Fu, Xingkui
Qi, Guobin
Zhang, Ning
EXPERT SYSTEMS, 2024, 41 (04)
[3] Vehicle detection method based on adaptive multi-scale feature fusion network
Shen, Xuanjing
Li, Hanyu
Huang, Yongping
Wang, Yu
JOURNAL OF ELECTRONIC IMAGING, 2022, 31 (04)
[4] A Convolutional Neural Network With Multi-scale Kernel and Feature Fusion for sEMG-based Gesture Recognition
Han, Lijun
Zou, Yongxiang
Cheng, Long
2021 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND BIOMIMETICS (IEEE-ROBIO 2021), 2021, : 774 - 779
[5] Facial Expression Recognition Based on Multi-scale Feature Fusion Convolutional Neural Network and Attention Mechanism
Wu, Yana
Jia, Kebin
Sun, Zhonghua
PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2021, PT II, 2021, 13020 : 324 - 335
[6] A Vehicle Classification Model Based on Multi-scale Feature Fusion
Wang, Xuanhong
Yang, Shiyu
Sun, Zengguo
Li, Xiaojun
Xiao, Yun
2022 41ST CHINESE CONTROL CONFERENCE (CCC), 2022, : 7180 - 7185
[7] Road Recognition Based on Multi-scale Convolutional Network with Multi-level Feature Fusion
Li, Ye
Guo, Lili
Xu, Lele
Wang, Xianfeng
Jin, Shan
TENTH INTERNATIONAL CONFERENCE ON GRAPHICS AND IMAGE PROCESSING (ICGIP 2018), 2019, 11069
[8] Multi-scale Feature Fusion Based Dongba Character Recognition
Luo, Haini
Xu, Dan
Yang, Bing
Zhang, Haoyuan
2020 5TH INTERNATIONAL CONFERENCE ON MECHANICAL, CONTROL AND COMPUTER ENGINEERING (ICMCCE 2020), 2020, : 1571 - 1575
[9] Lightweight silkworm recognition based on Multi-scale feature fusion
Wen, Chunming
Wen, Jie
Li, Jianheng
Luo, Yunyun
Chen, Minbo
Xiao, Zhanpeng
Xu, Qing
Liang, Xiang
An, Hui
COMPUTERS AND ELECTRONICS IN AGRICULTURE, 2022, 200
[10] QoS Prediction via Multi-scale Feature Fusion Based on Convolutional Neural Network
Xu, Hanzhi
Shu, Yanjun
Zhang, Zhan
Zuo, Decheng
SERVICE-ORIENTED COMPUTING, ICSOC 2023, PT I, 2023, 14419 : 119 - 134

← 1 2 3 4 5 →