Efficient Vision Transformer YOLOv5 for Accurate and Fast Traffic Sign Detection

被引:1
|
作者
Zeng, Guang [1 ]
Wu, Zhizhou [2 ,3 ,4 ]
Xu, Lipeng [1 ]
Liang, Yunyi [5 ]
机构
[1] Xinjiang Univ, Sch Intelligent Mfg Modern Ind, Urumqi 830017, Peoples R China
[2] Xinjiang Univ, Sch Traff & Transportat Engn, Urumqi 830017, Peoples R China
[3] Tongji Univ, Coll Transportat Engn, Shanghai 201804, Peoples R China
[4] Xinjiang Univ, Xinjiang Key Lab Green Construct & Smart Traff Con, Urumqi 830017, Peoples R China
[5] Tech Univ Munich, Dept Mobil Syst Engn, D-80333 Munich, Germany
基金
中国国家自然科学基金;
关键词
traffic sign detection; attention mechanism; wise-IoU; YOLOv5; efficient vision transformer; RECOGNITION; MODEL;
D O I
10.3390/electronics13050880
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Accurate and fast detection of traffic sign information is vital for autonomous driving systems. However, the YOLOv5 algorithm faces challenges with low accuracy and slow detection when it is used for traffic sign detection. To address these shortcomings, this paper introduces an accurate and fast traffic sign detection algorithm-YOLOv5-Efficient Vision TransFormer(EfficientViT)). The algorithm focuses on improving both the accuracy and speed of the model by replacing the CSPDarknet backbone of the YOLOv5(s) model with the EfficientViT network. Additionally, the algorithm incorporates the Convolutional Block Attention Module(CBAM) attention mechanism to enhance feature layer information extraction and boost the accuracy of the detection algorithm. To mitigate the adverse effects of low-quality labels on gradient generation and enhance the competitiveness of high-quality anchor frames, a superior gradient gain allocation strategy is employed. Furthermore, the strategy introduces the Wise-IoU (WIoU), a dynamic non-monotonic focusing mechanism for bounding box loss, to further enhance the accuracy and speed of the object detection algorithm. The algorithm's effectiveness is validated through experiments conducted on the 3L-TT100K traffic sign dataset, showcasing a mean average precision (mAP) of 94.1% in traffic sign detection. This mAP surpasses the performance of the YOLOv5(s) algorithm by 4.76% and outperforms the baseline algorithm. Additionally, the algorithm achieves a detection speed of 62.50 frames per second, which is much better than the baseline algorithm.
引用
收藏
页数:21
相关论文
共 50 条
  • [1] Efficient traffic sign recognition with YOLOv5
    Nacir, Omran
    Amna, Maraoui
    Imen, Werda
    Hamdi, Belgacem
    [J]. International Journal of Powertrains, 2024, 13 (03) : 269 - 286
  • [2] Improved Traffic Sign Detection Method for YOLOv5
    Wei, Qiang
    Hu, Xiaoyang
    Zhao, Hongxin
    [J]. Computer Engineering and Applications, 2023, 59 (13) : 229 - 237
  • [3] Traffic Sign Detection Based on Improved YOLOv5
    Zhou, Hua-Ping
    Xu, Chen-Chen
    Sun, Ke-Lei
    [J]. Journal of Computers (Taiwan), 2023, 34 (03) : 63 - 73
  • [4] Improved YOLOv5 Traffic Sign Detection Algorithm
    Yang, Guoliang
    Yang, Hao
    Yu, Shuaiying
    Wang, Jixiang
    Nie, Ziling
    [J]. Computer Engineering and Applications, 2023, 59 (10) : 262 - 269
  • [5] Improved Traffic Sign Detection Algorithm for YOLOv5
    Hu, Zhaohua
    Wang, Ying
    [J]. Computer Engineering and Applications, 2023, 59 (01): : 82 - 91
  • [6] Traffic Sign Detection Based on the Improved YOLOv5
    Zhang, Rongyun
    Zheng, Kunming
    Shi, Peicheng
    Mei, Ye
    Li, Haoran
    Qiu, Tian
    [J]. APPLIED SCIENCES-BASEL, 2023, 13 (17):
  • [7] Improved YOLOv5’s Traffic Sign Detection Algorithm
    Yang, Xiang
    Wang, Huabin
    Dong, Minggang
    [J]. Computer Engineering and Applications, 2023, 59 (13) : 194 - 204
  • [8] ADVERSARIAL ATTACK ON YOLOV5 FOR TRAFFIC AND ROAD SIGN DETECTION
    Jain, Sanyam
    [J]. 2024 4TH INTERNATIONAL CONFERENCE ON APPLIED ARTIFICIAL INTELLIGENCE, ICAPAI, 2024, : 73 - 77
  • [9] An Improved Traffic Sign Detection and Recognition Deep Model Based on YOLOv5
    Wang, Qianying
    Li, Xiangyu
    Lu, Ming
    [J]. IEEE ACCESS, 2023, 11 : 54679 - 54691
  • [10] YOLO-HyperVision: A vision transformer backbone-based enhancement of YOLOv5 for detection of dynamic traffic information
    Xu, Shizhou
    Zhang, Mengjie
    Chen, Jingyu
    Zhong, Yiming
    [J]. EGYPTIAN INFORMATICS JOURNAL, 2024, 27