Helmet detection method based on improved YOLOv5

被引:0
|
作者
Hou G. [1 ]
Chen Q. [1 ]
Yang Z. [1 ]
Zhang Y. [1 ]
Zhang D. [1 ]
Li H. [1 ]
机构
[1] School of Mechanics and Civil Engineering, China University of Mining and Technology(Beijing), Beijing
关键词
attention mechanism; BiFPN; helmet; target detection; YOLOv5;
D O I
10.13374/j.issn2095-9389.2022.12.07.002
中图分类号
学科分类号
摘要
To address the challenge of low detection accuracy in existing safety helmet detection algorithms, particularly in scenarios with small targets, dense environments, and complex surroundings like construction sites, tunnels, and coal mines, we introduce an enhanced object detection approach, denoted as YOLOv5-GBCW. Our methodology includes several key innovations. First, we apply Ghost convolution to overhaul the backbone network, considerably reducing model complexity, decreasing computational requirements by 48.73%, and reducing model size by 45.84% while maintaining high accuracy with only a 1.6 percentage point reduction. Second, we employ a two-way feature pyramid network (BiFPN) to enhance feature fusion, providing distinct weights to objects of varying scales. This empowers our algorithm to excel in detecting small targets. We incorporate a leap-layer connection strategy for cross-scale weight suppression and feature expression, further enhancing object detection performance. In addition, we introduce the coordinate attention module to allocate attention resources to key areas, minimizing background interference in complex environments. Finally, we propose the Beta-WIoU border loss function, employing a dynamic non-monotonic focusing mechanism to reduce the impact of simple examples on loss values. This enables the model to prioritize challenging examples like occlusions, enhancing generalization performance. We also introduce anchor box feature calculations to improve prediction accuracy and expedite model convergence. To validate our algorithm’s feasibility, we use a dataset of 7000 images collected by our research group featuring safety helmets in construction sites, tunnels, mines, and various other scenarios. We conduct comparisons with classic algorithms, including Faster RCNN, SSD, YOLOv3, YOLOv4, and YOLOv5s, along with algorithms from relevant literature. We employ adaptive Gamma transformation for image preprocessing during training to facilitate subsequent detection. Ablation experiments systematically investigate the contributions of each improvement module. Our experimental findings demonstrate that, compared to the YOLOv5s algorithm, our improved YOLOv5-GBCW achieves a remarkable average accuracy improvement of 5.8% at IOU=0.5, reaching 94.5% while maintaining a detection speed of 124.6 FPS(Frames per second). This results in a lighter model with faster convergence, considerably enhancing its detection capabilities in complex, dense, and small target environments while meeting the stringent requirements for helmet detection accuracy and real-time performance. This work introduces a novel approach for detecting safety helmets in intricate construction settings. © 2024 Science Press. All rights reserved.
引用
收藏
页码:329 / 342
页数:13
相关论文
共 26 条
  • [1] Zhang M Y, Cao Z Y, Zhao X F, Et al., On the identification of the safety helmet wearing manners for the construction company workers based on the deep learning theory, J Saf Environ, 19, 2, (2019)
  • [2] Li X D, Chen Q., Main types and prevention measures on construction production safety accidents in China, China Civ Eng J, 45, (2012)
  • [3] Mu L, Zhao H, Li Y, Et al., Vehicle recognition based on gradient compression and YOLO v4 algorithm, Chin J Eng, 44, 5, (2022)
  • [4] Liu X H, Ye X N., Skin color detection and hu moments in helmet recognition research, J East China Univ Sci Technol (Nat Sci Ed), 40, 3, (2014)
  • [5] Park M W, Elsafty N, Zhu Z H., Hardhat-wearing detection for enhancing on-site safety of construction workers, J Constr Eng Manage, 141, 9, (2015)
  • [6] Mneymneh B E, Abbas M, Khoury H., Automated hardhat detection for construction safety applications, Procedia Eng, 196, (2017)
  • [7] Girshick R, Donahue J, Darrell T, Et al., Rich feature hierarchies for accurate object detection and semantic segmentation, Proceedings of the 2014 IEEE Conference on Computer Vision and Pattern Recognition, (2014)
  • [8] Ren S, He K, Girshick R, Et al., Faster R-CNN: Towards real-time object detection with region proposal networks, IEEE Trans. Pattern Anal Mach Intell, 39, 6, (2017)
  • [9] He K M, Zhang X Y, Ren S Q, Et al., Spatial pyramid pooling in deep convolutional networks for visual recognition, IEEE Trans Pattern Anal Mach Intell, 37, 9, (2015)
  • [10] Xu S K, Wang Y R, Gu Y W, Et al., Safety helmet wearing detection study based on improved Faster RCNN, Appl Res Comput, 37, 3, (2020)