Real-time object detection method based on YOLOv5 and efficient mobile network

被引:2
|
作者
Feng, Shuai [1 ]
Qian, Huaming [1 ]
Wang, Huilin [1 ]
Wang, Wenna [1 ]
机构
[1] Harbin Engn Univ, Coll Intelligent Syst Sci & Engn, NanTong St, Harbin 150001, Peoples R China
关键词
Coordinate attention; MobileNetv2-CA; MSPPF; MPANet;
D O I
10.1007/s11554-024-01433-9
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The object detection algorithm YOLOv5, which is based on deep learning, experiences inefficiencies due to an overabundance of model parameters and an overly complex structure. These drawbacks hinder its deployment on mobile devices, which are constrained by their computational capabilities and storage capacities. Addressing these limitations, we introduce a lightweight object detection algorithm that harnesses the coordinate attention (CA) mechanism in synergy with the YOLOv5 framework. Our approach embeds the CA mechanism into MobileNetv2 to create MobileNetv2-CA, thereby replacing the CSDarkNet53 as YOLOv5's backbone network. This innovation not only trims the model's parameter count but also maintains a competitive level of accuracy. Further amplifying performance, we propose a multi-scale fast spatial pyramid pooling (MSPPF) layer, designed to expedite and refine the model's handling of various input image dimensions. Complementing this, we incorporate MPANet, a feature fusion network comprising optimally designed upsampling and downsample modules, along with feature extraction cells. This configuration is devised to elevate detection precision while minimizing the parameter overhead. Empirical results showcase the prowess of our methodology: we achieve a mean average precision (mAP) of 87.6% on the PASCAL VOC07+12 dataset and an average precision (AP) of 39.4% on the MS COCO dataset, with the model's parameter size being a mere 10.1MB. When compared to the original YOLOv5, our proposed model achieves a parameter reduction of 76.9% and operates at a velocity that is 1.72 times faster, reaching 54.9 frames per second (FPS) on an NVIDIA RTX3060. Versus SOTA techniques, our method demonstrates a commendable equilibrium between accuracy and real-time performance.
引用
下载
收藏
页数:13
相关论文
共 50 条
  • [1] Real-time object detection method based on YOLOv5 and efficient mobile network
    Shuai Feng
    Huaming Qian
    Huilin Wang
    Wenna Wang
    Journal of Real-Time Image Processing, 2024, 21
  • [2] Real-Time Object Tracking with YOLOv5 and Recurrent Network
    Mohammed, Al Ameri
    Memon, Qurban
    2024 7TH INTERNATIONAL CONFERENCE ON ELECTRONICS, COMMUNICATIONS, AND CONTROL ENGINEERING, ICECC 2024, 2024, : 28 - 32
  • [3] Real-Time Detection of Eichhornia crassipes Based on Efficient YOLOV5
    Qian, Yukun
    Miao, Yalun
    Huang, Shuqin
    Qiao, Xi
    Wang, Minghui
    Li, Yanzhou
    Luo, Liuming
    Zhao, Xiyong
    Cao, Long
    MACHINES, 2022, 10 (09)
  • [4] Research on Real-Time Diver Detection and Tracking Method Based on YOLOv5 and DeepSORT
    Zhao, Xinhua
    Huang, Zheng
    Lv, Yongjia
    PROCEEDINGS OF 2022 IEEE INTERNATIONAL CONFERENCE ON MECHATRONICS AND AUTOMATION (IEEE ICMA 2022), 2022, : 191 - 196
  • [5] Quantizing YOLOv5 for Real-Time Vehicle Detection
    Zhang, Zicheng
    Xu, Hongke
    Lin, Shan
    IEEE ACCESS, 2023, 11 : 145601 - 145611
  • [6] YOLOv5-R: lightweight real-time detection based on improved YOLOv5
    Ren, Jian
    Wang, Zhijie
    Zhang, Yifan
    Liao, Lei
    JOURNAL OF ELECTRONIC IMAGING, 2022, 31 (03)
  • [7] Enhanced YOLOv5: An Efficient Road Object Detection Method
    Chen, Hao
    Chen, Zhan
    Yu, Hang
    SENSORS, 2023, 23 (20)
  • [8] A Real-Time Detection Algorithm for Kiwifruit Defects Based on YOLOv5
    Yao, Jia
    Qi, Jiaming
    Zhang, Jie
    Shao, Hongmin
    Yang, Jia
    Li, Xin
    ELECTRONICS, 2021, 10 (14)
  • [9] CDNet: a real-time and robust crosswalk detection network on Jetson nano based on YOLOv5
    Zheng-De Zhang
    Meng-Lu Tan
    Zhi-Cai Lan
    Hai-Chun Liu
    Ling Pei
    Wen-Xian Yu
    Neural Computing and Applications, 2022, 34 : 10719 - 10730
  • [10] CDNet: a real-time and robust crosswalk detection network on Jetson nano based on YOLOv5
    Zhang, Zheng-De
    Tan, Meng-Lu
    Lan, Zhi-Cai
    Liu, Hai-Chun
    Pei, Ling
    Yu, Wen-Xian
    NEURAL COMPUTING & APPLICATIONS, 2022, 34 (13): : 10719 - 10730