YOLOv4-5D: An Effective and Efficient Object Detector for Autonomous Driving

被引:198
|
作者
Cai, Yingfeng [1 ]
Luan, Tianyu [2 ]
Gao, Hongbo [3 ]
Wang, Hai [2 ]
Chen, Long [1 ]
Li, Yicheng [1 ]
Sotelo, Miguel Angel [4 ]
Li, Zhixiong [5 ]
机构
[1] Jiangsu Univ, Automot Engn Res Inst, Zhenjiang 212013, Jiangsu, Peoples R China
[2] Jiangsu Univ, Sch Automot & Traff Engn, Zhenjiang 212013, Jiangsu, Peoples R China
[3] Univ Sci & Technol China, Dept Automat, Hefei 230026, Peoples R China
[4] Univ Alcal, Dept Comp Engn, Madrid 28801, Spain
[5] Yonsei Univ, Yonsei Frontier Lab, Seoul 03722, South Korea
基金
中国国家自然科学基金;
关键词
Autonomous driving; CSPDarknet53; object detection; Pruning; YOLOv4;
D O I
10.1109/TIM.2021.3065438
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
The use of object detection algorithms has become extremely important in autonomous vehicles. Object detection at high accuracy and a fast inference speed is essential for safe autonomous driving. Therefore, the balance between the effectiveness and efficiency of the object detector must be considered. This article proposes a one-stage object detection framework for improving the detection accuracy while supporting a true real-time operation based on the YOLOv4. The backbone network in the proposed framework is the CSPDarknet53_dcn(P). The last output layer in the CSPDarknet53 is replaced with deformable convolution to improve the detection accuracy. In order to perform feature fusion, a new feature fusion module PAN++ is designed and five scales detection layers are used to improve the detection accuracy of small objects. In addition, this article proposes an optimized network pruning algorithm to solve the problem that the real-time performance of the algorithm cannot be satisfied due to the limited computing resources of the vehicle-mounted computing platform. The method of sparse scaling factor is used to improve the existing channel pruning algorithm. Compared to the YOLOv4, the YOLOV4-5D improves the mean average precision by 4.23% on the BDD data sets and 1.68% on the KITTI data sets. Finally, by pruning the model, the inference speed of YOLOV4-5D is increased 31.3% and the memory is only 98.1 MB when the detection accuracy is almost unchanged. Nevertheless, the proposed algorithm is capable of real-time detection at faster than 66 frames/s (fps) and shows higher accuracy than the previous approaches with a similar fps.
引用
收藏
页数:13
相关论文
共 50 条
  • [31] AEC3D: An Efficient and Compact Single Stage 3D Multiobject Detector for Autonomous Driving
    Hoang Duy Loc
    Kim, Gon-Woo
    [J]. IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2022, 23 (12) : 23422 - 23432
  • [32] Efficient Adversarial Attack Strategy Against 3D Object Detection in Autonomous Driving Systems
    Chen, Hai
    Yan, Huanqian
    Yang, Xiao
    Su, Hang
    Zhao, Shu
    Qian, Fulan
    [J]. IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2024,
  • [33] Monocular 3D Object Detection for Autonomous Driving
    Chen, Xiaozhi
    Kundu, Kaustav
    Zhang, Ziyu
    Ma, Huimin
    Fidler, Sanja
    Urtasun, Raquel
    [J]. 2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 2147 - 2156
  • [34] 3D Object Detection for Autonomous Driving: A Survey
    Qian, Rui
    Lai, Xin
    Li, Xirong
    [J]. PATTERN RECOGNITION, 2022, 130
  • [35] RGBD-SLAM Based on Object Detection With Two-Stream YOLOv4-MobileNetv3 in Autonomous Driving
    Li, Gongfa
    Fan, Hanwen
    Jiang, Guozhang
    Jiang, Du
    Liu, Yuting
    Tao, Bo
    Yun, Juntong
    [J]. IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2024, 25 (03) : 2847 - 2857
  • [36] Effective Face Detector Based on YOLOv5 and Superresolution Reconstruction
    Xu, Qingqing
    Zhu, Zhiyu
    Ge, Huilin
    Zhang, Zheqing
    Zang, Xu
    [J]. COMPUTATIONAL AND MATHEMATICAL METHODS IN MEDICINE, 2021, 2021
  • [37] THDet: A Lightweight and Efficient Traffic Helmet Object Detector based on YOLOv8
    Li, Yi
    Xu, Huiying
    Zhu, Xinzhong
    Huang, Xiao
    Li, Hongbo
    [J]. DIGITAL SIGNAL PROCESSING, 2024, 155
  • [38] A LOW POWER HARDWARE IMPLEMENTATION OF MULTI-OBJECT DPM DETECTOR FOR AUTONOMOUS DRIVING
    Ali, Alaa
    Olaleye, Oladiran G.
    Dey, Bappaditya
    Khalil, Kasem
    Bayoumi, Magdy A.
    [J]. 2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 1937 - 1941
  • [39] Small Object Detector Using Contextual Local Features and Global Representations for Autonomous Driving
    Wu, Xuke
    Bin Tian
    Xiong, Gang
    Song, Bing
    Ye, Peijun
    Zhu, Fenghua
    [J]. 2023 IEEE 26TH INTERNATIONAL CONFERENCE ON INTELLIGENT TRANSPORTATION SYSTEMS, ITSC, 2023, : 4396 - 4401
  • [40] PEPillar: a point-enhanced pillar network for efficient 3D object detection in autonomous driving
    Sun, Libo
    Li, Yifan
    Qin, Wenhu
    [J]. VISUAL COMPUTER, 2024,