End-to-end acceleration of the YOLO object detection framework on FPGA-only devices

Cited by: 5
Authors
Zhang, Dezheng [1 ,2 ]
Wang, Aibin [1 ,2 ]
Mo, Ruchan [1 ,2 ]
Wang, Dong [1 ,2 ]
Affiliations
[1] Beijing Jiaotong Univ, Inst Informat Sci, Beijing 100044, Peoples R China
[2] Beijing Key Lab Adv Informat Sci & Network Technol, Beijing 100044, Peoples R China
Source
NEURAL COMPUTING & APPLICATIONS | 2024 / Vol. 36 / Issue 03
Funding
Beijing Natural Science Foundation;
Keywords
Convolution neural networks (CNN); Object detection; YOLOv2; Field-programmable gate array (FPGA); High-level synthesis (HLS); Accelerator architecture; Post-processing;
DOI
10.1007/s00521-023-09078-8
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification code
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Object detection has been revolutionized by convolutional neural networks (CNNs), but their high computational complexity and heavy data access requirements make these algorithms challenging to implement on edge devices. To address this issue, we propose an efficient object detection accelerator for the YOLO series of algorithms. Our architecture exploits multiple dimensions of parallelism to accelerate the convolution computation. We employ line-buffer-based parallel data caches and dedicated data access units to minimize off-chip bandwidth pressure. In addition, the proposed design accelerates not only the convolution computation but also the control-intensive post-processing, which yields low detection latency. We evaluate the final design on a Xilinx V7-690t FPGA device, achieving a throughput of 525 GOP/s for a batch size of 1 and 914 GOP/s for a batch size of 2. Compared with state-of-the-art YOLOv2 and YOLOv3 implementations, our proposed accelerator offers up to 9x throughput improvement and 5x shorter latency.
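The abstract names line-buffer-based caching as the way off-chip traffic is reduced. The record does not describe the authors' actual hardware, so the following is only a minimal software sketch of the general line-buffer idea, assuming a 3x3 kernel, stride 1, and no padding. The function name `conv3x3_line_buffer` is hypothetical. The buffer holds only the three most recent rows, so each input row is fetched once while every 3x3 window is still formed.

```python
from collections import deque


def conv3x3_line_buffer(image, kernel):
    """Valid 3x3 cross-correlation that consumes the image one row at a time.

    Illustrative assumption only: this models a generic line buffer in
    software and is not the accelerator described in the abstract.
    """
    k = 3
    rows = deque(maxlen=k)  # line buffer: keeps only the k most recent rows
    out = []
    for row in image:  # each input row is read exactly once
        rows.append(row)
        if len(rows) < k:
            continue  # not enough rows buffered to form a window yet
        out_row = []
        for x in range(len(row) - k + 1):
            acc = 0
            for dy in range(k):
                for dx in range(k):
                    acc += rows[dy][x + dx] * kernel[dy][dx]
            out_row.append(acc)
        out.append(out_row)
    return out


if __name__ == "__main__":
    image = [[4 * r + c for c in range(4)] for r in range(4)]
    ones = [[1, 1, 1], [1, 1, 1], [1, 1, 1]]
    print(conv3x3_line_buffer(image, ones))
```

In hardware the same structure is typically built from on-chip memories feeding a small window of registers; the sketch only shows the data-reuse pattern, not parallelism or timing.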
Pages: 1067 - 1089
Number of pages: 23