Efficient FPGA-based Accelerator for Post-Processing in Object Detection

被引:1
|
作者
Guo, Zibo [1 ]
Liu, Kai [1 ]
Liu, Wei [2 ]
Li, Shangrong [1 ]
机构
[1] Xidian Univ, Sch Comp Sci & Technol, Xian, Peoples R China
[2] State Key Lab Geoinformat Engn, Xian, Peoples R China
基金
中国国家自然科学基金;
关键词
Object Detection; FPGA; NMS; Post-Processing; Hardware Accelerator;
D O I
10.1109/ICFPT59805.2023.00019
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
The fast object detection algorithms such as YOLO, etc., have two main components: convolutional neural network (CNN) and post-processing. In this paper, we propose an efficient FPGA-based accelerator for the post-processing of YOLO object detection. We schedule a pipelined data path for post-processing, which fuses data scanning/caching, decoding, class identification, and non-maximum suppression (NMS) operations. This data path effectively hides the processing time of operations without data dependencies, providing up to 43x speedup over primitive serial processes. Moreover, We propose a parallel hardware architecture for NMS, allowing for parallel intersection over union (IoU) computation and threshold comparison. This architecture significantly reduces the latency of NMS, providing up to 811x speedup over primitive serial processes. Our accelerator implemented on a Xilinx Virtex-7 690t FPGA runs at 150MHz. Impressively, our accelerator achieves minimal latency of only 0.19 mu s and a processing time of merely 4.46 mu s for the postprocessing of the YOLOv3Tiny algorithm. This represents an 378x speedup compared to the Intel i7-8700 CPU with 3.2GHz. Additionally, our NMS unit exhibits a latency of 0.07 mu s, which is nearly 2x faster than the state-of-the-art FPGA acceleration method.
引用
收藏
页码:125 / 131
页数:7
相关论文
共 50 条
  • [21] Energy Efficient FPGA-Based Accelerator for Dynamic Sparse Transformer
    Li, Zuohao
    Lai, Yiwan
    Zhang, Hao
    2024 13TH INTERNATIONAL CONFERENCE ON COMMUNICATIONS, CIRCUITS AND SYSTEMS, ICCCAS 2024, 2024, : 7 - 12
  • [22] An FPGA-Based Energy-Efficient Reconfigurable Convolutional Neural Network Accelerator for Object Recognition Applications
    Li, Jixuan
    Un, Ka-Fai
    Yu, Wei-Han
    Mak, Pui-In
    Martins, Rui P.
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS II-EXPRESS BRIEFS, 2021, 68 (09) : 3143 - 3147
  • [23] An Efficient Implementation of FPGA-based Object Detection Using Multi-scale Attention
    Furuta, Masanori
    Ban, Koichiro
    Kobayashi, Daisuke
    Shibata, Tomoyuki
    2021 IEEE INTERNATIONAL MIDWEST SYMPOSIUM ON CIRCUITS AND SYSTEMS (MWSCAS), 2021, : 321 - 325
  • [24] FPGA-based Object Detection for Autonomous Driving System
    Harada, Kenichi
    Kanazawa, Kenji
    Yasunaga, Moritoshi
    2019 INTERNATIONAL CONFERENCE ON FIELD-PROGRAMMABLE TECHNOLOGY (ICFPT 2019), 2019, : 465 - 468
  • [25] PBDE: an effective post-processing method based on box density for object detection
    Zhishan Li
    Baozhi Jia
    Yifan He
    Lei Xie
    Applied Intelligence, 2022, 52 : 2930 - 2941
  • [26] FPGA-based object detection in robot soccer application
    Kaulmann, T
    Strünkmann, M
    Witkowski, U
    PROCEEDINGS OF THE 3RD INTERNATIONAL SYMPOSIUM ON AUTONOMOUS MINIROBOTS FOR RESEARCH AND EDUTAINMENT (AMIRE 2005), 2006, : 135 - +
  • [27] PBDE: an effective post-processing method based on box density for object detection
    Li, Zhishan
    Jia, Baozhi
    He, Yifan
    Xie, Lei
    APPLIED INTELLIGENCE, 2022, 52 (03) : 2930 - 2941
  • [28] Design an Efficient FPGA-Based Accelerator for Leveled BFV Homomorphic Encryption
    Kong, Liang
    Qin, Guojie
    Li, Shuguo
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS II-EXPRESS BRIEFS, 2024, 71 (03) : 1381 - 1385
  • [29] An Efficient FPGA-Based Accelerator for Perceptual Weighting Filter in Speech Coding
    Singh, Dilip
    Chandel, Rajeevan
    IETE TECHNICAL REVIEW, 2024, 41 (04) : 441 - 453
  • [30] Energy Efficient FPGA-Based Binary Transformer Accelerator for Edge Devices
    Du, Congpeng
    Ko, Seok-Bum
    Zhang, Hao
    2024 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS, ISCAS 2024, 2024,