Efficient FPGA-based Accelerator for Post-Processing in Object Detection

被引:1
|
作者
Guo, Zibo [1 ]
Liu, Kai [1 ]
Liu, Wei [2 ]
Li, Shangrong [1 ]
机构
[1] Xidian Univ, Sch Comp Sci & Technol, Xian, Peoples R China
[2] State Key Lab Geoinformat Engn, Xian, Peoples R China
基金
中国国家自然科学基金;
关键词
Object Detection; FPGA; NMS; Post-Processing; Hardware Accelerator;
D O I
10.1109/ICFPT59805.2023.00019
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
The fast object detection algorithms such as YOLO, etc., have two main components: convolutional neural network (CNN) and post-processing. In this paper, we propose an efficient FPGA-based accelerator for the post-processing of YOLO object detection. We schedule a pipelined data path for post-processing, which fuses data scanning/caching, decoding, class identification, and non-maximum suppression (NMS) operations. This data path effectively hides the processing time of operations without data dependencies, providing up to 43x speedup over primitive serial processes. Moreover, We propose a parallel hardware architecture for NMS, allowing for parallel intersection over union (IoU) computation and threshold comparison. This architecture significantly reduces the latency of NMS, providing up to 811x speedup over primitive serial processes. Our accelerator implemented on a Xilinx Virtex-7 690t FPGA runs at 150MHz. Impressively, our accelerator achieves minimal latency of only 0.19 mu s and a processing time of merely 4.46 mu s for the postprocessing of the YOLOv3Tiny algorithm. This represents an 378x speedup compared to the Intel i7-8700 CPU with 3.2GHz. Additionally, our NMS unit exhibits a latency of 0.07 mu s, which is nearly 2x faster than the state-of-the-art FPGA acceleration method.
引用
收藏
页码:125 / 131
页数:7
相关论文
共 50 条
  • [1] FPGA-based accelerator for object detection: a comprehensive survey
    Kai Zeng
    Qian Ma
    Jia Wen Wu
    Zhe Chen
    Tao Shen
    Chenggang Yan
    The Journal of Supercomputing, 2022, 78 : 14096 - 14136
  • [2] FPGA-based accelerator for object detection: a comprehensive survey
    Zeng, Kai
    Ma, Qian
    Wu, Jia Wen
    Chen, Zhe
    Shen, Tao
    Yan, Chenggang
    JOURNAL OF SUPERCOMPUTING, 2022, 78 (12): : 14096 - 14136
  • [3] Robust and efficient post-processing for video object detection
    Sabater, Alberto
    Montesano, Luis
    Murillo, Ana C.
    2020 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2020, : 10536 - 10542
  • [4] A FPGA-Based Post-Processing and Validation Platform for Random Number Generators
    Gantel, Laurent
    Duc, Alexandre
    Steiner, Lucie
    Vannel, Fabien
    Upegui, Andres
    Gluck, Florent
    2020 IEEE 34TH INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM WORKSHOPS (IPDPSW 2020), 2020, : 123 - 126
  • [5] An FPGA-based Accelerator for Cortical Object Classification
    Park, Mi Sun
    Kestur, Srinidhi
    Sabarad, Jagdish
    Narayanan, Vijaykrishnan
    Irwin, Mary Jane
    DESIGN, AUTOMATION & TEST IN EUROPE (DATE 2012), 2012, : 691 - 696
  • [6] Efficient Hardware Post Processing of Anchor-Based Object Detection on FPGA
    Zhang, Hui
    Wu, Wei
    Ma, Yufei
    Wang, Zhongfeng
    2020 IEEE COMPUTER SOCIETY ANNUAL SYMPOSIUM ON VLSI (ISVLSI 2020), 2020, : 580 - 585
  • [7] FPGA-Based Programmable Accelerator for Hybrid Processing
    Stefan, Gheorghe M.
    Bira, Calin
    Hobincu, Radu
    Malita, Mihaela
    ROMANIAN JOURNAL OF INFORMATION SCIENCE AND TECHNOLOGY, 2016, 19 (1-2): : 148 - 165
  • [8] An Efficient FPGA-Based Accelerator Design for Convolution
    Song, Peng-Fei
    Pan, Jeng-Shyang
    Yang, Chun-Sheng
    Lee, Chiou-Yng
    2017 IEEE 8TH INTERNATIONAL CONFERENCE ON AWARENESS SCIENCE AND TECHNOLOGY (ICAST), 2017, : 494 - 500
  • [9] An Efficient FPGA-based Accelerator for Deep Forest
    Zhu, Mingyu
    Luo, Jiapeng
    Mao, Wendong
    Wang, Zhongfeng
    2022 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS (ISCAS 22), 2022, : 3334 - 3338
  • [10] An FPGA-Based YOLOv6 Accelerator for High-Throughput and Energy-Efficient Object Detection
    Sha, Xingan
    Yanagisawa, Masao
    Shi, Youhua
    IEICE TRANSACTIONS ON FUNDAMENTALS OF ELECTRONICS COMMUNICATIONS AND COMPUTER SCIENCES, 2025, E108A (03) : 473 - 481