Efficient FPGA-based Accelerator for Post-Processing in Object Detection

被引:1
|
作者
Guo, Zibo [1 ]
Liu, Kai [1 ]
Liu, Wei [2 ]
Li, Shangrong [1 ]
机构
[1] Xidian Univ, Sch Comp Sci & Technol, Xian, Peoples R China
[2] State Key Lab Geoinformat Engn, Xian, Peoples R China
基金
中国国家自然科学基金;
关键词
Object Detection; FPGA; NMS; Post-Processing; Hardware Accelerator;
D O I
10.1109/ICFPT59805.2023.00019
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
The fast object detection algorithms such as YOLO, etc., have two main components: convolutional neural network (CNN) and post-processing. In this paper, we propose an efficient FPGA-based accelerator for the post-processing of YOLO object detection. We schedule a pipelined data path for post-processing, which fuses data scanning/caching, decoding, class identification, and non-maximum suppression (NMS) operations. This data path effectively hides the processing time of operations without data dependencies, providing up to 43x speedup over primitive serial processes. Moreover, We propose a parallel hardware architecture for NMS, allowing for parallel intersection over union (IoU) computation and threshold comparison. This architecture significantly reduces the latency of NMS, providing up to 811x speedup over primitive serial processes. Our accelerator implemented on a Xilinx Virtex-7 690t FPGA runs at 150MHz. Impressively, our accelerator achieves minimal latency of only 0.19 mu s and a processing time of merely 4.46 mu s for the postprocessing of the YOLOv3Tiny algorithm. This represents an 378x speedup compared to the Intel i7-8700 CPU with 3.2GHz. Additionally, our NMS unit exhibits a latency of 0.07 mu s, which is nearly 2x faster than the state-of-the-art FPGA acceleration method.
引用
收藏
页码:125 / 131
页数:7
相关论文
共 50 条
  • [41] A Fast and Flexible FPGA-based Accelerator for Natural Language Processing Neural Networks
    Suyeon, Hur
    Na, Seongmin
    Kwon, Dongup
    Joonsung, Kim
    Boutros, Andrew
    Nurvitadhi, Eriko
    Kim, Jangwoo
    ACM TRANSACTIONS ON ARCHITECTURE AND CODE OPTIMIZATION, 2022, 20 (01)
  • [42] A Hardware Accelerator for SSD Object Detection Algorithm Based on FPGA
    Xie H.
    Cao J.
    Li P.
    Zhao X.
    Zhang X.
    Beijing Daxue Xuebao (Ziran Kexue Ban)/Acta Scientiarum Naturalium Universitatis Pekinensis, 2022, 58 (06): : 1015 - 1022
  • [43] Object Detection Post Processing Accelerator Based on Co-Design of Hardware and Software
    Yang, Dengtian
    Chen, Lan
    Hao, Xiaoran
    Zhang, Yiheng
    INFORMATION, 2025, 16 (01)
  • [44] An Energy-Efficient FPGA-Based Packet Processing Framework
    Daniel Horvath
    Imre Bertalan
    Istvan Moldovan
    Tuan Anh Trinh
    NETWORKED SERVICES AND APPLICATIONS - ENGINEERING, CONTROL AND MANAGEMENT, 2010, 6164 : 31 - +
  • [45] Comprehensive and efficient post-processing
    不详
    AIRCRAFT ENGINEERING AND AEROSPACE TECHNOLOGY, 2000, 72 (05): : 479 - 480
  • [46] An FPGA-based accelerator for Fourier Descriptors computing for color object recognition using SVM
    Fethi Smach
    Johel Miteran
    Mohamed Atri
    Julien Dubois
    Mohamed Abid
    Jean-Paul Gauthier
    Journal of Real-Time Image Processing, 2007, 2 : 249 - 258
  • [47] An FPGA-based accelerator for Fourier Descriptors computing for color object recognition using SVM
    Smach, Fethi
    Miteran, Johel
    Atri, Mohamed
    Dubois, Julien
    Abid, Mohamed
    Gauthier, Jean-Paul
    JOURNAL OF REAL-TIME IMAGE PROCESSING, 2007, 2 (04) : 249 - 258
  • [48] A Fast and Efficient FPGA-based Level Set Hardware Accelerator for Image Segmentation
    Liu Ye
    Xiao Jianbiao
    Wu Fei
    Chang Liang
    Zhou Jun
    JOURNAL OF ELECTRONICS & INFORMATION TECHNOLOGY, 2021, 43 (06) : 1525 - 1532
  • [49] A Flexible FPGA-Based Accelerator for Efficient Inference of Multi-Precision CNNs
    Liu, Xinyan
    Wu, Xiao
    Shao, Haiku
    Wang, Zhongfeng
    2024 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS, ISCAS 2024, 2024,
  • [50] Efficient FPGA-Based Transformer Accelerator Using In-Block Balanced Pruning
    Wang, Saiqun
    Zhang, Hao
    2024 13TH INTERNATIONAL CONFERENCE ON COMMUNICATIONS, CIRCUITS AND SYSTEMS, ICCCAS 2024, 2024, : 18 - 23