Algorithm-Hardware Co-Design of Single Shot Detector for Fast Object Detection on FPGAs

被引:25
|
作者
Ma, Yufei [1 ]
Zheng, Tu [2 ]
Cao, Yu [1 ]
Vrudhula, Sarma [3 ]
Seo, Jae-sun [1 ]
机构
[1] Arizona State Univ, Sch Elect Comp & Energy Engn, Tempe, AZ 85281 USA
[2] Zhejiang Univ, Coll Comp Sci & Technol, Hangzhou, Zhejiang, Peoples R China
[3] Arizona State Univ, Sch Comp Informat & Decis Syst Engn, Tempe, AZ USA
基金
美国国家科学基金会;
关键词
Hardware Accelerator; FPGA; Neural Network; HW/SW Co -design;
D O I
10.1145/3240765.3240775
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
The rapid improvement in computation capability has made convolutional neural networks (CNNs) a great success in recent years on image classification tasks, which has also prospered the development of objection detection algorithms with significantly improved accuracy. However, during the deployment phase, many applications demand low latency processing of one image with strict power consumption requirement, which reduces the efficiency of GPU and other general-purpose platform, bringing opportunities for specific acceleration hardware, e.g. FPGA, by customizing the digital circuit specific for the inference algorithm. Therefore, this work proposes to customize the detection algorithm, e.g. SSD, to benefit its hardware implementation with low data precision at the cost of marginal accuracy degradation. The proposed FPGA-based deep learning inference accelerator is demonstrated on two Intel FPGAs for SSD algorithm achieving up to 2.18 TOPS throughput and up to 3.3x superior energy efficiency compared to GPO.
引用
收藏
页数:8
相关论文
共 50 条
  • [41] Bringing Powerful Machine-Learning Systems to Daily-Life Devices via Algorithm-Hardware Co-Design
    Lin, Yingyan
    [J]. 2020 INTERNATIONAL SYMPOSIUM ON VLSI DESIGN, AUTOMATION AND TEST (VLSI-DAT), 2020,
  • [42] HDSuper: Algorithm-Hardware Co-design for Light-weight High-quality Super-Resolution Accelerator
    Chang, Liang
    Zhao, Xin
    Fan, Dongqi
    Hu, Zhicheng
    Zhou, Jun
    [J]. 2023 60TH ACM/IEEE DESIGN AUTOMATION CONFERENCE, DAC, 2023,
  • [43] FABLE-DTS: Hardware-Software Co-Design of a Fast and Stable Data Transmission System for FPGAs
    Gao, Jia-Bao
    Wang, Jian
    Arafin, Md Tanvir
    Lai, Jin-Mei
    [J]. 2020 IEEE 33RD INTERNATIONAL SYSTEM-ON-CHIP CONFERENCE (SOCC), 2020, : 207 - 212
  • [44] Single-shot augmentation detector for object detection
    Jiaxu Leng
    Ying Liu
    [J]. Neural Computing and Applications, 2021, 33 : 3583 - 3596
  • [45] Single-shot augmentation detector for object detection
    Leng, Jiaxu
    Liu, Ying
    [J]. NEURAL COMPUTING & APPLICATIONS, 2021, 33 (08): : 3583 - 3596
  • [46] INCEPTION SINGLE SHOT MULTIBOX DETECTOR FOR OBJECT DETECTION
    Ning, Chengcheng
    Zhou, Huajun
    Song, Yan
    Tang, Jinhui
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA & EXPO WORKSHOPS (ICMEW), 2017,
  • [47] Hardware/Software co-design of a key point detector on FPGA
    Chati, H. Diakou
    Muehlbauer, F.
    Braun, T.
    Bobda, C.
    Berns, K.
    [J]. FCCM 2007: 15TH ANNUAL IEEE SYMPOSIUM ON FIELD-PROGRAMMABLE CUSTOM COMPUTING MACHINES, PROCEEDINGS, 2007, : 355 - +
  • [48] An object detection algorithm based on the feature pyramid network and single shot multibox detector
    Wang, Yanni
    Liu, Xiang
    Guo, Rongchun
    [J]. CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2022, 25 (05): : 3313 - 3324
  • [49] An object detection algorithm based on the feature pyramid network and single shot multibox detector
    Yanni Wang
    Xiang Liu
    Rongchun Guo
    [J]. Cluster Computing, 2022, 25 : 3313 - 3324
  • [50] HW/SW co-design of reconfigurable hardware-based genetic algorithm in FPGAs applicable to a variety of problems
    Nambiar, Vishnu P.
    Balakrishnan, Sathivellu
    Khalil-Hani, Mohamed
    Marsono, M. N.
    [J]. COMPUTING, 2013, 95 (09) : 863 - 896