A Parametrizable High-Level Synthesis Library for Accelerating Neural Networks on FPGAs

Authors
Lester Kalms
Pedram Amini Rad
Muhammad Ali
Arsany Iskander
Diana Göhringer
Affiliations
[1] Technische Universität Dresden
[2] German University in Cairo
Keywords
High-level synthesis; Neural networks; FPGA; Hardware acceleration; Library
DOI
Not available
Abstract
In recent years, Convolutional Neural Networks (CNNs) have been incorporated into a large number of applications, including multimedia retrieval and image classification. However, CNN-based algorithms are computationally and resource intensive and are therefore difficult to use in embedded systems. FPGA-based accelerators are becoming increasingly popular in research and industry due to their flexibility and energy efficiency. However, the available resources and the size of the on-chip memory can limit the performance of an FPGA accelerator for CNNs. This work proposes a High-Level Synthesis (HLS) library for CNN algorithms. It contains seven different streaming-capable CNN functions (plus two conversion functions) for creating large neural networks with deep pipelines. The functions offer many parameter settings (e.g., for resolution, feature maps, data types, kernel size, parallelization, and accuracy), which also enable compile-time optimizations. Our functions are integrated into the HiFlipVX library, an open-source HLS FPGA library for image processing and object detection. This makes it possible to implement different types of computer vision applications with one library. Due to the various configuration and parallelization possibilities of the library functions, it is possible to implement a high-performance, scalable, and resource-efficient system, as our evaluation of the MobileNets algorithm shows.
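As a rough illustration of what a streaming-capable function with compile-time parameters could look like, the following is a minimal C++ sketch of a pointwise (1x1) convolution. Every name here (`pointwise_conv_stream`, `PIXELS`, `IFM`, `OFM`, `PAR`) is hypothetical and is not taken from HiFlipVX. `std::queue` merely stands in for a hardware stream, and the sketch assumes that the parallelization degree divides the number of output feature maps.

```cpp
#include <array>
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <queue>

// Hypothetical sketch, not the HiFlipVX API.
// T      : data type of activations and weights (assumed integer or float)
// PIXELS : number of pixels in the image (resolution)
// IFM    : number of input feature maps per pixel
// OFM    : number of output feature maps per pixel
// PAR    : output feature maps computed per step (parallelization degree)
template <typename T, std::size_t PIXELS, std::size_t IFM,
          std::size_t OFM, std::size_t PAR>
void pointwise_conv_stream(std::queue<T>& in,
                           const std::array<std::array<T, IFM>, OFM>& weights,
                           std::queue<T>& out) {
    // Checked at compile time, so an invalid configuration never builds.
    static_assert(PAR > 0 && OFM % PAR == 0, "PAR must divide OFM");

    for (std::size_t p = 0; p < PIXELS; ++p) {
        // Read all input feature maps of one pixel from the input stream.
        std::array<T, IFM> pixel{};
        for (std::size_t i = 0; i < IFM; ++i) {
            assert(!in.empty());
            pixel[i] = in.front();
            in.pop();
        }
        // Compute output feature maps in groups of PAR. An HLS tool could
        // unroll the loop over k, since PAR is a compile-time constant.
        for (std::size_t o = 0; o < OFM; o += PAR) {
            std::array<T, PAR> acc{};
            for (std::size_t k = 0; k < PAR; ++k) {
                for (std::size_t i = 0; i < IFM; ++i) {
                    acc[k] += weights[o + k][i] * pixel[i];
                }
            }
            // Write the finished group to the output stream.
            for (std::size_t k = 0; k < PAR; ++k) {
                out.push(acc[k]);
            }
        }
    }
}
```

A call such as `pointwise_conv_stream<std::int32_t, 2, 2, 2, 2>(in, w, out)` fixes the resolution, feature map counts, data type, and parallelization at compile time. Because the output of one such function can feed the input stream of the next, several of them could be chained into a deep pipeline.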
Pages: 513-529
Number of pages: 16