Optimizing Bayesian Recurrent Neural Networks on an FPGA-based Accelerator

被引：4

作者：

Ferianc, Martin ^{[1
]}

Que, Zhiqiang ^{[2
]}

Fan, Hongxiang ^{[2
]}

Luk, Wayne ^{[2
]}

Rodrigues, Miguel ^{[1
]}

机构：

[1] UCL, Dept Elect & Elect Engn, London, England

[2] Imperial Coll London, Dept Comp, London, England

来源：

2021 INTERNATIONAL CONFERENCE ON FIELD-PROGRAMMABLE TECHNOLOGY (ICFPT) | 2021年

基金：

英国工程与自然科学研究理事会;

关键词：

Recurrent neural networks; Bayesian inference; Field-programmable gate array; Hardware acceleration;

D O I：

10.1109/ICFPT52863.2021.9609847

中图分类号：

TP39 [计算机的应用];

学科分类号：

081203 ; 0835 ;

摘要：

Neural networks have demonstrated their outstanding performance in a wide range of tasks. Specifically recurrent architectures based on long-short term memory (LSTM) cells have manifested excellent capability to model time dependencies in real-world data. However, standard recurrent architectures cannot estimate their uncertainty which is essential for safety-critical applications such as in medicine. In contrast, Bayesian recurrent neural networks (RNNs) are able to provide uncertainty estimation with improved accuracy. Nonetheless, Bayesian RNNs are computationally and memory demanding, which limits their practicality despite their advantages. To address this issue, we propose an FPGA-based hardware design to accelerate Bayesian LSTM-based RNNs. To further improve the overall algorithmic-hardware performance, a co-design framework is proposed to explore the most fitting algorithmic-hardware configurations for Bayesian RNNs. We conduct extensive experiments on healthcare applications to demonstrate the improvement of our design and the effectiveness of our framework. Compared with GPU implementation, our FPGA-based design can achieve up to 10 times speedup with nearly 106 times higher energy efficiency. To the best of our knowledge, this is the first work targeting acceleration of Bayesian RNNs on FPGAs.

引用

页码：19 / 28

页数：10

共 50 条

[1] High-Performance FPGA-based Accelerator for Bayesian Neural Networks
Fan, Hongxiang
Ferianc, Martin
Rodrigues, Miguel
Zhou, Hongyu
Niu, Xinyu
Luk, Wayne
[J]. 2021 58TH ACM/IEEE DESIGN AUTOMATION CONFERENCE (DAC), 2021, : 1063 - 1068
[2] Optimizing FPGA-based Convolutional Neural Networks Accelerator for Image Super-Resolution
Chang, Jung-Woo
Kang, Suk-Ju
[J]. 2018 23RD ASIA AND SOUTH PACIFIC DESIGN AUTOMATION CONFERENCE (ASP-DAC), 2018, : 343 - 348
[3] Optimizing a FPGA-based Neural Accelerator for Small IoT Devices
Hong, Seongmin
Lee, Inho
Park, Yongjun
[J]. 2018 INTERNATIONAL CONFERENCE ON ELECTRONICS, INFORMATION, AND COMMUNICATION (ICEIC), 2018, : 176 - 177
[4] Implementation of FPGA-based Accelerator for Deep Neural Networks
Tsai, Tsung-Han
Ho, Yuan-Chen
Sheu, Ming-Hwa
[J]. 2019 IEEE 22ND INTERNATIONAL SYMPOSIUM ON DESIGN AND DIAGNOSTICS OF ELECTRONIC CIRCUITS & SYSTEMS (DDECS), 2019,
[5] FPGA-based Accelerator for Long Short-Term Memory Recurrent Neural Networks
Guan, Yijin
Yuan, Zhihang
Sun, Guangyu
Cong, Jason
[J]. 2017 22ND ASIA AND SOUTH PACIFIC DESIGN AUTOMATION CONFERENCE (ASP-DAC), 2017, : 629 - 634
[6] FPGA-Based Acceleration for Bayesian Convolutional Neural Networks
Fan, Hongxiang
Ferianc, Martin
Que, Zhiqiang
Liu, Shuanglong
Niu, Xinyu
Rodrigues, Miguel R. D.
Luk, Wayne
[J]. IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS, 2022, 41 (12) : 5343 - 5356
[7] A FPGA-based Hardware Accelerator for Bayesian Confidence Propagation Neural Network
Liu, Lizheng
Wang, Deyu
Wang, Yuning
Lansner, Anders
Hemani, Ahmed
Yang, Yu
Hu, Xiaoming
Zou, Zhuo
Zheng, Lirong
[J]. 2020 IEEE NORDIC CIRCUITS AND SYSTEMS CONFERENCE (NORCAS), 2020,
[8] FPGA-based Accelerator for Losslessly Quantized Convolutional Neural Networks
Sit, Mankit
Kazami, Ryosuke
Amano, Hideharu
[J]. 2017 INTERNATIONAL CONFERENCE ON FIELD PROGRAMMABLE TECHNOLOGY (ICFPT), 2017, : 295 - 298
[9] Customizable FPGA-based Accelerator for Binarized Graph Neural Networks
Wang, Ziwei
Que, Zhiqiang
Luk, Wayne
Fan, Hongxiang
[J]. 2022 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS (ISCAS 22), 2022, : 1968 - 1972
[10] An FPGA-based Accelerator Implementation for Deep Convolutional Neural Networks
Zhou, Yongmei
Jiang, Jingfei
[J]. PROCEEDINGS OF 2015 4TH INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND NETWORK TECHNOLOGY (ICCSNT 2015), 2015, : 829 - 832

← 1 2 3 4 5 →