Implementation and Optimization of the Accelerator Based on FPGA Hardware for LSTM Network

被引:10
|
作者
Zhang, Yiwei [1 ]
Wang, Chao
Gong, Lei
Lu, Yuntao
Sun, Fan
Xu, Chongchong
Li, Xi
Zhou, Xuehai
机构
[1] USTC, Dept Comp Sci & Technol, Hefei 230027, Anhui, Peoples R China
基金
美国国家科学基金会;
关键词
D O I
10.1109/ISPA/IUCC.2017.00098
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Today, artificial neural networks (ANNs) are important machine learning methods which are widely used in a variety of applications. As the emerging field of ANNs, recurrent neural networks (RNNs) are often used for sequence-related applications. And Long Short-Term Memory (LSTM) is an improved RNN which contains complex computational logic. To achieve high accuracy, researchers always build large-scale LSTM networks which are time-consuming and power-consuming. Thus the acceleration of LSTM networks, low power & energy consumption become the hot issues in today's research. In this paper, we present a hardware accelerator for the LSTM neural network layer based on FPGA Zedboard and use pipeline methods to parallelize the forward computing process. To optimize our implementation, we also use multiple methods including tiled matrix-vector multiplication, binary adder tree, and overlap of computation and data access. Through the acceleration and optimization methods, our accelerator is power-efficient and has a better performance than ARM Cortex A9 processor and Intel Core i5 processor.
引用
收藏
页码:614 / 621
页数:8
相关论文
共 50 条
  • [21] Hardware implementation of neural network-based engine model using FPGA
    Saady, Marina Magdy
    Essai, Mohamed Hassan
    ALEXANDRIA ENGINEERING JOURNAL, 2022, 61 (12) : 12039 - 12050
  • [22] An Efficient FPGA-based Depthwise Separable Convolutional Neural Network Accelerator with Hardware Pruning
    Liu, Zhengyan
    Liu, Qiang
    Yan, Shun
    Cheung, Ray C. C.
    ACM TRANSACTIONS ON RECONFIGURABLE TECHNOLOGY AND SYSTEMS, 2024, 17 (01)
  • [23] FPGA based hardware implementation of Bat Algorithm
    Ben Ameur, Mohamed Sadok
    Sakly, Anis
    APPLIED SOFT COMPUTING, 2017, 58 : 378 - 387
  • [24] An FPGA-based Hardware Accelerator for Simulating Spatiotemporal Neurons
    Tarawneh, Ghaith
    Read, Jenny
    2014 21ST IEEE INTERNATIONAL CONFERENCE ON ELECTRONICS, CIRCUITS AND SYSTEMS (ICECS), 2014, : 618 - 621
  • [25] Implementation of Data-optimized FPGA-based Accelerator for Convolutional Neural Network
    Cho, Mannhee
    Kim, Youngmin
    2020 INTERNATIONAL CONFERENCE ON ELECTRONICS, INFORMATION, AND COMMUNICATION (ICEIC), 2020,
  • [26] Reconfigurable FPGA-based hardware accelerator for embedded DSP
    Rubin, G.
    Omieljanowicz, M.
    Petrovsky, A.
    MIXDES 2007: Proceedings of the 14th International Conference on Mixed Design of Integrated Circuits and Systems:, 2007, : 147 - 151
  • [27] FPGA-based hardware implementation of chaotic opposition-based arithmetic optimization algorithm
    Zermani, Mohamed Aymen
    Manita, Ghaith
    Chhabra, Amit
    Feki, Elyes
    Mami, Abdelkader
    APPLIED SOFT COMPUTING, 2024, 154
  • [28] An FPGA-Based Hardware Accelerator for Traffic Sign Detection
    Shi, Weijing
    Li, Xin
    Yu, Zhiyi
    Overett, Gary
    IEEE TRANSACTIONS ON VERY LARGE SCALE INTEGRATION (VLSI) SYSTEMS, 2017, 25 (04) : 1362 - 1372
  • [29] FPGA based Hardware Accelerator for KAZE Feature Extraction Algorithm
    Kalms, Lester
    Elhossini, Ahmed
    Juurlink, Ben
    2016 INTERNATIONAL CONFERENCE ON FIELD-PROGRAMMABLE TECHNOLOGY (FPT), 2016, : 281 - 284
  • [30] A Hardware Accelerator for SSD Object Detection Algorithm Based on FPGA
    Xie H.
    Cao J.
    Li P.
    Zhao X.
    Zhang X.
    Beijing Daxue Xuebao (Ziran Kexue Ban)/Acta Scientiarum Naturalium Universitatis Pekinensis, 2022, 58 (06): : 1015 - 1022