Implementation and Optimization of the Accelerator Based on FPGA Hardware for LSTM Network

被引:10
|
作者
Zhang, Yiwei [1 ]
Wang, Chao
Gong, Lei
Lu, Yuntao
Sun, Fan
Xu, Chongchong
Li, Xi
Zhou, Xuehai
机构
[1] USTC, Dept Comp Sci & Technol, Hefei 230027, Anhui, Peoples R China
基金
美国国家科学基金会;
关键词
D O I
10.1109/ISPA/IUCC.2017.00098
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Today, artificial neural networks (ANNs) are important machine learning methods which are widely used in a variety of applications. As the emerging field of ANNs, recurrent neural networks (RNNs) are often used for sequence-related applications. And Long Short-Term Memory (LSTM) is an improved RNN which contains complex computational logic. To achieve high accuracy, researchers always build large-scale LSTM networks which are time-consuming and power-consuming. Thus the acceleration of LSTM networks, low power & energy consumption become the hot issues in today's research. In this paper, we present a hardware accelerator for the LSTM neural network layer based on FPGA Zedboard and use pipeline methods to parallelize the forward computing process. To optimize our implementation, we also use multiple methods including tiled matrix-vector multiplication, binary adder tree, and overlap of computation and data access. Through the acceleration and optimization methods, our accelerator is power-efficient and has a better performance than ARM Cortex A9 processor and Intel Core i5 processor.
引用
收藏
页码:614 / 621
页数:8
相关论文
共 50 条
  • [1] FPGA Hardware Implementation and Optimization for Neural Network based Chaotic System Design
    Schmitz, Jesse
    Zhang, Lei
    HEART 2018: PROCEEDINGS OF THE 9TH INTERNATIONAL SYMPOSIUM ON HIGHLY-EFFICIENT ACCELERATORS AND RECONFIGURABLE TECHNOLOGIES, 2018,
  • [2] Hardware Accelerator Implementation on FPGA for Video Processing
    Wong, Kenneth Part Kong
    Yap, VooiVoon
    Teh, Peh Chiong
    2013 IEEE CONFERENCE ON OPEN SYSTEMS (ICOS), 2013, : 47 - 51
  • [3] HARDWARE ACCELERATOR: IMPLEMENTATION OF CNN ON FPGA FOR DIGIT RECOGNITION
    Choudhari, Onkar
    Chopade, Marisha
    Chopde, Sourabh
    Dabhadkar, Swarali
    Ingale, V
    2020 24TH INTERNATIONAL SYMPOSIUM ON VLSI DESIGN AND TEST (VDAT), 2020,
  • [4] Implementation of a DPU-Based Intelligent Thermal Imaging Hardware Accelerator on FPGA
    Hussein, Abdelrahman S.
    Anwar, Ahmed
    Fahmy, Yasmine
    Mostafa, Hassan
    Salama, Khaled Nabil
    Kafafy, Mai
    ELECTRONICS, 2022, 11 (01)
  • [5] FPGA-Based Hardware Accelerator for an Embedded Factor Graph with Configurable Optimization
    Sugiarto, Indar
    Axenie, Cristian
    Conradt, Joerg
    JOURNAL OF CIRCUITS SYSTEMS AND COMPUTERS, 2019, 28 (02)
  • [6] Research on FPGA Accelerator Optimization Based on Graph Neural Network
    Wu, Jin
    Shi, Xiangyang
    Pang, Wenting
    Wang, Yu
    ADVANCES IN NATURAL COMPUTATION, FUZZY SYSTEMS AND KNOWLEDGE DISCOVERY, ICNC-FSKD 2022, 2023, 153 : 536 - 542
  • [7] FPGA-based DNN Hardware Accelerator for Sensor Network Aggregation Node
    Mohamed, Nadya A.
    Cavallaro, Joseph R.
    2022 56TH ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS, AND COMPUTERS, 2022, : 322 - 327
  • [8] A FPGA-based Hardware Accelerator for Bayesian Confidence Propagation Neural Network
    Liu, Lizheng
    Wang, Deyu
    Wang, Yuning
    Lansner, Anders
    Hemani, Ahmed
    Yang, Yu
    Hu, Xiaoming
    Zou, Zhuo
    Zheng, Lirong
    2020 IEEE NORDIC CIRCUITS AND SYSTEMS CONFERENCE (NORCAS), 2020,
  • [9] A FPGA based C runtime hardware accelerator
    Garcia, P.
    Salgado, F.
    Cardoso, P.
    Cabral, J.
    Ekpanyapong, M.
    Tavares, A.
    2011 9TH IEEE INTERNATIONAL CONFERENCE ON INDUSTRIAL INFORMATICS (INDIN), 2011,
  • [10] OPTIMIZATION OF CONVOLUTIONAL NEURAL NETWORK HARDWARE STRUCTURE BASED ON FPGA
    Zhu, Min
    Kuang, Qiqi
    Yang, Chunling
    Lin, Jianjun
    PROCEEDINGS OF THE 2018 13TH IEEE CONFERENCE ON INDUSTRIAL ELECTRONICS AND APPLICATIONS (ICIEA 2018), 2018, : 1797 - 1802