Exploring energy efficiency of LSTM accelerators: A parameterized architecture design for embedded FPGAs

Cited by: 1
Authors
Qian, Chao [1]
Ling, Tianheng [1]
Schiele, Gregor [1]
Affiliations
[1] Univ Duisburg Essen, Intelligent Embedded Syst Lab, D-47057 Duisburg, Germany
Keywords
Time-series analysis; LSTM accelerators; Embedded FPGAs; Parameterized architecture design; Activation function optimization; Energy efficiency
DOI
10.1016/j.sysarc.2024.103181
Chinese Library Classification number
TP3 [Computing technology, computer technology]
Discipline classification code
0812
Abstract
Long Short-Term Memory Networks (LSTMs) are pivotal in on-device time series analysis for embedded systems, particularly for managing sensor data streams. Yet, their deployment on resource-constrained embedded devices presents notable challenges. In response, we introduce a novel parameterized architecture for LSTM accelerators designed explicitly for embedded Field-Programmable Gate Arrays (FPGAs). Our approach involves strategic design choices, such as employing computationally efficient activation functions and optimizing clock frequency with a pipelined Arithmetic Logic Unit (ALU). These decisions drive our architecture towards enhanced energy efficiency while maintaining adaptability across diverse application scenarios. A key feature of our architecture is its configurable parameters, which allow for tailored optimization through the optional use of Digital Signal Processor Slices for ALUs and the selective implementation of activation functions. Our empirical evaluations conducted on the Spartan-7 XC7S15 FPGA demonstrate the robustness of our methodology, achieving a 2.33x improvement in energy efficiency over previous solutions. Furthermore, our study examines the correlation between memory resource types and energy efficiency across various LSTM model sizes. Impressively, even with a 9x increase in the hidden size of the LSTM cell, our accelerator maintains an energy efficiency of 10.03 GOP/s/W, with only a minor decrease of 14.65%. However, it is critical to note that our current design is not yet optimized for larger FPGA models such as the Spartan-7 XC7S25 and XC7S50. For these models, timing constraints, rather than resource limitations, pose challenges to scaling, highlighting a potential area for future optimization.
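The energy-efficiency figures quoted in the abstract can be related to each other with a little arithmetic. The sketch below is illustrative only: it assumes the 14.65% decrease is measured relative to the efficiency at the smaller hidden size, and the function names are hypothetical, not taken from the paper. The abstract does not state the baseline value; the number computed here is merely what those two figures would imply under that assumption.

```python
def energy_efficiency(gops: float, watts: float) -> float:
    """Energy efficiency in GOP/s/W: throughput (GOP/s) divided by power (W)."""
    return gops / watts


def baseline_from_decrease(current: float, decrease_pct: float) -> float:
    """Return the value that, reduced by decrease_pct percent, yields `current`.

    Assumes the percentage is relative to the baseline (an assumption,
    not something the abstract states explicitly).
    """
    return current / (1.0 - decrease_pct / 100.0)


# Figures quoted in the abstract.
reported_efficiency = 10.03   # GOP/s/W at the 9x larger hidden size
reported_decrease = 14.65     # percent

implied_baseline = baseline_from_decrease(reported_efficiency, reported_decrease)
print(f"Implied baseline efficiency: {implied_baseline:.2f} GOP/s/W")
```

Under that reading, the efficiency at the smaller hidden size would be roughly 11.75 GOP/s/W.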
Pages: 16
Related papers
35 in total
  • [1] Energy Efficient LSTM Accelerators for Embedded FPGAs Through Parameterised Architecture Design
    Qian, Chao
    Ling, Tianheng
    Schiele, Gregor
    ARCHITECTURE OF COMPUTING SYSTEMS, ARCS 2023, 2023, 13949 : 3 - 17
  • [2] Enhancing Energy-Efficiency by Solving the Throughput Bottleneck of LSTM Cells for Embedded FPGAs
    Qian, Chao
    Ling, Tianheng
    Schiele, Gregor
    MACHINE LEARNING AND PRINCIPLES AND PRACTICE OF KNOWLEDGE DISCOVERY IN DATABASES, ECML PKDD 2022, PT I, 2023, 1752 : 594 - 605
  • [3] Design flow for embedded FPGAs based on a flexible architecture template
    Neumann, B.
    von Sydow, T.
    Blume, H.
    Noll, T. G.
    2008 DESIGN, AUTOMATION AND TEST IN EUROPE, VOLS 1-3, 2008, : 54 - +
  • [4] Design of OpenCL-Compatible Multithreaded Hardware Accelerators with Dynamic Support for Embedded FPGAs
    Rodriguez, Alfonso
    Valverde, Juan
    de la Torre, Eduardo
    2015 INTERNATIONAL CONFERENCE ON RECONFIGURABLE COMPUTING AND FPGAS (RECONFIG), 2015,
  • [5] Synetgy: Algorithm-hardware Co-design for ConvNet Accelerators on Embedded FPGAs
    Yang, Yifan
    Huang, Qijing
    Wu, Bichen
    Zhang, Tianjun
    Ma, Liang
    Gambardella, Giulio
    Blott, Michaela
    Lavagno, Luciano
    Vissers, Kees
    Wawrzynek, John
    Keutzer, Kurt
    PROCEEDINGS OF THE 2019 ACM/SIGDA INTERNATIONAL SYMPOSIUM ON FIELD-PROGRAMMABLE GATE ARRAYS (FPGA'19), 2019, : 23 - 32
  • [6] Exploring Area and Delay Tradeoffs in FPGAs With Architecture and Automated Transistor Design
    Kuon, Ian
    Rose, Jonathan
    IEEE TRANSACTIONS ON VERY LARGE SCALE INTEGRATION (VLSI) SYSTEMS, 2011, 19 (01) : 71 - 84
  • [7] PermLSTM: A High Energy-Efficiency LSTM Accelerator Architecture
    Zheng, Yong
    Yang, Haigang
    Jia, Yiping
    Huang, Zhihong
    ELECTRONICS, 2021, 10 (08)
  • [8] MSBF-LSTM: Most-significant Bit-first LSTM Accelerators with Energy Efficiency Optimisations
    Bian, Sige
    Li, He
    Wang, Chengcheng
    Song, Changjun
    Tang, Yongming
    2023 IEEE 31ST ANNUAL INTERNATIONAL SYMPOSIUM ON FIELD-PROGRAMMABLE CUSTOM COMPUTING MACHINES, FCCM, 2023, : 218 - 218
  • [9] Exploring Energy Efficiency Model Generalization on Multicore Embedded Platforms
    Rexha, Hergys
    Lafond, Sebastien
    2018 26TH EUROMICRO INTERNATIONAL CONFERENCE ON PARALLEL, DISTRIBUTED, AND NETWORK-BASED PROCESSING (PDP 2018), 2018, : 494 - 498
  • [10] Research on Aesthetics and Energy Efficiency Design of Architecture
    Chen, Shuang
    Development of Industrial Manufacturing, 2014, 525 : 380 - 383