Performance-Driven LSTM Accelerator Hardware Using Split-Matrix-Based MVM

被引:0
|
作者
Tresa Joseph
T. S. Bindiya
机构
[1] National Institute of Technology Calicut,Department of Electronics and Communication Engineering
关键词
Recurrent neural network; Long short-term memory; Systolic array architecture; Parallel computing;
D O I
暂无
中图分类号
学科分类号
摘要
This paper proposes a new hardware approach for accelerating matrix vector multiplication (MVM) employing systolic array architecture and parallel data processing units, which is particularly useful in multiplication intensive applications such as neural networks. The hardware complexity of the parallel computations is reduced by a technique named as split-matrix approach, in which the larger matrices are split into smaller matrices. In the proposed architecture, 8-bit fixed-point representation is considered and matrices are treated to be circulant in nature. The resulting MVM architecture benefits with reduced implementation complexity in terms of cell area, reduced delay, and power consumption. It is found to result in a 13.9% reduction in logic cell area and a 38.15% reduction in total power consumption when compared to those of the latest baseline design. Also, the proposed architecture is able to achieve a considerably improved minimum permissible clock period of 0.410ns. The development of a long short-term memory (LSTM) architecture using the proposed design also serves to prove the effectiveness of the proposed MVM architecture. The LSTM developed using the proposed MVM provides a 37.57% reduction in the cell area and a 22.86% reduction in the total power in comparison with the latest baseline design and is able to achieve a minimum clock period of 0.42 ns.
引用
收藏
页码:6660 / 6683
页数:23
相关论文
共 50 条
  • [21] Performance-driven development of a web services application using MetaPL/HeSSE
    Mancini, E
    Villano, U
    Mazzocca, N
    Rak, M
    Torella, R
    13TH EUROMICRO CONFERENCE ON PARALLEL, DISTRIBUTED AND NETWORK-BASED PROCESSING, PROCEEDINGS, 2005, : 12 - 19
  • [22] Performance-driven Evaluation for Deploying IMS-based Interoperability Scenarios
    Oscar Fajardo, Jose
    Liberal, Fidel
    Li, Fudong
    Clarke, Nathan
    Mkwawa, Is-Haka
    Sun, Lingfen
    2014 IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS (ICC), 2014, : 3019 - 3024
  • [23] Control Theory for Model-based Performance-driven Software Adaptation
    Arcelli, Davide
    Cortellessa, Vittorio
    Filieri, Antonio
    Leva, Alberto
    QOSA'15 PROCEEDINGS OF THE 11TH INTERNATIONAL ACM SIGSOFT CONFERENCE ON QUALITY OF SOFTWARE ARCHITECTURES, 2015, : 11 - 20
  • [24] FPGA-based Pipelined LSTM accelerator with Approximate matrix multiplication technique
    Chaudhary, Aniket
    Kumar, Arun
    Srivastava, Ayush
    Suneja, Kriti
    2021 5TH INTERNATIONAL CONFERENCE ON ELECTRICAL, ELECTRONICS, COMMUNICATION, COMPUTER TECHNOLOGIES AND OPTIMIZATION TECHNIQUES (ICEECCOT), 2021, : 438 - 442
  • [25] Early-Phase Performance-Driven Design Using Generative Models
    Ampanavos, Spyridon
    Malkawi, Ali
    COMPUTER-AIDED ARCHITECTURAL DESIGN: DESIGN IMPERATIVES: THE FUTURE IS NOW, 2022, 1465 : 87 - 106
  • [26] Performance-Driven Dynamic Thermal Management of MPSoC Based on Task Rescheduling
    Ganeshpure, Kunal
    Kundu, Sandip
    ACM TRANSACTIONS ON DESIGN AUTOMATION OF ELECTRONIC SYSTEMS, 2014, 19 (02)
  • [27] An Instruction-Driven Batch-Based High-Performance Resource-Efficient LSTM Accelerator on FPGA
    Mao, Ning
    Yang, Haigang
    Huang, Zhihong
    ELECTRONICS, 2023, 12 (07)
  • [28] Performance-Driven Multi-FPGA partitioning using functional clustering and replication
    Fang, WJ
    Wu, ACH
    1998 DESIGN AUTOMATION CONFERENCE, PROCEEDINGS, 1998, : 283 - 286
  • [29] Climate and performance-driven architectural floorplan optimization using deep graph networks
    Yang, Yang
    Luo, Hanzhong
    Adibhesami, Mohammad Anvar
    ENGINEERING CONSTRUCTION AND ARCHITECTURAL MANAGEMENT, 2025,
  • [30] Assessment of performance-driven investment strategies of distribution systems using reference networks
    Levi, V
    Strbac, G
    Allan, R
    IEE PROCEEDINGS-GENERATION TRANSMISSION AND DISTRIBUTION, 2005, 152 (01) : 1 - 10