A power efficient linear equation solver on a multi-FPGA accelerator

被引:0
|
作者
Sudarsanam A. [1 ]
Hauser T. [2 ]
Dasu A. [1 ]
Young S. [1 ]
机构
[1] Department of Electrical and Computer Engineering, Northwestern University, Evanston
[2] Academic and Research Technology, Northwestern University, Evanston
关键词
FPGA; Linear algebra; Right-hand side solver;
D O I
10.2316/Journal.202.2010.1.202-2478
中图分类号
学科分类号
摘要
This paper presents an approach to explore a commercial multi field programmable gate array (FPGA) system as high performance accelerator and the problem of solving an LU decomposed linear system of equations using forward and back substitution is addressed. Block-based right-hand-side solver algorithm is described and a novel data flow and memory architectures that can support arbitrary data types, block sizes and matrix sizes is proposed. These architectures have been implemented on a multi-FPGA system. Capabilities of the accelerator system are pushed to its limits by implementing the problem for double precision complex floatingpoint data. Detailed timing data is presented and augmented with data from a performance model proposed in this paper. Performance of the accelerator system is evaluated against that of a state of the art low power Beowulf cluster node running an optimized LAPACK implementation. Both systems are compared using the power efficiency (performance/watt) metric. FPGA system is about eleven times more power efficient than the compute node of a cluster.
引用
收藏
页码:56 / 72
页数:16
相关论文
共 50 条
  • [41] Multi-FPGA Prototyping Board Issue: the FPGA I/O Bottleneck
    Tang, Qingshan
    Mehrez, Habib
    Tuna, Matthieu
    2014 INTERNATIONAL CONFERENCE ON EMBEDDED COMPUTER SYSTEMS: ARCHITECTURES, MODELING, AND SIMULATION (SAMOS XIV), 2014, : 207 - 214
  • [42] Inter-FPGA interconnect topologies exploration for multi-FPGA systems
    Farooq, Umer
    Mehrez, Habib
    Bhatti, Muhammad Khurram
    DESIGN AUTOMATION FOR EMBEDDED SYSTEMS, 2018, 22 (1-2) : 117 - 140
  • [43] Inter-FPGA interconnect topologies exploration for multi-FPGA systems
    Umer Farooq
    Habib Mehrez
    Muhammad Khurram Bhatti
    Design Automation for Embedded Systems, 2018, 22 : 117 - 140
  • [44] Efficient CNN Accelerator on FPGA
    Kala, S.
    Nalesh, S.
    IETE JOURNAL OF RESEARCH, 2020, 66 (06) : 733 - 740
  • [45] A Profiler for a Heterogeneous Multi-Core Multi-FPGA System
    Nunes, Daniel
    Saldana, Manuel
    Chow, Paul
    PROCEEDINGS OF THE 2008 INTERNATIONAL CONFERENCE ON FIELD-PROGRAMMABLE TECHNOLOGY, 2008, : 113 - +
  • [46] A Framework for Multi-FPGA Interconnection using Multi Gigabit Transceivers
    Dreschmann, Michael
    Heisswolf, Jan
    Geiger, Michael
    Haussecker, Manuel
    Becker, Juergen
    2015 28TH SYMPOSIUM ON INTEGRATED CIRCUITS AND SYSTEMS DESIGN (SBCCI), 2015,
  • [47] Accelerating Radiation Dose Calculation: A Multi-FPGA Solution
    Zhou, Bo
    Hu, Xiaobo Sharon
    Chen, Danny Z.
    Yu, Cedric X.
    ACM TRANSACTIONS ON EMBEDDED COMPUTING SYSTEMS, 2013, 13
  • [48] Design of Hardened Embedded Systems on Multi-FPGA Platforms
    Bolchini, Cristiana
    Sandionigi, Chiara
    ACM TRANSACTIONS ON DESIGN AUTOMATION OF ELECTRONIC SYSTEMS, 2014, 20 (01) : 1 - 26
  • [49] A Scalable Multi-FPGA Platform for Complex Networking Applications
    Muehlbach, Sascha
    Koch, Andreas
    2011 IEEE 19TH ANNUAL INTERNATIONAL SYMPOSIUM ON FIELD-PROGRAMMABLE CUSTOM COMPUTING MACHINES (FCCM), 2011, : 81 - 84
  • [50] A reliability-aware partitioner for multi-FPGA platforms
    Bolchini, Cristiana
    Sandionigi, Chiara
    2011 IEEE INTERNATIONAL SYMPOSIUM ON DEFECT AND FAULT TOLERANCE IN VLSI AND NANOTECHNOLOGY SYSTEMS (DFT), 2011, : 34 - 40