A power efficient linear equation solver on a multi-FPGA accelerator

被引:0
|
作者
Sudarsanam A. [1 ]
Hauser T. [2 ]
Dasu A. [1 ]
Young S. [1 ]
机构
[1] Department of Electrical and Computer Engineering, Northwestern University, Evanston
[2] Academic and Research Technology, Northwestern University, Evanston
关键词
FPGA; Linear algebra; Right-hand side solver;
D O I
10.2316/Journal.202.2010.1.202-2478
中图分类号
学科分类号
摘要
This paper presents an approach to explore a commercial multi field programmable gate array (FPGA) system as high performance accelerator and the problem of solving an LU decomposed linear system of equations using forward and back substitution is addressed. Block-based right-hand-side solver algorithm is described and a novel data flow and memory architectures that can support arbitrary data types, block sizes and matrix sizes is proposed. These architectures have been implemented on a multi-FPGA system. Capabilities of the accelerator system are pushed to its limits by implementing the problem for double precision complex floatingpoint data. Detailed timing data is presented and augmented with data from a performance model proposed in this paper. Performance of the accelerator system is evaluated against that of a state of the art low power Beowulf cluster node running an optimized LAPACK implementation. Both systems are compared using the power efficiency (performance/watt) metric. FPGA system is about eleven times more power efficient than the compute node of a cluster.
引用
收藏
页码:56 / 72
页数:16
相关论文
共 50 条
  • [31] Using Multi-FPGA Systems for ASIC Prototyping
    Melnikova, Olga
    Hahanova, Irina
    Mostovaya, Karina
    EXPERIENCE OF DESIGNING AND APPLICATION OF CAD SYSTEMS IN MICROELECTRONICS, 2009, : 237 - 239
  • [32] Parallel Implementation of CNN on Multi-FPGA Cluster
    Fukushima, Yasuyu
    Iizuka, Kensuke
    Amano, Hideharu
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2023, E106D (07) : 1198 - 1208
  • [33] An Efficient Design Flow for Accelerating Complicated-connected CNNs on a Multi-FPGA Platform
    Wang, Deguang
    Shen, Junzhong
    Wen, Mei
    Zhang, Chunyuan
    PROCEEDINGS OF THE 48TH INTERNATIONAL CONFERENCE ON PARALLEL PROCESSING (ICPP 2019), 2019,
  • [34] A communication scheduling algorithm for multi-FPGA systems
    Suh, J
    Kang, DI
    Crago, SP
    2000 IEEE SYMPOSIUM ON FIELD-PROGRAMMABLE CUSTOM COMPUTING MACHINES, PROCEEDINGS, 2000, : 299 - 300
  • [35] Timing driven multi-FPGA board partitioning
    Burra, R
    Bhatia, D
    ELEVENTH INTERNATIONAL CONFERENCE ON VLSI DESIGN, PROCEEDINGS, 1997, : 234 - 237
  • [36] Toward a training of CNNs on a multi-FPGA system
    Hiruma, Aoi
    Iizuka, Kensuke
    Amano, Hideharu
    2022 TENTH INTERNATIONAL SYMPOSIUM ON COMPUTING AND NETWORKING WORKSHOPS, CANDARW, 2022, : 229 - 235
  • [37] Extending the BEAGLE library to a multi-FPGA platform
    Jin, Zheming
    Bakos, Jason D.
    BMC BIOINFORMATICS, 2013, 14
  • [38] Parallel Implementation of CNN on Multi-FPGA Cluster
    Fukushima, Yasuyu
    Iizuka, Kensuke
    Amano, Hideharu
    2021 IEEE 14TH INTERNATIONAL SYMPOSIUM ON EMBEDDED MULTICORE/MANY-CORE SYSTEMS-ON-CHIP (MCSOC 2021), 2021, : 77 - 83
  • [39] Extending the BEAGLE library to a multi-FPGA platform
    Zheming Jin
    Jason D Bakos
    BMC Bioinformatics, 14
  • [40] Multi-FPGA Evaluation Platform for Disaggregated Computing
    Theodoropoulos, Dimitris
    Alachiotis, Nikolaos
    Pnevmatikatos, Dionisios
    2017 IEEE 25TH ANNUAL INTERNATIONAL SYMPOSIUM ON FIELD-PROGRAMMABLE CUSTOM COMPUTING MACHINES (FCCM 2017), 2017, : 193 - 193