Open-Source SpMV Multiplication Hardware Accelerator for FPGA-Based HPC Systems

Authors
Mpakos, Panagiotis [1]
Tasou, Ioanna [1]
Alverti, Chloe [3]
Miliadis, Panagiotis [1]
Malakonakis, Pavlos [2]
Theodoropoulos, Dimitris [1]
Goumas, Georgios [1]
Pnevmatikatos, Dionisios N. [1]
Koziris, Nectarios [1]
Affiliations
[1] Natl Tech Univ Athens, Comp Syst Lab, Athens, Greece
[2] Tech Univ Crete, Khania, Greece
[3] Univ Illinois, Champaign, IL USA
Funding
EU Horizon 2020
Keywords
Open-Source; SpMV; Sparse Matrix; HLS
DOI
10.1007/978-3-031-55673-9_2
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
The Sparse Matrix Vector (SpMV) multiplication kernel is a key component of many high-performance computing applications, but it is also one of the most challenging to optimize, primarily due to its low flop-per-byte ratio and irregular memory accesses. As such, modern FPGAs, combined with High-Bandwidth Memory (HBM) modules, are much better suited to the memory-bound nature of this kernel than general-purpose CPUs. Current FPGA-based approaches to SpMV support only single-precision floating-point arithmetic. Moreover, they target highly streamed implementations that, although they enhance performance, use custom matrix storage formats, which (i) can increase the matrix footprint by up to 3x, and (ii) shift the burden of input matrix transformation onto developers. Towards widening the spectrum of FPGA-supported floating-point formats for sparse algebra, this paper presents a first set of effective optimizations for double-precision SpMV hardware kernels using High-Level Synthesis (HLS) tools on HBM-equipped FPGAs. Results show that our work provides on average 52.4x better performance than state-of-practice double-precision SpMV implementations on FPGAs for applications with volatile matrices, and up to 5.1x better performance-per-Watt than server-class CPUs.
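To illustrate why the abstract calls SpMV memory-bound and irregular, the following is a minimal software sketch, not the paper's hardware kernel. It assumes the standard Compressed Sparse Row (CSR) layout, which the abstract does not name, and the names `spmv_csr` and `flops_per_byte` are hypothetical. The traffic estimate assumes 8-byte double-precision values, 4-byte indices, and that each array is moved to or from memory exactly once.

```python
def spmv_csr(row_ptr, col_idx, values, x):
    """Compute y = A @ x for a matrix A stored in CSR form (assumed format)."""
    n_rows = len(row_ptr) - 1
    y = [0.0] * n_rows
    for i in range(n_rows):
        acc = 0.0
        # Nonzeros of row i occupy positions row_ptr[i] .. row_ptr[i + 1] - 1.
        for k in range(row_ptr[i], row_ptr[i + 1]):
            # x is indexed through col_idx: this is the irregular access.
            acc += values[k] * x[col_idx[k]]
        y[i] = acc
    return y


def flops_per_byte(nnz, n_rows, n_cols, value_bytes=8, index_bytes=4):
    """Rough flop-per-byte estimate, assuming every array is moved once."""
    flops = 2 * nnz  # one multiply and one add per nonzero
    matrix_bytes = nnz * (value_bytes + index_bytes) + (n_rows + 1) * index_bytes
    vector_bytes = n_cols * value_bytes + n_rows * value_bytes  # read x, write y
    return flops / (matrix_bytes + vector_bytes)


# 3x3 example matrix: [[4, 0, 0], [0, 0, 2], [1, 0, 3]]
row_ptr = [0, 1, 2, 4]
col_idx = [0, 2, 0, 2]
values = [4.0, 2.0, 1.0, 3.0]
y = spmv_csr(row_ptr, col_idx, values, [1.0, 2.0, 3.0])
ratio = flops_per_byte(nnz=4, n_rows=3, n_cols=3)
```

Under these assumptions each nonzero contributes 2 flops for at least 12 bytes of matrix data, so the ratio stays below 2/12 regardless of matrix size, and memory bandwidth rather than arithmetic throughput limits performance.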
Pages: 19-32
Page count: 14