Shuhai: Benchmarking High Bandwidth Memory on FPGAs

被引:45
|
作者
Wang, Zeke [1 ]
Huang, Hongjing [1 ]
Zhang, Jie [1 ]
Alonso, Gustavo [2 ]
机构
[1] Zhejiang Univ, Collaborat Innovat Ctr Artificial Intelligence, Hangzhou, Peoples R China
[2] Swiss Fed Inst Technol, Syst Grp, Zurich, Switzerland
基金
中国国家自然科学基金;
关键词
D O I
10.1109/FCCM48280.2020.00024
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
FPGAs are starting to be enhanced with High Bandwidth Memory (HBM) as a way to reduce the memory bandwidth bottleneck encountered in some applications and to give the FPGA more capacity to deal with application state. However, the performance characteristics of HBM are still not well specified, especially in the context of FPGAs. In this paper, we bridge the gap between nominal specifications and actual performance by benchmarking HBM on a state-of-the-art FPGA, i.e., a Xilinx Alveo U280 featuring a two-stack HBM subsystem. To this end, we propose Shuhai, a benchmarking tool that allows us to demystify all the underlying details of HBM on an FPGA. FPGA-based benchmarking should also provide a more accurate picture of HBM than doing so on CPUs/GPUs, since CPUs/GPUs are noisier systems due to their complex control logic and cache hierarchy. Since the memory itself is complex, leveraging custom hardware logic to benchmark inside an FPGA provides more details as well as accurate and deterministic measurements. We observe that 1) HBM is able to provide up to 425 GB/s memory bandwidth, and 2) how HBM is used has a significant impact on performance, which in turn demonstrates the importance of unveiling the performance characteristics of HBM so as to select the best approach. Shuhai can be easily generalized to other FPGA boards or other generations of memory, e.g., HBM3, and DDR3. We will make Shuhai open-source, benefiting the community.
引用
收藏
页码:111 / 119
页数:9
相关论文
共 50 条
  • [21] Designing and Implementing Multibeam Smart Antennas for High Bandwidth UAV Communications using FPGAs
    Porcello, John C.
    [J]. 2013 IEEE AEROSPACE CONFERENCE, 2013,
  • [22] Towards High-Bandwidth-Utilization SpMV on FPGAs via Partial Vector Duplication
    Liu, Bowen
    Liu, Dajiang
    [J]. 2023 28TH ASIA AND SOUTH PACIFIC DESIGN AUTOMATION CONFERENCE, ASP-DAC, 2023, : 33 - 38
  • [23] Bandwidth-Sensitivity-Aware Arbitration for FPGAs
    Hao, Lu
    Stitt, Greg
    [J]. IEEE EMBEDDED SYSTEMS LETTERS, 2012, 4 (03) : 73 - 76
  • [24] Joins in a Heterogeneous Memory Hierarchy: Exploiting High-Bandwidth Memory
    Pohl, Constantin
    Sattler, Kai-Uwe
    [J]. 14TH INTERNATIONAL WORKSHOP ON DATA MANAGEMENT ON NEW HARDWARE (DAMON 2018), 2018,
  • [25] Joins on high-bandwidth memory: a new level in the memory hierarchy
    Constantin Pohl
    Kai-Uwe Sattler
    Goetz Graefe
    [J]. The VLDB Journal, 2020, 29 : 797 - 817
  • [26] Joins on high-bandwidth memory: a new level in the memory hierarchy
    Pohl, Constantin
    Sattler, Kai-Uwe
    Graefe, Goetz
    [J]. VLDB JOURNAL, 2020, 29 (2-3): : 797 - 817
  • [27] Optimizing Use of Different Types of Memory for FPGAs in High Performance Computing
    Huang, Kai
    Gungor, Mehmet
    Ioannidis, Stratis
    Leeser, Miriam
    [J]. 2020 IEEE HIGH PERFORMANCE EXTREME COMPUTING CONFERENCE (HPEC), 2020,
  • [28] OpenCL Memory Infrastructure for FPGAs
    Chin, S. Alexander
    Chow, Paul
    [J]. FPGA 12: PROCEEDINGS OF THE 2012 ACM-SIGDA INTERNATIONAL SYMPOSIUM ON FIELD PROGRAMMABLE GATE ARRAYS, 2012, : 269 - 270
  • [29] Embedded Memory Binding in FPGAs
    Elizeh, Kaveh
    Nicolici, Nicola
    [J]. PROCEEDINGS OF THE 47TH DESIGN AUTOMATION CONFERENCE, 2010, : 457 - 462
  • [30] Performance evaluation of High Bandwidth Memory for HPC Workloads
    Kabat, Amit Kumar
    Pandey, Shubhang
    Gopalakrishnan, Venkatesh Tiruchirai
    [J]. 2022 IEEE 35TH INTERNATIONAL SYSTEM-ON-CHIP CONFERENCE (IEEE SOCC 2022), 2022, : 172 - 177