Tinsel: a manythread overlay for FPGA clusters

被引:16
|
作者
Naylor, Matthew [1 ]
Moore, Simon W. [1 ]
Thomas, David [2 ]
机构
[1] Univ Cambridge, Cambridge, England
[2] Imperial Coll London, London, England
基金
英国工程与自然科学研究理事会;
关键词
D O I
10.1109/FPL.2019.00066
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Commodity FPGA boards with advanced networking facilities have great potential in the construction of high-performance compute clusters that scale. However, low-level design tools and long synthesis times are major barriers to productivity for application developers. In this paper, we explore the potential of a distributed soft-processor overlay, programmed in software at a high-level of abstraction, to deliver a useful level of performance for FPGA clusters. In particular, we demonstrate the use of hardware multhreading to achieve a fast, space-efficient, high-throughput overlay, and compare a 12-FPGA instance of it (12,288 RISC-V threads) against a conventional Xeon cluster on the problem of distributed graph processing.
引用
收藏
页码:375 / 383
页数:9
相关论文
共 50 条
  • [21] An Adaptive Virtual Overlay for Fast Trigger Insertion for FPGA Debug
    Eslami, Fatemeh
    Wilton, Steven J. E.
    2015 INTERNATIONAL CONFERENCE ON FIELD PROGRAMMABLE TECHNOLOGY (FPT), 2015, : 32 - 39
  • [22] An Efficient FPGA Overlay for Portable Custom Instruction Set Extensions
    Koch, Dirk
    Beckhoff, Christian
    Lemieux, Guy G. F.
    2013 23RD INTERNATIONAL CONFERENCE ON FIELD PROGRAMMABLE LOGIC AND APPLICATIONS (FPL 2013) PROCEEDINGS, 2013,
  • [23] DLA: Compiler and FPGA Overlay for Neural Network Inference Acceleration
    Abdelfattah, Mohamed S.
    Han, David
    Bitar, Andrew
    DiCecco, Roberto
    O'Connell, Shane
    Shanker, Nitika
    Chu, Joseph
    Prins, Ian
    Fender, Joshua
    Ling, Andrew C.
    Chiu, Gordon R.
    2018 28TH INTERNATIONAL CONFERENCE ON FIELD PROGRAMMABLE LOGIC AND APPLICATIONS (FPL), 2018, : 411 - 418
  • [24] High Performance Programmable FPGA Overlay for Digital Signal Processing
    McGettrick, Seamas
    Patel, Kunjan
    Bleakley, Chris
    RECONFIGURABLE COMPUTING: ARCHITECTURES, TOOLS AND APPLICATIONS, 2011, 6578 : 375 - 384
  • [25] Deflection Routing for Multi-Level FPGA Overlay NoCs
    Kumar, Chethan H. B.
    Agarwal, Shubham
    Kapre, Nachiket
    2016 INTERNATIONAL CONFERENCE ON FIELD-PROGRAMMABLE TECHNOLOGY (FPT), 2016, : 149 - 156
  • [26] Distributed Recommendation Inference on FPGA Clusters
    Zhu, Yu
    He, Zhenhao
    Jiang, Wenqi
    Zeng, Kai
    Zhou, Jingren
    Alonso, Gustavo
    2021 31ST INTERNATIONAL CONFERENCE ON FIELD-PROGRAMMABLE LOGIC AND APPLICATIONS (FPL 2021), 2021, : 279 - 285
  • [27] Leveraging FPGA clusters for SAT computations
    Kouril, Michal
    PARALLEL COMPUTING: ON THE ROAD TO EXASCALE, 2016, 27 : 525 - 532
  • [28] Packet switched vs. time multiplexed FPGA overlay networks
    Kapre, Nachiket
    Mehta, Nikil
    deLorimier, Michael
    Rubin, Raphael
    Barnor, Henry
    Wilson, Michael J.
    Wrighton, Michael
    DeHon, Andre
    FCCM 2006: 14TH ANNUAL IEEE SYMPOSIUM ON FIELD-PROGRAMMABLE CUSTOM COMPUTING MACHINES, PROCEEDINGS, 2006, : 205 - +
  • [29] Implementing FPGA overlay NoCs using the Xilinx UltraScale memory cascades
    Kapre, Nachiket
    2017 IEEE 25TH ANNUAL INTERNATIONAL SYMPOSIUM ON FIELD-PROGRAMMABLE CUSTOM COMPUTING MACHINES (FCCM 2017), 2017, : 40 - 47
  • [30] FTDL: A Tailored FPGA-Overlay for Deep Learning with High Scalability
    Shi, Runbin
    Ding, Yuhao
    Wei, Xuechao
    Li, He
    Liu, Hang
    So, Hayden K. H.
    Ding, Caiwen
    PROCEEDINGS OF THE 2020 57TH ACM/EDAC/IEEE DESIGN AUTOMATION CONFERENCE (DAC), 2020,