Architecture and Application Co-Design for Beyond-FPGA Reconfigurable Acceleration Devices

被引:5
|
作者
Boutros, Andrew [1 ,2 ]
Nurvitadhi, Eriko [2 ]
Betz, Vaughn [1 ]
机构
[1] Univ Toronto, Dept Elect & Comp Engn, Toronto, ON M5S 3G4, Canada
[2] Intel Corp, Programmable Solut Grp, Santa Clara, CA 95054 USA
基金
加拿大自然科学与工程研究理事会;
关键词
Deep learning; field-programmable gate arrays; hardware acceleration; network-on-chip; reconfigurable computing; EMBEDDED NETWORKS; CHIP;
D O I
10.1109/ACCESS.2022.3204664
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In recent years, field-programmable gate arrays (FPGAs) have been increasingly deployed in datacenters as programmable accelerators that can offer software-like flexibility and custom-hardware-like efficiency for key datacenter workloads. To improve the efficiency of FPGAs for these new datacenter use cases and data-intensive applications, a new class of reconfigurable acceleration devices (RADs) is emerging. In these devices, the FPGA fine-grained reconfigurable fabric is a component of a bigger monolithic or multi-die system-in-package that can incorporate general-purpose software-programmable cores, domain-specialized accelerator blocks, and high-performance networks-on-chip (NoCs) for efficient communication between these system components. The integration of all these components in a RAD results in a huge design space and requires re-thinking the implementation of applications that need to be migrated from conventional FPGAs to these novel devices. In this work, we introduce RAD-Sim, an architecture simulator that allows rapid design space exploration for RADs and facilitates the study of complex interactions between their various components. We also present a case study that highlights the utility of RAD-Sim in re-designing applications for these novel RADs by mapping a state-of-the-art deep learning (DL) inference FPGA overlay to different RAD instances. Our case study illustrates how RAD-Sim can capture a wide variety of reconfigurable architectures, from conventional FPGAs to devices augmented with hard NoCs, specialized matrix-vector blocks, and 3D-stacked multi-die devices. In addition, we show that our tool can help architects evaluate the effect of specific RAD architecture parameters on end-to-end workload performance. Through RAD-Sim, we also show that novel RADs can potentially achieve 2.6x better performance on average compared to conventional FPGAs in the key DL application domain.
引用
收藏
页码:95067 / 95082
页数:16
相关论文
共 50 条
  • [21] From Model to FPGA: Software-Hardware Co-Design for Efficient Neural Network Acceleration
    Guo, Kaiyun
    Sui, Lingzhi
    Qui, Jiantao
    Yao, Song
    Han, Song
    Wang, Yu
    Yang, Huanzhang
    2016 IEEE HOT CHIPS 28 SYMPOSIUM (HCS), 2016,
  • [22] FPGA Implementation for GPR Signal Processing Based on HW/SW Co-Design Architecture
    Srimuk, Pachara
    Boonpoonga, Akkarat
    Burintramart, Santana
    2015 IEEE CONFERENCE ON ANTENNA MEASUREMENTS & APPLICATIONS (CAMA), 2015,
  • [23] Model-Architecture Co-Design for High Performance Temporal GNN Inference on FPGA
    Zhou, Hongkuan
    Zhang, Bingyi
    Kannan, Rajgopal
    Prasanna, Viktor
    Busart, Carl
    2022 IEEE 36TH INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM (IPDPS 2022), 2022, : 1108 - 1117
  • [24] Structuring the co-design of requirements and architecture
    Pohl, Klaus
    Sikora, Ernst
    REQUIREMENTS ENGINEERING: FOUNDATION FOR SOFTWARE QUALITY, 2007, 4542 : 48 - +
  • [25] Computational co-design of fibrous architecture
    Achim Menges
    Fabian Kannenberg
    Christoph Zechmeister
    Architectural Intelligence , 1 (1):
  • [26] Multi-fidelity Surrogate Modeling for Application/Architecture Co-design
    Zhang, Yiming
    Neelakantan, Aravind
    Kumar, Nalini
    Park, Chanyoung
    Haftka, Raphael T.
    Kim, Nam H.
    Lam, Herman
    HIGH PERFORMANCE COMPUTING SYSTEMS: PERFORMANCE MODELING, BENCHMARKING, AND SIMULATION (PMBS 2017), 2018, 10724 : 179 - 196
  • [27] AutoML for Multilayer Perceptron and FPGA Co-design
    Colangelo, Philip
    Segal, Oren
    Speicher, Alex
    Margala, Martin
    2020 IEEE 33RD INTERNATIONAL SYSTEM-ON-CHIP CONFERENCE (SOCC), 2020, : 265 - 266
  • [28] Hardware-software co-design of a dynamically reconfigurable FPGA-based fuzzy logic controller
    Fons, Francisco
    Fons, Mariano
    Canto, Enrique
    2006 13TH IEEE INTERNATIONAL CONFERENCE ON ELECTRONICS, CIRCUITS AND SYSTEMS, VOLS 1-3, 2006, : 1228 - 1231
  • [29] Virtual component co-design - Applying function architecture co-design to automotive applications
    Schirrmeister, F
    Sangiovanni-Vincentelli, A
    IVEC 2001: PROCEEDINGS OF THE IEEE INTERNATIONAL VEHICLE ELECTRONICS CONFERENCE, 2002, : 221 - 226
  • [30] ReAAP: A Reconfigurable and Algorithm-Oriented Array Processor With Compiler-Architecture Co-Design
    Zheng, Jianwei
    Liu, Yu
    Liu, Xuejiao
    Liang, Luhong
    Chen, Deming
    Cheng, Kwang-Ting
    IEEE TRANSACTIONS ON COMPUTERS, 2022, 71 (12) : 3088 - 3100