Architecture and Application Co-Design for Beyond-FPGA Reconfigurable Acceleration Devices

被引:5
|
作者
Boutros, Andrew [1 ,2 ]
Nurvitadhi, Eriko [2 ]
Betz, Vaughn [1 ]
机构
[1] Univ Toronto, Dept Elect & Comp Engn, Toronto, ON M5S 3G4, Canada
[2] Intel Corp, Programmable Solut Grp, Santa Clara, CA 95054 USA
基金
加拿大自然科学与工程研究理事会;
关键词
Deep learning; field-programmable gate arrays; hardware acceleration; network-on-chip; reconfigurable computing; EMBEDDED NETWORKS; CHIP;
D O I
10.1109/ACCESS.2022.3204664
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In recent years, field-programmable gate arrays (FPGAs) have been increasingly deployed in datacenters as programmable accelerators that can offer software-like flexibility and custom-hardware-like efficiency for key datacenter workloads. To improve the efficiency of FPGAs for these new datacenter use cases and data-intensive applications, a new class of reconfigurable acceleration devices (RADs) is emerging. In these devices, the FPGA fine-grained reconfigurable fabric is a component of a bigger monolithic or multi-die system-in-package that can incorporate general-purpose software-programmable cores, domain-specialized accelerator blocks, and high-performance networks-on-chip (NoCs) for efficient communication between these system components. The integration of all these components in a RAD results in a huge design space and requires re-thinking the implementation of applications that need to be migrated from conventional FPGAs to these novel devices. In this work, we introduce RAD-Sim, an architecture simulator that allows rapid design space exploration for RADs and facilitates the study of complex interactions between their various components. We also present a case study that highlights the utility of RAD-Sim in re-designing applications for these novel RADs by mapping a state-of-the-art deep learning (DL) inference FPGA overlay to different RAD instances. Our case study illustrates how RAD-Sim can capture a wide variety of reconfigurable architectures, from conventional FPGAs to devices augmented with hard NoCs, specialized matrix-vector blocks, and 3D-stacked multi-die devices. In addition, we show that our tool can help architects evaluate the effect of specific RAD architecture parameters on end-to-end workload performance. Through RAD-Sim, we also show that novel RADs can potentially achieve 2.6x better performance on average compared to conventional FPGAs in the key DL application domain.
引用
收藏
页码:95067 / 95082
页数:16
相关论文
共 50 条
  • [1] A Whole New World: How to Architect Beyond-FPGA Reconfigurable Acceleration Devices?
    Boutros, Andrew
    More, Stephen
    Betz, Vaughn
    2023 33RD INTERNATIONAL CONFERENCE ON FIELD-PROGRAMMABLE LOGIC AND APPLICATIONS, FPL, 2023, : 265 - 270
  • [2] Reconfigurable Architecture for Deinterlacer based on Algorithm/Architecture Co-Design
    Lee, Gwo Giun
    Wang, Ming-Jiun
    Chen, Bo-Han
    Chen, JiunFu
    Jao, Ping-Keng
    Hsiao, Ching Jui
    Wei, Ling-Fei
    JOURNAL OF SIGNAL PROCESSING SYSTEMS FOR SIGNAL IMAGE AND VIDEO TECHNOLOGY, 2011, 63 (02): : 181 - 189
  • [3] Reconfigurable Architecture for Deinterlacer based on Algorithm/Architecture Co-Design
    Gwo Giun Lee
    Ming-Jiun Wang
    Bo-Han Chen
    JiunFu Chen
    Ping-Keng Jao
    Ching Jui Hsiao
    Ling-Fei Wei
    Journal of Signal Processing Systems, 2011, 63 : 181 - 189
  • [4] A design process for harware/software system co-design and its application to designing a reconfigurable FPGA
    Moreno, Felix
    Lopez, Ignacio
    Sanz, Ricardo
    13TH EUROMICRO CONFERENCE ON DIGITAL SYSTEM DESIGN: ARCHITECTURES, METHODS AND TOOLS, 2010, : 556 - 562
  • [5] Algorithm and Hardware Co-Design for Efficient PMCMC Acceleration on FPGA
    Wang, Fulong
    Peng, Shiyu
    Gong, Yuanhao
    Li, Minghan
    Liu, Shuanglong
    2024 9TH INTERNATIONAL CONFERENCE ON SIGNAL AND IMAGE PROCESSING, ICSIP, 2024, : 330 - 334
  • [6] NEURO INSPIRED RECONFIGURABLE ARCHITECTURE FOR HARDWARE/SOFTWARE CO-DESIGN
    Ghani, Arfan
    McDaid, Liam J.
    Belatreche, Ammar
    Ahmed, Waqar
    IEEE INTERNATIONAL SOC CONFERENCE, PROCEEDINGS, 2009, : 287 - +
  • [7] Integrated Heuristic for Hardware/Software Co-design on Reconfigurable Devices
    Liu, Peng
    Wu, Jigang
    Wang, Yongji
    2012 13TH INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED COMPUTING, APPLICATIONS, AND TECHNOLOGIES (PDCAT 2012), 2012, : 370 - 375
  • [8] An FPGA Architecture for ASIC-FPGA Co-Design to Streamline Processing of IDSs
    Sato, Tomoaki
    Chivapreecha, Sorawat
    Moungnoul, Phichet
    Higuchi, Kohji
    2016 INTERNATIONAL CONFERENCE ON COLLABORATION TECHNOLOGIES AND SYSTEMS (CTS), 2016, : 412 - 417
  • [9] FPGA integrated co-design
    Haskell, RE
    Hanna, DM
    2001 International Conference on Microelectronic Systems Education, Proceedings: DESIGNING MICROSYSTEMS IN THE NEW MILLENNIUM, 2001, : 30 - 31
  • [10] Algorithm-hardware Co-design of Attention Mechanism on FPGA Devices
    Zhang, Xinyi
    Wu, Yawen
    Zhou, Peipei
    Tang, Xulong
    Hu, Jingtong
    ACM TRANSACTIONS ON EMBEDDED COMPUTING SYSTEMS, 2021, 20 (05)