Architecture and Application Co-Design for Beyond-FPGA Reconfigurable Acceleration Devices

被引:5
|
作者
Boutros, Andrew [1 ,2 ]
Nurvitadhi, Eriko [2 ]
Betz, Vaughn [1 ]
机构
[1] Univ Toronto, Dept Elect & Comp Engn, Toronto, ON M5S 3G4, Canada
[2] Intel Corp, Programmable Solut Grp, Santa Clara, CA 95054 USA
基金
加拿大自然科学与工程研究理事会;
关键词
Deep learning; field-programmable gate arrays; hardware acceleration; network-on-chip; reconfigurable computing; EMBEDDED NETWORKS; CHIP;
D O I
10.1109/ACCESS.2022.3204664
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In recent years, field-programmable gate arrays (FPGAs) have been increasingly deployed in datacenters as programmable accelerators that can offer software-like flexibility and custom-hardware-like efficiency for key datacenter workloads. To improve the efficiency of FPGAs for these new datacenter use cases and data-intensive applications, a new class of reconfigurable acceleration devices (RADs) is emerging. In these devices, the FPGA fine-grained reconfigurable fabric is a component of a bigger monolithic or multi-die system-in-package that can incorporate general-purpose software-programmable cores, domain-specialized accelerator blocks, and high-performance networks-on-chip (NoCs) for efficient communication between these system components. The integration of all these components in a RAD results in a huge design space and requires re-thinking the implementation of applications that need to be migrated from conventional FPGAs to these novel devices. In this work, we introduce RAD-Sim, an architecture simulator that allows rapid design space exploration for RADs and facilitates the study of complex interactions between their various components. We also present a case study that highlights the utility of RAD-Sim in re-designing applications for these novel RADs by mapping a state-of-the-art deep learning (DL) inference FPGA overlay to different RAD instances. Our case study illustrates how RAD-Sim can capture a wide variety of reconfigurable architectures, from conventional FPGAs to devices augmented with hard NoCs, specialized matrix-vector blocks, and 3D-stacked multi-die devices. In addition, we show that our tool can help architects evaluate the effect of specific RAD architecture parameters on end-to-end workload performance. Through RAD-Sim, we also show that novel RADs can potentially achieve 2.6x better performance on average compared to conventional FPGAs in the key DL application domain.
引用
收藏
页码:95067 / 95082
页数:16
相关论文
共 50 条
  • [31] Design and FPGA Implementation of a Reconfigurable 1024-Channel Channelization Architecture for SDR Application
    Liu, Xue
    Wang, Ze-Ke
    Deng, Qing-Xu
    IEEE TRANSACTIONS ON VERY LARGE SCALE INTEGRATION (VLSI) SYSTEMS, 2016, 24 (07) : 2449 - 2461
  • [32] DESIGN OF A RECONFIGURABLE DYNAMIC TESTBED FOR CO-DESIGN METHOD VALIDATION
    Deshmukh, Anand P.
    Lohan, Danny J.
    Allison, James T.
    PROCEEDINGS OF THE ASME INTERNATIONAL DESIGN ENGINEERING TECHNICAL CONFERENCES AND COMPUTERS AND INFORMATION IN ENGINEERING CONFERENCE, 2017, VOL 2A, 2017,
  • [33] Toward Efficient Co-Design of CNN Quantization and HW Architecture on FPGA Hybrid-Accelerator
    Zhang, Yiran
    Li, Guiying
    Yuan, Bo
    2024 INTERNATIONAL SYMPOSIUM OF ELECTRONICS DESIGN AUTOMATION, ISEDA 2024, 2024, : 678 - 683
  • [34] Algorithm/Architecture Co-Design for Energy-Efficient Acceleration of Multi-Task DNN
    Shin, Jaekang
    Choi, Seungkyu
    Ra, Jongwoo
    Kim, Lee -Sup
    PROCEEDINGS OF THE 59TH ACM/IEEE DESIGN AUTOMATION CONFERENCE, DAC 2022, 2022, : 253 - 258
  • [35] A platform for co-design and co-synthesis based on FPGA
    Mosanya, E
    Goeke, M
    Linder, J
    Perrier, JY
    Rampogna, F
    Sanchez, E
    SEVENTH IEEE INTERNATIONAL WORKSHOP ON RAPID SYSTEM PROTOTYPING, PROCEEDINGS: SHORTENING THE PATH FROM SPECIFICATION TO PROTOTYPE, 1996, : 11 - 16
  • [36] EPICURE: A partitioning and co-design framework for reconfigurable computing
    Diguet, Jean Philippe
    Gogniat, Guy
    Philippe, Jean Luc
    Le Moullec, Yannick
    Bilavarn, Sebastien
    Gamrat, Christian
    Ben Chehida, Karim
    Auguin, Michel
    Fornari, Xavier
    Kajfasz, Philippe
    MICROPROCESSORS AND MICROSYSTEMS, 2006, 30 (06) : 367 - 387
  • [37] Algorithm and Hardware Co-design for Reconfigurable CNN Accelerator
    Fan, Hongxiang
    Ferianc, Martin
    Que, Zhiqiang
    Li, He
    Liu, Shuanglong
    Niu, Xinyu
    Luk, Wayne
    27TH ASIA AND SOUTH PACIFIC DESIGN AUTOMATION CONFERENCE, ASP-DAC 2022, 2022, : 250 - 255
  • [38] A CO-DESIGN PLATFORM FOR ALGORITHM/ARCHITECTURE DESIGN EXPLORATION
    Lucarz, Christophe
    Mattavelli, Marco
    Dubois, Julien
    2008 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, VOLS 1-4, 2008, : 1069 - +
  • [39] Application specific reconfigurable architecture design
    Akoglu, A
    Panchanathan, S
    ERSA'05: Proceedings of the 2005 International Conference on Engineering of Reconfigurable Systems and Algorithms, 2005, : 247 - 250
  • [40] Robot Co-design: Beyond the Monotone Case
    Carlone, Luca
    Pinciroli, Carlo
    2019 INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2019, : 3024 - 3030