FDRA: A Framework for a Dynamically Reconfigurable Accelerator Supporting Multi-Level Parallelism

被引：0

作者：

Qiu, Yunhui ^{[1
]}

Mao, Yiqing ^{[1
]}

Gao, Xuchen ^{[1
]}

Chen, Sichao ^{[1
]}

Li, Jiangnan ^{[1
]}

Yin, Wenbo ^{[1
]}

Wang, Lingli ^{[1
]}

机构：

[1] Fudan Univ, State Key Lab ASIC & Syst, 825 Zhangheng Rd, Shanghai 201203, Peoples R China

来源：

ACM TRANSACTIONS ON RECONFIGURABLE TECHNOLOGY AND SYSTEMS | 2024年 / 17卷 / 01期

基金：

中国国家自然科学基金;

关键词：

CGRA; dynamically reconfigurable accelerator; instruction-level parallelism;

D O I：

10.1145/3614224

中图分类号：

TP3 [计算技术、计算机技术];

学科分类号：

0812 ;

摘要：

Coarse-grained reconfigurable architectures (CGRAs) have emerged as promising accelerators due to their high flexibility and energy efficiency. However, existing open source works often lack integration of CGRAs with CPU systems and corresponding toolchains. Moreover, there is rare support for the accelerator instruction pipelining to overlap data communication, computation, and configuration across multiple tasks. In this article, we propose FDRA, an open source exploration framework for a heterogeneous system-on-chip (SoC) with a RISC-V processor and a dynamically reconfigurable accelerator (DRA) supporting loop, instruction, and task levels of parallelism. FDRA encompasses parameterized SoC modeling, Verilog generation, source-to-source application code transformation using frontend and DRA compilers, SoC simulation, and FPGA prototyping. FDRA incorporates the extraction of periodic accumulative operators and multi-dimensional linear load/store operators from nested loops. The DRA enables accessing the shared L2 cache with virtual addresses and supports direct memory access with arbitrary start addresses and data lengths. Integrated into the RISC-V Rocket SoC, our DRA achieves a remarkable 55x acceleration for loop kernels and improves energy efficiency by 29x. Compared to state-of-the-art RISC-V vector units, our DRA demonstrates a 2.9x speed improvement and 3.5x greater energy efficiency. In contrast to previous CGRA+RISC-V SoCs, our SoC achieves a minimum speedup of 5.2x.

引用

页数：26

共 50 条

[1] MARS: Exploiting Multi-Level Parallelism for DNN Workloads on Adaptive Multi-Accelerator Systems
Shen, Guan
Zhao, Jieru
Wang, Zeke
Lin, Zhe
Ding, Wenchao
Wu, Chentao
Chen, Quan
Guo, Minyi
[J]. 2023 60TH ACM/IEEE DESIGN AUTOMATION CONFERENCE, DAC, 2023,
[2] Exploiting operation level parallelism through dynamically reconfigurable datapaths
Huang, ZN
Malik, S
[J]. 39TH DESIGN AUTOMATION CONFERENCE, PROCEEDINGS 2002, 2002, : 337 - 342
[3] Multi-Level Parallelism for the Cardiac Bidomain Equations
Carolina Ribeiro Xavier
Rafael Sachetto Oliveira
Vinicius da Fonseca Vieira
Rodrigo Weber dos Santos
Wagner Meira
[J]. International Journal of Parallel Programming, 2009, 37 : 572 - 592
[4] Multi-level parallelism in the computational modeling of the heart
Xavier, Carolina
Sachetto, Rafael
Vieira, Vinicius
dos Santos, Rodrigo Weber
Meira, Wagner
[J]. 19TH INTERNATIONAL SYMPOSIUM ON COMPUTER ARCHITECTURE AND HIGH PERFORMANCE COMPUTING, PROCEEDINGS, 2007, : 3 - +
[5] Multi-Level Parallelism for the Cardiac Bidomain Equations
Xavier, Carolina Ribeiro
Oliveira, Rafael Sachetto
Vieira, Vinicius da Fonseca
dos Santos, Rodrigo Weber
Meira, Wagner, Jr.
[J]. INTERNATIONAL JOURNAL OF PARALLEL PROGRAMMING, 2009, 37 (06) : 572 - 592
[6] Towards a Multi-level Framework for Supporting Systematic Review - a pilot study
Li, Dingcheng
Wang, Zhen
Shen, Feichen
Murad, Mohammad Hassan
Liu, Hongfang
[J]. 2014 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE (BIBM), 2014,
[7] POAS: a framework for exploiting accelerator level parallelism in heterogeneous environments
Martinez, Pablo Antonio
Bernabe, Gregorio
Garcia, Jose Manuel
[J]. JOURNAL OF SUPERCOMPUTING, 2024, 80 (10): : 14666 - 14693
[8] Multi-level parallelism for protein prediction on the parallel computers
Chen, J.
Mo, Z. Y.
Song, L.
[J]. MOLECULAR & CELLULAR PROTEOMICS, 2004, 3 (10) : S248 - S248
[9] Exploring multi-level parallelism in cellular automata networks
Calidonna, CR
Di Napoli, C
Giordano, M
Furnari, MM
[J]. HIGH PERFORMANCE COMPUTING, PROCEEDINGS, 2000, 1940 : 336 - 343
[10] The Introduction of Multi-level Parallelism Solvers in Multibody Dynamics
Andreev, Andrey
Egunov, Vitaly
Movchan, Evgenia
Cherednikov, Nikita
Kharkov, Egor
Kohtashvili, Natalia
[J]. CREATIVITY IN INTELLIGENT TECHNOLOGIES AND DATA SCIENCE, PT II, 2019, 1084 : 166 - 180

← 1 2 3 4 5 →