FDRA: A Framework for a Dynamically Reconfigurable Accelerator Supporting Multi-Level Parallelism

被引:0
|
作者
Qiu, Yunhui [1 ]
Mao, Yiqing [1 ]
Gao, Xuchen [1 ]
Chen, Sichao [1 ]
Li, Jiangnan [1 ]
Yin, Wenbo [1 ]
Wang, Lingli [1 ]
机构
[1] Fudan Univ, State Key Lab ASIC & Syst, 825 Zhangheng Rd, Shanghai 201203, Peoples R China
基金
中国国家自然科学基金;
关键词
CGRA; dynamically reconfigurable accelerator; instruction-level parallelism;
D O I
10.1145/3614224
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Coarse-grained reconfigurable architectures (CGRAs) have emerged as promising accelerators due to their high flexibility and energy efficiency. However, existing open source works often lack integration of CGRAs with CPU systems and corresponding toolchains. Moreover, there is rare support for the accelerator instruction pipelining to overlap data communication, computation, and configuration across multiple tasks. In this article, we propose FDRA, an open source exploration framework for a heterogeneous system-on-chip (SoC) with a RISC-V processor and a dynamically reconfigurable accelerator (DRA) supporting loop, instruction, and task levels of parallelism. FDRA encompasses parameterized SoC modeling, Verilog generation, source-to-source application code transformation using frontend and DRA compilers, SoC simulation, and FPGA prototyping. FDRA incorporates the extraction of periodic accumulative operators and multi-dimensional linear load/store operators from nested loops. The DRA enables accessing the shared L2 cache with virtual addresses and supports direct memory access with arbitrary start addresses and data lengths. Integrated into the RISC-V Rocket SoC, our DRA achieves a remarkable 55x acceleration for loop kernels and improves energy efficiency by 29x. Compared to state-of-the-art RISC-V vector units, our DRA demonstrates a 2.9x speed improvement and 3.5x greater energy efficiency. In contrast to previous CGRA+RISC-V SoCs, our SoC achieves a minimum speedup of 5.2x.
引用
收藏
页数:26
相关论文
共 50 条
  • [1] MARS: Exploiting Multi-Level Parallelism for DNN Workloads on Adaptive Multi-Accelerator Systems
    Shen, Guan
    Zhao, Jieru
    Wang, Zeke
    Lin, Zhe
    Ding, Wenchao
    Wu, Chentao
    Chen, Quan
    Guo, Minyi
    [J]. 2023 60TH ACM/IEEE DESIGN AUTOMATION CONFERENCE, DAC, 2023,
  • [2] Exploiting operation level parallelism through dynamically reconfigurable datapaths
    Huang, ZN
    Malik, S
    [J]. 39TH DESIGN AUTOMATION CONFERENCE, PROCEEDINGS 2002, 2002, : 337 - 342
  • [3] Multi-Level Parallelism for the Cardiac Bidomain Equations
    Carolina Ribeiro Xavier
    Rafael Sachetto Oliveira
    Vinicius da Fonseca Vieira
    Rodrigo Weber dos Santos
    Wagner Meira
    [J]. International Journal of Parallel Programming, 2009, 37 : 572 - 592
  • [4] Multi-level parallelism in the computational modeling of the heart
    Xavier, Carolina
    Sachetto, Rafael
    Vieira, Vinicius
    dos Santos, Rodrigo Weber
    Meira, Wagner
    [J]. 19TH INTERNATIONAL SYMPOSIUM ON COMPUTER ARCHITECTURE AND HIGH PERFORMANCE COMPUTING, PROCEEDINGS, 2007, : 3 - +
  • [5] Multi-Level Parallelism for the Cardiac Bidomain Equations
    Xavier, Carolina Ribeiro
    Oliveira, Rafael Sachetto
    Vieira, Vinicius da Fonseca
    dos Santos, Rodrigo Weber
    Meira, Wagner, Jr.
    [J]. INTERNATIONAL JOURNAL OF PARALLEL PROGRAMMING, 2009, 37 (06) : 572 - 592
  • [6] Towards a Multi-level Framework for Supporting Systematic Review - a pilot study
    Li, Dingcheng
    Wang, Zhen
    Shen, Feichen
    Murad, Mohammad Hassan
    Liu, Hongfang
    [J]. 2014 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE (BIBM), 2014,
  • [7] POAS: a framework for exploiting accelerator level parallelism in heterogeneous environments
    Martinez, Pablo Antonio
    Bernabe, Gregorio
    Garcia, Jose Manuel
    [J]. JOURNAL OF SUPERCOMPUTING, 2024, 80 (10): : 14666 - 14693
  • [8] Multi-level parallelism for protein prediction on the parallel computers
    Chen, J.
    Mo, Z. Y.
    Song, L.
    [J]. MOLECULAR & CELLULAR PROTEOMICS, 2004, 3 (10) : S248 - S248
  • [9] Exploring multi-level parallelism in cellular automata networks
    Calidonna, CR
    Di Napoli, C
    Giordano, M
    Furnari, MM
    [J]. HIGH PERFORMANCE COMPUTING, PROCEEDINGS, 2000, 1940 : 336 - 343
  • [10] The Introduction of Multi-level Parallelism Solvers in Multibody Dynamics
    Andreev, Andrey
    Egunov, Vitaly
    Movchan, Evgenia
    Cherednikov, Nikita
    Kharkov, Egor
    Kohtashvili, Natalia
    [J]. CREATIVITY IN INTELLIGENT TECHNOLOGIES AND DATA SCIENCE, PT II, 2019, 1084 : 166 - 180