Mixed-granularity Parallel Coarse-grained Reconfigurable Architecture

被引:2
|
作者
Deng, Jinyi [1 ]
Zhang, Linyun [1 ]
Wang, Lei [1 ]
Liu, Jiawei [1 ]
Deng, Kexiang [1 ]
Tang, Shibin [1 ]
Gu, Jiangyuan [1 ]
Han, Boxiao [2 ]
Xu, Fei [2 ]
Liu, Leibo [1 ]
Wei, Shaojun [1 ]
Yin, Shouyi [1 ]
机构
[1] Tsinghua Univ, Sch Integrated Circuits, Beijing 100084, Peoples R China
[2] China Mobile Res Inst, Beijing 100053, Peoples R China
关键词
CGRA; Parallel; Spatial Architecture; Reconfigurable Accelerator;
D O I
10.1145/3489517.3530454
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Coarse-Grained Reconfigurable Architecture (CGRA) is a highperformance computing architecture. However, existing CGRA silicon utilization is low due to the lack of fine-grained parallelism inside Processing Element (PE) and general coarse-grained parallel approach on PE array. No fine-grained parallelism in PE not only leads to low silicon utilization of PE, but also makes the mapping loose and irregular. No generalized parallel method for the mapping cause low PE utilization on CGRA. Our goal is to design an execution model and a Mixed-granularity Parallel CGRA (MP-CGRA), which is capable to fine-grained parallelize operators excution in PEs and parallelize data transmission in channels, leading to a compact mapping. A coarse-grained general parallel method is proposed to vectorize the compact mapping. Evaluated with Machsuite, MPCGRA achieves an improvement of 104.65% silicon utilization on PE array and a 91.40% performance per area improvement compared with baseline-CGRA.
引用
收藏
页码:343 / 348
页数:6
相关论文
共 50 条
  • [1] A Coarse-Grained Reconfigurable Architecture for a PRET Machine
    Siqueira, Hadley
    Kreutz, Marcio
    [J]. 2018 VIII BRAZILIAN SYMPOSIUM ON COMPUTING SYSTEMS ENGINEERING (SBESC 2018), 2018, : 237 - 242
  • [2] Coarse-Grained Reconfigurable Computing with the Versat Architecture
    Lopes, Joao D.
    Vestias, Mario P.
    Duarte, Rui Policarpo
    Neto, Horacio C.
    de Sousa, Jose T.
    [J]. ELECTRONICS, 2021, 10 (06) : 1 - 23
  • [3] Efficient Scheduling Mapping Algorithm for Row Parallel Coarse-Grained Reconfigurable Architecture
    Naijin Chen
    Zhen Wang
    Ruixiang He
    Jianhui Jiang
    Fei Cheng
    Chenghao Han
    [J]. Tsinghua Science and Technology, 2021, 26 (05) : 724 - 735
  • [4] Efficient Scheduling Mapping Algorithm for Row Parallel Coarse-Grained Reconfigurable Architecture
    Chen, Naijin
    Wang, Zhen
    He, Ruixiang
    Jiang, Jianhui
    Cheng, Fei
    Han, Chenghao
    [J]. TSINGHUA SCIENCE AND TECHNOLOGY, 2021, 26 (05) : 724 - 735
  • [5] Mapping Parallel FFT Algorithm onto SmartCell Coarse-Grained Reconfigurable Architecture
    Liang, Cao
    Huang, Xinming
    [J]. IEICE TRANSACTIONS ON ELECTRONICS, 2010, E93C (03): : 407 - 415
  • [6] Mapping Parallel FFT Algorithm onto SmartCell Coarse-Grained Reconfigurable Architecture
    Liang, Cao
    Huang, Xinming
    [J]. 2009 20TH IEEE INTERNATIONAL CONFERENCE ON APPLICATION-SPECIFIC SYSTEMS, ARCHITECTURES AND PROCESSORS, 2009, : 231 - 234
  • [7] A fine-grained/coarse-grained heterogeneous reconfigurable DSP architecture based on parallel processing
    Yang, Y
    Mao, ZG
    Lai, FC
    [J]. PROCEEDINGS OF THE SECOND INTERNATIONAL SYMPOSIUM ON INSTRUMENTATION SCIENCE AND TECHNOLOGY, VOL 3, 2002, : 558 - 562
  • [8] Register file architecture optimization in a coarse-grained reconfigurable architecture
    Kwok, Z
    Wilton, SJE
    [J]. FCCM 2005: 13TH ANNUAL IEEE SYMPOSIUM ON FIELD-PROGRAMMABLE CUSTOM COMPUTING MACHINES, PROCEEDINGS, 2005, : 35 - 44
  • [9] Compiling parallel applications to Coarse-Grained Reconfigurable Architectures
    Tuhin, Mohammed Ashraful Alam
    Norvell, Theodore S.
    [J]. 2008 CANADIAN CONFERENCE ON ELECTRICAL AND COMPUTER ENGINEERING, VOLS 1-4, 2008, : 1649 - +
  • [10] COARSE-GRAINED DYNAMICALLY RECONFIGURABLE ARCHITECTURE WITH FLEXIBLE RELIABILITY
    Alnajjar, Dawood
    Ko, Younghun
    Imagawa, Takashi
    Konoura, Hiroaki
    Hiromoto, Masayuki
    Mitsuyama, Yukio
    Hashimoto, Masanori
    Ochi, Hiroyuki
    Onoye, Takao
    [J]. FPL: 2009 INTERNATIONAL CONFERENCE ON FIELD PROGRAMMABLE LOGIC AND APPLICATIONS, 2009, : 186 - +