Exploiting Parallelism of Imperfect Nested Loops on Coarse-Grained Reconfigurable Architectures

Cited by: 7
Authors
Yin, Shouyi [1]
Lin, Xinhan [1]
Liu, Leibo [2]
Wei, Shaojun [1]
Affiliations
[1] Tsinghua Univ, Inst Microelect, Beijing, Peoples R China
[2] Tsinghua Univ, Inst Microelect, Natl Lab Informat Sci & Technol, Beijing, Peoples R China
Keywords
CGRA; software pipelining; imperfect nested loop; sibling inner loops; outer-level pipelining; kernel compression
DOI
10.1109/TPDS.2016.2531678
Chinese Library Classification (CLC) number
TP301 [Theory, Methods]
Subject classification code
081202
Abstract
Coarse-grained reconfigurable architecture (CGRA) is a promising parallel computing platform that provides high performance, high power efficiency and flexibility. However, for imperfect nested loops, existing loop mapping methods often result in low execution performance and poor hardware utilization. To tackle this problem, this paper makes three contributions: 1) a highly effective and general approach to mapping imperfect nested loops onto CGRAs; 2) a global optimization strategy to search for the optimal initiation intervals (IIs); 3) a powerful kernel compression method to reduce oversized kernels. Experimental results show that our approach reduces the total computing latency by 20.5, 58.5 and 73.2 percent compared with state-of-the-art approaches on 2 x 2, 4 x 4 and 8 x 8 CGRAs, respectively. Moreover, the compilation time and configuration context size are acceptable in practice.
Pages: 3199-3213
Page count: 15