A coarse-grained reconfigurable computing architecture with loop self-pipelining

被引:2
|
作者
Dou Yong [1 ]
Wu GuiMing [1 ]
Xu JinHui [1 ]
Zhou XingMing [1 ]
机构
[1] Natl Univ Def Technol, Natl Lab Parallel & Distributed Proc, Changsha 410073, Hunan, Peoples R China
来源
基金
国家高技术研究发展计划(863计划); 中国国家自然科学基金;
关键词
reconfigurable computing; loop pipelining; data driven; register promotion;
D O I
10.1007/s11432-008-0146-6
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Reconfigurable computing tries to achieve the balance between high efficiency of custom computing and flexibility of general-purpose computing. This paper presents the implementation techniques in LEAP, a coarse-grained reconfigurable array, and proposes a speculative execution mechanism for dynamic loop scheduling with the goal of one iteration per cycle and implementation techniques to support decoupling synchronization between the token generator and the collector. This paper also introduces the techniques of exploiting both data dependences of intra- and inter-iteration, with the help of two instructions for special data reuses in the loop-carried dependences. The experimental results show that the number of memory accesses reaches on average 3% of an RISC processor simulator with no memory optimization. In a practical image matching application, LEAP architecture achieves about 34 times of speedup in execution cycles, compared with general-purpose processors.
引用
收藏
页码:575 / 587
页数:13
相关论文
共 50 条
  • [41] Design and implementation of a coarse-grained dynamically reconfigurable hardware architecture
    Becker, J
    Pionteck, T
    Habermann, C
    Glesner, M
    IEEE COMPUTER SOCIETY WORKSHOP ON VLSI 2001, PROCEEDINGS, 2001, : 41 - 46
  • [42] CORAL: Coarse-grained Reconfigurable Architecture for ConvoLutional Neural Networks
    Yuan, Zhe
    Liu, Yongpan
    Yue, Jinshan
    Li, Jinyang
    Yang, Huazhong
    2017 IEEE/ACM INTERNATIONAL SYMPOSIUM ON LOW POWER ELECTRONICS AND DESIGN (ISLPED), 2017,
  • [43] Architecture exploration and tools for pipelined coarse-grained reconfigurable arrays
    Stock, Florian
    Koch, Andreas
    2006 INTERNATIONAL CONFERENCE ON FIELD PROGRAMMABLE LOGIC AND APPLICATIONS, PROCEEDINGS, 2006, : 53 - 58
  • [44] Design of Coarse-Grained Dynamically Reconfigurable Architecture for DSP Applications
    Zhang, Chenxin
    Lenart, Thomas
    Svensson, Henrik
    Owall, Viktor
    2009 INTERNATIONAL CONFERENCE ON RECONFIGURABLE COMPUTING AND FPGAS, 2009, : 338 - 343
  • [45] Allocating resources based on a model of coarse-grained reconfigurable architecture
    Zhang, Huizhen
    Pan, Yubiao
    Zhang, Yiwen
    Wang, Cheng
    JOURNAL OF ENGINEERING-JOE, 2019, 2019 (10): : 7272 - 7278
  • [46] Accurate constraints aware mapping on Coarse-Grained Reconfigurable Architecture
    Zhang, Peng
    Luo, Huiqiang
    Man, K. L.
    2011 NINTH IEEE INTERNATIONAL SYMPOSIUM ON PARALLEL AND DISTRIBUTED PROCESSING WITH APPLICATIONS WORKSHOPS (ISPAW), 2011, : 39 - 44
  • [47] Low Power Reconfiguration Technique for Coarse-Grained Reconfigurable Architecture
    Kim, Yoonjin
    Mahapatra, Rabi N.
    Park, Ilhyun
    Choi, Kiyoung
    IEEE TRANSACTIONS ON VERY LARGE SCALE INTEGRATION (VLSI) SYSTEMS, 2009, 17 (05) : 593 - 603
  • [48] Exploiting Outer Loop Parallelism of Nested Loop on Coarse-Grained Reconfigurable Architectures
    Liu, Dajiang
    Yin, Shouyi
    Liu, Leibo
    Wei, Shaojun
    2014 IEEE 22ND ANNUAL INTERNATIONAL SYMPOSIUM ON FIELD-PROGRAMMABLE CUSTOM COMPUTING MACHINES (FCCM 2014), 2014, : 32 - 32
  • [49] Conflict-Free Loop Mapping for Coarse-Grained Reconfigurable Architecture with Multi-Bank Memory
    Yin, Shouyi
    Yao, Xianqing
    Lu, Tianyi
    Liu, Dajiang
    Gu, Jiangyuan
    Liu, Leibo
    Wei, Shaojun
    IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2017, 28 (09) : 2471 - 2485
  • [50] The Research of Loop Unrolling for Coarse-Grained Reconfigurable Block Encryption Array
    Yang, Yu-Hang
    Li, Wei
    Xu, Jin-Fu
    Liu, Lu
    2016 13TH IEEE INTERNATIONAL CONFERENCE ON SOLID-STATE AND INTEGRATED CIRCUIT TECHNOLOGY (ICSICT), 2016, : 1268 - 1271