Modulo scheduling for a fully-distributed clustered VLIW architecture

Cited by: 0
Authors
Sánchez, J [1]
González, A [1]
Affiliations
[1] Univ Politecn Cataluna, Dept Comp Architecture, Barcelona, Spain
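Keywords
DOI
Not available
Chinese Library Classification
TP3 [Computing technology, computer technology];
Discipline classification code
0812;
Abstract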
Clustering is an approach that many microprocessors have recently adopted to mitigate the increasing penalties of wire delays. In this work we propose a novel clustered VLIW architecture in which all resources, including the cache memory, are partitioned among clusters. A modulo scheduling scheme for this architecture is also proposed. The algorithm takes into account both register and memory inter-cluster communications, so that the final schedule yields a cluster assignment that favors cluster locality in cache references and register accesses. It has been evaluated for both 2- and 4-cluster configurations and for different numbers and latencies of inter-cluster buses. The proposed algorithm produces schedules with very low communication requirements and outperforms previous cluster-oriented schedulers.
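The idea of a cluster assignment that favors locality in both register and cache accesses can be illustrated with a small sketch. This is not the algorithm proposed in the paper, which the abstract does not detail; it is a simple greedy heuristic written for illustration. The names `assign_clusters`, `count_communications`, `home_cluster` and `slots_per_cluster`, and the cost model of one unit per inter-cluster transfer, are all assumptions.

```python
from collections import defaultdict


def assign_clusters(ops, reg_edges, home_cluster, num_clusters, slots_per_cluster):
    """Greedy cluster assignment (illustrative only, not the paper's scheduler).

    ops               -- operation names, in the order they are considered
    reg_edges         -- (producer, consumer) register dependences
    home_cluster      -- hypothetical map: memory op -> cluster whose cache
                         module is assumed to hold its data
    slots_per_cluster -- assumed capacity of each cluster (for example,
                         initiation interval times issue width)
    """
    neighbors = defaultdict(set)
    for src, dst in reg_edges:
        neighbors[src].add(dst)
        neighbors[dst].add(src)

    assignment = {}
    load = [0] * num_clusters
    for op in ops:
        best, best_key = None, None
        for c in range(num_clusters):
            if load[c] >= slots_per_cluster:
                continue  # cluster is full
            # Register communications: already-placed neighbors elsewhere.
            cost = sum(
                1 for n in neighbors[op]
                if n in assignment and assignment[n] != c
            )
            # Memory communication: data lives in another cluster's cache.
            if op in home_cluster and home_cluster[op] != c:
                cost += 1
            key = (cost, load[c])  # break ties toward the emptier cluster
            if best_key is None or key < best_key:
                best, best_key = c, key
        if best is None:
            raise ValueError("no free slot; a larger capacity is needed")
        assignment[op] = best
        load[best] += 1
    return assignment


def count_communications(assignment, reg_edges, home_cluster):
    """Return (register transfers, remote memory accesses) across clusters."""
    reg = sum(1 for s, d in reg_edges if assignment[s] != assignment[d])
    mem = sum(1 for op, c in home_cluster.items() if assignment[op] != c)
    return reg, mem


ops = ["ld_a", "ld_b", "add", "st_c"]
reg_edges = [("ld_a", "add"), ("ld_b", "add"), ("add", "st_c")]
home_cluster = {"ld_a": 0, "ld_b": 1, "st_c": 0}

tight = assign_clusters(ops, reg_edges, home_cluster, 2, slots_per_cluster=2)
loose = assign_clusters(ops, reg_edges, home_cluster, 2, slots_per_cluster=4)
print(tight, count_communications(tight, reg_edges, home_cluster))
print(loose, count_communications(loose, reg_edges, home_cluster))
```

In this toy loop body, the tighter capacity forces the store away from both its producer and its data, so the communication count rises. A real modulo scheduler would also have to account for bus latencies and the initiation interval, which this sketch ignores.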
Pages: 124-133
Page count: 10
Related papers
  • [1] Integrated Modulo Scheduling for Clustered VLIW Architectures
    Eriksson, Mattias V.
    Kessler, Christoph W.
    HIGH PERFORMANCE EMBEDDED ARCHITECTURES AND COMPILERS, PROCEEDINGS, 2009, 5409 : 65 - 79
  • [2] Modulo scheduling with integrated register spilling for clustered VLIW architectures
    Zalamea, J
    Llosa, J
    Ayguadé, E
    Valero, M
    34TH ACM/IEEE INTERNATIONAL SYMPOSIUM ON MICROARCHITECTURE, MICRO-34, PROCEEDINGS, 2001, : 160 - 169
  • [3] The effectiveness of loop unrolling for modulo scheduling in clustered VLIW architectures
    Sánchez, J
    González, A
    2000 INTERNATIONAL CONFERENCE ON PARALLEL PROCESSING, PROCEEDINGS, 2000, : 555 - 562
  • [4] Instruction scheduling algorithm for register file connectivity clustered VLIW architecture
    Institute of Microelectronics, Tsinghua University, Beijing 100084, China
    Jisuanji Xuebao, 2008, 1 (127-132):
  • [5] Instruction scheduling for clustered VLIW architectures
    Sánchez, J
    González, A
    13TH INTERNATIONAL SYMPOSIUM ON SYSTEM SYNTHESIS, PROCEEDINGS, 2000, : 41 - 46
  • [6] Instruction scheduling for clustered VLIW DSPs
    Leupers, R
    2000 INTERNATIONAL CONFERENCE ON PARALLEL ARCHITECTURES AND COMPILATION TECHNIQUES, PROCEEDINGS, 2000, : 291 - 300
  • [7] Compiler-directed ILP extraction for clustered VLIW/EPIC machines: Predication, speculation and modulo scheduling
    Pillai, S
    Jacome, MF
    DESIGN, AUTOMATION AND TEST IN EUROPE CONFERENCE AND EXHIBITION, PROCEEDINGS, 2003, : 422 - 427
  • [8] Register aware scheduling for distributed cache clustered architecture
    Wang, Z
    Hu, XS
    Sha, EHM
    ASP-DAC 2003: PROCEEDINGS OF THE ASIA AND SOUTH PACIFIC DESIGN AUTOMATION CONFERENCE, 2003, : 71 - 76
  • [9] Local scheduling techniques for memory coherence in a clustered VLIW processor with a distributed data cache
    Gibert, E
    Sánchez, J
    González, A
    CGO 2003: INTERNATIONAL SYMPOSIUM ON CODE GENERATION AND OPTIMIZATION, 2003, : 193 - 203
  • [10] Pragmatic integrated scheduling for clustered VLIW architectures
    Nagpal, Rahul
    Srikant, Y. N.
    SOFTWARE-PRACTICE & EXPERIENCE, 2008, 38 (03): : 227 - 257