Modulo scheduling for a fully-distributed clustered VLIW architecture

被引:0
|
作者
Sánchez, J [1 ]
González, A [1 ]
机构
[1] Univ Politecn Cataluna, Dept Comp Architecture, Barcelona, Spain
关键词
D O I
暂无
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Clustering is an approach that many microprocessors are adopting in recent times in order to mitigate the increasing penalties of wire delays. In this work we propose a novel clustered VLIW architecture which has all its resources partitioned among clusters, including the cache memory. A modulo scheduling scheme for this architecture is also proposed. This algorithm takes into account both register and memory inter-cluster communications so that the final schedule results in a cluster assignment that favors cluster locality in cache references and register accesses. It has been evaluated for both 2- and 4-cluster configurations and for differing number and latencies of inter-cluster buses. The proposed algorithm produces schedules with very low communication requirements and outperforms previous cluster-oriented schedulers.
引用
收藏
页码:124 / 133
页数:10
相关论文
共 50 条
  • [31] Masterless Coded Computing: A Fully-Distributed Coded FFT Algorithm
    Jeong, Haewon
    Low, Tze Meng
    Grover, Pulkit
    2018 56TH ANNUAL ALLERTON CONFERENCE ON COMMUNICATION, CONTROL, AND COMPUTING (ALLERTON), 2018, : 887 - 894
  • [32] A distributed control path architecture for VLIW processors
    Zhong, HT
    Fan, K
    Mahlke, S
    Schlansker, M
    PACT 2005: 14TH INTERNATIONAL CONFERENCE ON PARALLEL ARCHITECTURES AND COMPILATION TECHNIQUES, 2005, : 197 - 206
  • [33] A graph matching based integrated scheduling framework for clustered VLIW processors
    Nagpal, R
    Srikant, YN
    2004 INTERNATIONAL CONFERENCE ON PARALLEL PROCESSING WORKSHOPS, PROCEEDINGS, 2004, : 530 - 537
  • [34] Instruction scheduling for a clustered VLIW processor with a word-interleaved cache
    Gibert, Enric
    Sanchez, Jesus
    Gonzalez, Antonio
    CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2006, 18 (11): : 1391 - 1411
  • [35] Instruction scheduling with k-successor tree for clustered VLIW processors
    Zhang, Xuemeng
    Wu, Hui
    Xue, Jingling
    DESIGN AUTOMATION FOR EMBEDDED SYSTEMS, 2013, 17 (02) : 439 - 458
  • [36] Effective instruction scheduling techniques for an interleaved cache clustered VLIW processor
    Gibert, E
    Sánchez, J
    González, A
    35TH ANNUAL IEEE/ACM INTERNATIONAL SYMPOSIUM ON MICROARCHITECTURE (MICRO-35), PROCEEDINGS, 2002, : 123 - 133
  • [37] Instruction scheduling with k-successor tree for clustered VLIW processors
    Xuemeng Zhang
    Hui Wu
    Jingling Xue
    Design Automation for Embedded Systems, 2013, 17 : 439 - 458
  • [38] A unified modulo scheduling and register allocation technique for clustered processors
    Codina, JM
    Sánchez, J
    González, A
    2001 INTERNATIONAL CONFERENCE ON PARALLEL ARCHITECTURES AND COMPILATION TECHNIQUES, PROCEEDINGS, 2001, : 175 - 184
  • [39] Uniform Circle Formation by Asynchronous Robots: A Fully-Distributed Approach
    Jiang, Shan
    Cao, Jiannong
    Wang, Jia
    Stojmenovic, Milos
    Bourgeois, Julien
    2017 26TH INTERNATIONAL CONFERENCE ON COMPUTER COMMUNICATION AND NETWORKS (ICCCN 2017), 2017,
  • [40] A New Fully-Distributed Arbitration-Based Membership Protocol
    Ahsan, Shegufta Bakht
    Gupta, Indranil
    IEEE INFOCOM 2020 - IEEE CONFERENCE ON COMPUTER COMMUNICATIONS, 2020, : 716 - 725