Modulo scheduling for a fully-distributed clustered VLIW architecture

Cited by: 0

Authors:

Sánchez, J [1]

González, A [1]

Affiliation:

[1] Univ Politecn Cataluna, Dept Comp Architecture, Barcelona, Spain

Source:

33RD ANNUAL IEEE/ACM INTERNATIONAL SYMPOSIUM ON MICROARCHITECTURE: MICRO-33 2000, PROCEEDINGS | 2000

DOI:

Not available

Chinese Library Classification (CLC):

TP3 [Computing technology, computer technology]

Discipline classification code:

0812

Abstract:

Clustering is an approach that many recent microprocessors adopt to mitigate the increasing penalties of wire delays. In this work we propose a novel clustered VLIW architecture in which all resources, including the cache memory, are partitioned among clusters. A modulo scheduling scheme for this architecture is also proposed. The algorithm takes into account both register and memory inter-cluster communications, so that the final schedule yields a cluster assignment that favors cluster locality in cache references and register accesses. It has been evaluated for both 2- and 4-cluster configurations and for different numbers and latencies of inter-cluster buses. The proposed algorithm produces schedules with very low communication requirements and outperforms previous cluster-oriented schedulers.

Pages: 124-133

Page count: 10

50 entries in total

[31] Masterless Coded Computing: A Fully-Distributed Coded FFT Algorithm
Jeong, Haewon
Low, Tze Meng
Grover, Pulkit
2018 56TH ANNUAL ALLERTON CONFERENCE ON COMMUNICATION, CONTROL, AND COMPUTING (ALLERTON), 2018, : 887 - 894
[32] A distributed control path architecture for VLIW processors
Zhong, HT
Fan, K
Mahlke, S
Schlansker, M
PACT 2005: 14TH INTERNATIONAL CONFERENCE ON PARALLEL ARCHITECTURES AND COMPILATION TECHNIQUES, 2005, : 197 - 206
[33] A graph matching based integrated scheduling framework for clustered VLIW processors
Nagpal, R
Srikant, YN
2004 INTERNATIONAL CONFERENCE ON PARALLEL PROCESSING WORKSHOPS, PROCEEDINGS, 2004, : 530 - 537
[34] Instruction scheduling for a clustered VLIW processor with a word-interleaved cache
Gibert, Enric
Sanchez, Jesus
Gonzalez, Antonio
CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2006, 18 (11) : 1391 - 1411
[35] Instruction scheduling with k-successor tree for clustered VLIW processors
Zhang, Xuemeng
Wu, Hui
Xue, Jingling
DESIGN AUTOMATION FOR EMBEDDED SYSTEMS, 2013, 17 (02) : 439 - 458
[36] Effective instruction scheduling techniques for an interleaved cache clustered VLIW processor
Gibert, E
Sánchez, J
González, A
35TH ANNUAL IEEE/ACM INTERNATIONAL SYMPOSIUM ON MICROARCHITECTURE (MICRO-35), PROCEEDINGS, 2002, : 123 - 133
[37] Instruction scheduling with k-successor tree for clustered VLIW processors
Xuemeng Zhang
Hui Wu
Jingling Xue
Design Automation for Embedded Systems, 2013, 17 : 439 - 458
[38] A unified modulo scheduling and register allocation technique for clustered processors
Codina, JM
Sánchez, J
González, A
2001 INTERNATIONAL CONFERENCE ON PARALLEL ARCHITECTURES AND COMPILATION TECHNIQUES, PROCEEDINGS, 2001, : 175 - 184
[39] Uniform Circle Formation by Asynchronous Robots: A Fully-Distributed Approach
Jiang, Shan
Cao, Jiannong
Wang, Jia
Stojmenovic, Milos
Bourgeois, Julien
2017 26TH INTERNATIONAL CONFERENCE ON COMPUTER COMMUNICATION AND NETWORKS (ICCCN 2017), 2017
[40] A New Fully-Distributed Arbitration-Based Membership Protocol
Ahsan, Shegufta Bakht
Gupta, Indranil
IEEE INFOCOM 2020 - IEEE CONFERENCE ON COMPUTER COMMUNICATIONS, 2020, : 716 - 725
