Effective instruction scheduling techniques for an interleaved cache clustered VLIW processor

被引:8
|
作者
Gibert, E [1 ]
Sánchez, J [1 ]
González, A [1 ]
机构
[1] Univ Politecn Cataluna, Dept Comp Architecture, ES-08034 Barcelona, Spain
关键词
D O I
10.1109/MICRO.2002.1176244
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Clustering is a common, technique to overcome the wire delay problem incurred by the evolution of technology. Fully-distributed architectures, where the register file, the functional units and the data cache are partitioned,, are particularly effective to deal with these constraints and besides they are very scalable. In this paper effective instruction scheduling techniques for a clustered VLIW processor with a word-interleaved cache are proposed Such scheduling techniques,,rely on: (i) loop unrolling and variable alignment to increase the percentage of local accesses, (ii) a latency assignment process to schedule memory operations with an appropriate latency and (iii) different heuristics to assign instructions to clusters.. In particular the number of local accesses is increased,by more than 25% if these techniques are used and the ratio of stall time over compute time is small. Next, the main source, of remote accesses and stall time is investigated Stall time is mainly due to remote hits, and Attraction Buffers are used to increase local accesses and reduce stall time. Stall time is reduced. by 29% and 34% depending on the scheduling heuristic. IPC results for a word-interleaved cache clustered VLIW processor are similar to those of the multiVLIW (a cache-coherent clustered processor with a more complex hardware design), and are 10% and 5% better (depending on the scheduling heuristic) than the IPC for a clustered processor with a unified cache.
引用
收藏
页码:123 / 133
页数:11
相关论文
共 50 条
  • [1] Instruction scheduling for a clustered VLIW processor with a word-interleaved cache
    Gibert, Enric
    Sanchez, Jesus
    Gonzalez, Antonio
    [J]. CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2006, 18 (11): : 1391 - 1411
  • [2] Local scheduling techniques for memory coherence in a clustered VLIW processor with a distributed data cache
    Gibert, E
    Sánchez, J
    González, A
    [J]. CGO 2003: INTERNATIONAL SYMPOSIUM ON CODE GENERATION AND OPTIMIZATION, 2003, : 193 - 203
  • [3] Instruction scheduling for clustered VLIW architectures
    Sánchez, J
    González, A
    [J]. 13TH INTERNATIONAL SYMPOSIUM ON SYSTEM SYNTHESIS, PROCEEDINGS, 2000, : 41 - 46
  • [4] Instruction scheduling for clustered VLIW DSPs
    Leupers, R
    [J]. 2000 INTERNATIONAL CONFERENCE ON PARALLEL ARCHITECTURES AND COMPILATION TECHNIQUES, PROCEEDINGS, 2000, : 291 - 300
  • [5] An Efficient Heuristic for Instruction Scheduling on Clustered VLIW Processors
    Zhang, Xuemeng
    Wu, Hui
    Xue, Jingling
    [J]. PROCEEDINGS OF THE PROCEEDINGS OF THE 14TH INTERNATIONAL CONFERENCE ON COMPILERS, ARCHITECTURES AND SYNTHESIS FOR EMBEDDED SYSTEMS (CASES '11), 2011, : 35 - 44
  • [6] Aligned Scheduling: Cache-Efficient Instruction Scheduling for VLIW Processors
    Porpodas, Vasileios
    Cintra, Marcelo
    [J]. LANGUAGES AND COMPILERS FOR PARALLEL COMPUTING, LCPC 2013, 2014, 8664 : 275 - 291
  • [7] Simultaneous Reconfiguration of Issue-width and Instruction Cache for a VLIW Processor
    Anjam, Fakhar
    Wong, Stephan
    Carro, Luigi
    Nazar, Gabriel L.
    Rutzig, Mateus B.
    [J]. 2012 INTERNATIONAL CONFERENCE ON EMBEDDED COMPUTER SYSTEMS (SAMOS): ARCHITECTURES, MODELING AND SIMULATION, 2012, : 183 - 192
  • [8] Instruction scheduling with k-successor tree for clustered VLIW processors
    Zhang, Xuemeng
    Wu, Hui
    Xue, Jingling
    [J]. DESIGN AUTOMATION FOR EMBEDDED SYSTEMS, 2013, 17 (02) : 439 - 458
  • [9] Instruction scheduling with k-successor tree for clustered VLIW processors
    Xuemeng Zhang
    Hui Wu
    Jingling Xue
    [J]. Design Automation for Embedded Systems, 2013, 17 : 439 - 458
  • [10] Instruction Decompressor Design for a VLIW Processor
    Buzdar, Abdul Rehman
    Sun, Liguo
    Latif, Azhar
    Buzdar, Abdullah
    [J]. INFORMACIJE MIDEM-JOURNAL OF MICROELECTRONICS ELECTRONIC COMPONENTS AND MATERIALS, 2015, 45 (04): : 225 - 236