CSMT: Simultaneous Multithreading for Clustered VLIW Processors

被引:3
|
作者
Gupta, Manoj [1 ]
Sanchez, Fermin [1 ]
Llosa, Josep [1 ]
机构
[1] Univ Politecn Cataluna, Dept Arquitectura Computadors, ES-08034 Barcelona, Spain
关键词
ILP; VLIW architectures; clustered VLIW architectures; multithreaded processors; simultaneous multithreading; ARCHITECTURE; PERFORMANCE;
D O I
10.1109/TC.2009.96
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Simultaneous MultiThreading (SMT) is a well-known technique that improves resource utilization by exploiting thread-level parallelism at the instruction grain level. However, implementing SMT for VLIWs requires complex structures, which is contrary to the VLIW philosophy of hardware simplicity. In this paper, we propose Cluster-level Simultaneous MultiThreading (CSMT) to allow some degree of SMT in clustered VLIW processors with low hardware cost and complexity. CSMT considers the set of operations that execute simultaneously in a given cluster as the assignment unit. To minimize cluster conflicts between threads, a very simple hardware-based cluster renaming mechanism is proposed. The hardware required to implement CSMT is cheap, realistic, and practical for a clustered VLIW processor. An analysis of the hardware required to implement CSMT shows that it is quite scalable, with up to eight threads easily supported at low hardware cost. The experimental results show that CSMT significantly improves performance when compared with other multithreading approaches suited for VLIW. For instance, with four threads, CSMT shows an average speedup of 110 percent over a single-thread VLIW architecture and 40 percent over Interleaved MultiThreading (IMT). In some cases, speedup can be as high as 225 percent over single-thread architecture and 84 percent over IMT.
引用
下载
收藏
页码:385 / 399
页数:15
相关论文
共 50 条
  • [11] Supporting speculative multithreading on simultaneous multithreaded processors
    Packirisamy, Venkatesan
    Wang, Shengyue
    Zhai, Antonia
    Hsu, Wei-Chung
    Yew, Pen-Chung
    HIGH PERFORMANCE COMPUTING - HIPC 2006, PROCEEDINGS, 2006, 4297 : 148 - +
  • [12] Compiler-assisted energy optimization for clustered VLIW processors
    Nagpal, Rahul
    Srikant, Y. N.
    JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 2012, 72 (08) : 944 - 959
  • [13] Simultaneous Multithreading VLIW DSP Architecture with Dynamic Dispatch Mechanism
    Shen, Zheng
    He, Hu
    Sun, Yihe
    PROCEEDINGS OF THE 2009 12TH EUROMICRO CONFERENCE ON DIGITAL SYSTEM DESIGN, ARCHITECTURES, METHODS AND TOOLS, 2009, : 505 - 512
  • [14] New schemes in clustered VLIW processors applied to turbo decoding
    Ituero, Pablo
    Lopez-Vallejo, Marisa
    IEEE 17TH INTERNATIONAL CONFERENCE ON APPLICATION-SPECIFIC SYSTEMS, ARCHITECTURES AND PROCESSORS, PROCEEDINGS, 2006, : 291 - +
  • [15] Lifetime Holes Aware Register Allocation for Clustered VLIW Processors
    Zhang, Xuemeng
    Wu, Hui
    Sun, Haiyan
    Xue, Jingling
    2014 DESIGN, AUTOMATION AND TEST IN EUROPE CONFERENCE AND EXHIBITION (DATE), 2014,
  • [16] An Effective Software Pipelining Algorithm for Clustered Embedded VLIW Processors
    Cagdas Akturan
    Margarida F. Jacome
    Design Automation for Embedded Systems, 2002, 7 : 115 - 138
  • [17] Inter-cluster communication models for clustered VLIW processors
    Terechko, A
    Le Thenaff, E
    Garg, M
    van Eijndhoven, J
    Corporaal, H
    NINTH INTERNATIONAL SYMPOSIUM ON HIGH-PERFORMANCE COMPUTER ARCHITECTURE, PROCEEDINGS, 2003, : 354 - 364
  • [18] CALiBeR: A software pipelining algorithm for clustered embedded VLIW processors
    Akturan, C
    Jacome, MF
    ICCAD 2001: IEEE/ACM INTERNATIONAL CONFERENCE ON COMPUTER AIDED DESIGN, DIGEST OF TECHNICAL PAPERS, 2001, : 112 - 118
  • [19] Register Allocation by Incremental Graph Colouring for Clustered VLIW Processors
    Zhang, Xuemeng
    Wu, Hui
    Sun, Haiyan
    2013 12TH IEEE INTERNATIONAL CONFERENCE ON TRUST, SECURITY AND PRIVACY IN COMPUTING AND COMMUNICATIONS (TRUSTCOM 2013), 2013, : 927 - 934
  • [20] Fault Injection Analysis of Transient Faults in Clustered VLIW Processors
    Sterpone, L.
    Sabena, D.
    Campagna, S.
    Reorda, M. Sonza
    2011 IEEE 14TH INTERNATIONAL SYMPOSIUM ON DESIGN AND DIAGNOSTICS OF ELECTRONIC CIRCUITS AND SYSTEMS (DDECS), 2011, : 207 - 212