Loop scheduling for multithreaded processors

被引:2
|
作者
Dimitriou, G [1 ]
Polychronopoulos, C [1 ]
机构
[1] Univ Thessaly, Dept Comp Engn, Volos, Greece
关键词
D O I
10.1109/PCEE.2004.1335651
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
The presence of multiple active threads on the same processor can mask latency by rapid context switching, but it can adversely affect performance due to competition for shared datapath resources. In this paper we present Macro Software Pipelining (MSWP), a loop scheduling technique for multithreaded processors, which is based on the loop distribution transformation for loop pipelining. MSWP constructs loop schedules by partitioning the loop body into tasks and assigning each task to a thread that executes all iterations for that particular task. MSWP is applied top-down on a hierarchical program representation, and utilizes thread-level speculation for maximal exploitation of parallelism. We tested MSWP on a multithreaded architectural model, Coral 2000, using synthetic and, SPEC benchmarks. We obtained speedups of up to 30% with respect to highly optimized superblock-based schedules on loops with unpredictable branches, and a speedup of up to 25% on perl, a highly sequential SPEC95 integer benchmark.
引用
收藏
页码:361 / 366
页数:6
相关论文
共 50 条
  • [41] Preliminary evaluation of a binary translation system for multithreaded processors
    Ootsu, K
    Yokota, T
    Ono, T
    Baba, T
    INTERNATIONAL WORKSHOP ON INNOVATIVE ARCHITECTURE FOR FUTURE GENERATION HIGH-PERFORMANCE PROCESSORS AND SYSTEMS, 2002, : 77 - 84
  • [42] Round Robin Thread Selection Optimization in Multithreaded Processors
    Carroll, Shane
    Lin, Wei-Ming
    PARALLEL PROCESSING LETTERS, 2019, 29 (01)
  • [43] An evaluation of speculative instruction execution on simultaneous multithreaded processors
    Swanson, S
    McDowell, LK
    Swift, MM
    Eggers, SJ
    Levy, HM
    ACM TRANSACTIONS ON COMPUTER SYSTEMS, 2003, 21 (03): : 314 - 340
  • [44] An Accurate Analytical Design Model for Multithreaded Network Processors
    Hafezan, Mohamad
    Beigi, Leila
    2016 24TH EUROMICRO INTERNATIONAL CONFERENCE ON PARALLEL, DISTRIBUTED, AND NETWORK-BASED PROCESSING (PDP), 2016, : 216 - 223
  • [45] An evaluation of OpenMP on current and emerging multithreaded/multicore processors
    Curtis-Maury, Matthew
    Ding, Xiaoning
    Antonopoulos, Christos D.
    Nikolopoulos, Dimitrios S.
    OPENMP SHARED MEMORY PARALLEL PROGRAMMING, PROCEEDINGS, 2008, 4315 : 133 - 144
  • [46] Improving server software support for simultaneous multithreaded processors
    McDowell, LK
    Eggers, SJ
    Gribble, SD
    ACM SIGPLAN NOTICES, 2003, 38 (10) : 37 - 48
  • [47] Optimal Task Assignment in Multithreaded Processors: A Statistical Approach
    Radojkovic, Petar
    Cakarevic, Vladimir
    Moreto, Miquel
    Verdu, Javier
    Pajuelo, Alex
    Cazorla, Francisco J.
    Nemirovsky, Mario
    Valero, Mateo
    ACM SIGPLAN NOTICES, 2012, 47 (04) : 235 - 248
  • [48] Thread Assignment in Multicore/Multithreaded Processors: A Statistical Approach
    Radojkovic, Petar
    Carpenter, Paul M.
    Moreto, Miguel
    Cakarevic, Vladimir
    Verdu, Javier
    Pajuelo, Alex
    Cazorla, Francisco J.
    Nemirovsky, Mario
    Valero, Mateo
    IEEE TRANSACTIONS ON COMPUTERS, 2016, 65 (01) : 256 - 269
  • [49] A Garbage Collection Technique for Embedded Multithreaded Multicore Processors
    Uhrig, Sascha
    Ungerer, Theo
    ARCHITECTURE OF COMPUTING SYSTEMS-ARCS 2009, 22ND INTERNATIONAL CONFERENCE, 2009, 5455 : 207 - 218
  • [50] An effective instruction fetch policy for simultaneous multithreaded processors
    He, LQ
    Liu, ZY
    SEVENTH INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING AND GRID IN ASIA PACIFIC REGION, PROCEEDINGS, 2004, : 162 - 168