SYNCHRONIZATION AND COMMUNICATION COSTS OF LOOP PARTITIONING ON SHARED-MEMORY MULTIPROCESSOR SYSTEMS

被引:1
|
作者
GUPTA, R
机构
[1] Department of Computer Science, University of Pittsburgh, Pittsburgh, PA
关键词
COMMUNICATION; PARALLELIZING COMPILERS; PROGRAM DECOMPOSITION; RUN-TIME SCHEDULING; SHARED MEMORY MULTIPROCESSOR SYSTEMS; SYNCHRONIZATION;
D O I
10.1109/71.149968
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
To exploit loop level parallelism on shared memory multiprocessor systems, loops are decomposed and their execution scheduled on different processors in parallel. This paper presents strategies for static loop decomposition and scheduling as well as compiler assisted run-time scheduling that take into account, in addition to the cost of performing operations, the overhead costs associated with a decomposition and schedule. An algorithm for static decomposition of multidimensional loops based upon the operation execution costs, communication costs, and synchronization costs is discussed. Following the decomposition of a program, synchronization instructions are introduced to ensure correct program execution. An algorithm for determining the explicit synchronization instructions that should be introduced in a program to ensure correct execution of the program with arbitrarily nested loops is presented. Techniques for reducing run-time scheduling, communication and synchronization costs due to self scheduling, a compiler assisted run-time scheduling technique, of multidimensional loops are also presented. Experiments performed on the Encore multiprocessor system demonstrate that the techniques developed can reduce overhead costs.
引用
收藏
页码:505 / 512
页数:8
相关论文
共 50 条
  • [21] SOLUTION OF SPARSE POSITIVE DEFINITE SYSTEMS ON A SHARED-MEMORY MULTIPROCESSOR
    GEORGE, A
    HEATH, MT
    LIU, J
    NG, E
    INTERNATIONAL JOURNAL OF PARALLEL PROGRAMMING, 1986, 15 (04) : 309 - 325
  • [22] Scalable Parallel Fault Simulation for Shared-Memory Multiprocessor Systems
    Hadjitheophanous, Stavros
    Neophytou, Stelios N.
    Michael, Maria K.
    2016 IEEE 34TH VLSI TEST SYMPOSIUM (VTS), 2016,
  • [23] A HIERARCHICAL TASK QUEUE ORGANIZATION FOR SHARED-MEMORY MULTIPROCESSOR SYSTEMS
    DANDAMUDI, SP
    CHENG, PSP
    IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 1995, 6 (01) : 1 - 16
  • [24] FINE-GRAIN SCHEDULER FOR SHARED-MEMORY MULTIPROCESSOR SYSTEMS
    SHIEH, JJ
    LEE, YC
    CHEN, HR
    IEE PROCEEDINGS-COMPUTERS AND DIGITAL TECHNIQUES, 1995, 142 (02): : 98 - 106
  • [26] An adaptive loop scheduling algorithm on shared-memory systems
    Jin, CM
    Yan, Y
    Zhang, XD
    EIGHTH IEEE SYMPOSIUM ON PARALLEL AND DISTRIBUTED PROCESSING, PROCEEDINGS, 1996, : 250 - 257
  • [27] Dedicated bus system for the shared-memory multiprocessor
    Kim, WW
    Chun, HS
    COMPUTERS AND THEIR APPLICATIONS, 2001, : 175 - 178
  • [28] PARALLEL CHOLESKY FACTORIZATION ON A SHARED-MEMORY MULTIPROCESSOR
    GEORGE, A
    HEATH, MT
    LIU, J
    LINEAR ALGEBRA AND ITS APPLICATIONS, 1986, 77 : 165 - 187
  • [29] Parallel VLSI test in a shared-memory multiprocessor
    Gil, C
    Ortega, J
    Montoya, MG
    CONCURRENCY-PRACTICE AND EXPERIENCE, 2000, 12 (05): : 311 - 326
  • [30] DYNAMIC-PROGRAMMING ON A SHARED-MEMORY MULTIPROCESSOR
    EDMONDS, P
    CHU, E
    GEORGE, A
    PARALLEL COMPUTING, 1993, 19 (01) : 9 - 22