Code Generation for Parallel Execution of a Class of Irregular Loops on Distributed Memory Systems

被引:0
|
作者
Ravishankar, Mahesh [1 ]
Eisenlohr, John [1 ]
Pouchet, Louis-Noel [1 ]
Ramanujam, J. [2 ]
Rountev, Atanas [1 ]
Sadayappan, P. [1 ]
机构
[1] Ohio State Univ, Columbus, OH 43210 USA
[2] Louisiana State Univ, Louvain, Belgium
基金
美国国家科学基金会;
关键词
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Parallelization and locality optimization of affine loop nests has been successfully addressed for shared-memory machines. However, many large-scale simulation applications must be executed in a distributed-memory environment, and use irregular/sparse computations where the control-flow and array-access patterns are data-dependent. In this paper, we propose an approach for effective parallel execution of a class of irregular loop computations in a distributed-memory environment, using a combination of static and runtime analysis. We discuss algorithms that analyze sequential code to generate an inspector and an executor. The inspector captures the data-dependent behavior of the computation in parallel and without requiring complete replication of any of the data structures used in the original computation. The executor performs the computation in parallel. The effectiveness of the framework is demonstrated on several benchmarks and a climate modeling application.
引用
收藏
页数:11
相关论文
共 50 条
  • [1] Automatic parallelization of a class of irregular loops for distributed memory systems
    Ravishankar, Mahesh
    Eisenlohr, John
    Pouchet, Louis-Noël
    Ramanujam, J.
    Rountev, Atanas
    Sadayappan, P.
    [J]. Ravishankar, Mahesh (ravishan@cse.ohio-state.edu), 1600, Association for Computing Machinery, 2 Penn Plaza, Suite 701, New York, NY 10121-0701, United States (01):
  • [2] From Serial Loops to Parallel Execution on Distributed Systems
    Bosilca, George
    Bouteiller, Aurelien
    Danalis, Anthony
    Herault, Thomas
    Dongarra, Jack
    [J]. EURO-PAR 2012 PARALLEL PROCESSING, 2012, 7484 : 246 - 257
  • [3] Automatic parallel code generation for message passing on distributed memory systems
    Johnson, SP
    Ierotheou, CS
    Cross, M
    [J]. PARALLEL COMPUTING, 1996, 22 (02) : 227 - 258
  • [4] EFFICIENT EXECUTION OF DOACROSS LOOPS ON DISTRIBUTED-MEMORY SYSTEMS
    ZAAFRANI, A
    ITO, MR
    [J]. IFIP TRANSACTIONS A-COMPUTER SCIENCE AND TECHNOLOGY, 1993, 23 : 27 - 38
  • [5] Distributed Memory Code Generation for Mixed Irregular/Regular Computations
    Ravishankar, Mahesh
    Dathathri, Roshan
    Elango, Venmugil
    Pouchet, Louis-Noel
    Ramanujam, J.
    Rountev, Atanas
    Sadayappan, P.
    [J]. ACM SIGPLAN NOTICES, 2015, 50 (08) : 65 - 75
  • [6] Possibilities of Optimal Execution of Parallel Programs Containing Simple and Iterated Loops on Heterogeneous Parallel Computational Systems with Distributed Memory
    A. I. Avetisyan
    S. S. Gaisaryan
    O. I. Samovarov
    [J]. Programming and Computer Software, 2002, 28 : 28 - 40
  • [7] Possibilities of optimal execution of parallel programs containing simple and iterated loops on heterogeneous parallel computational systems with distributed memory
    Avetisyan, A.I.
    Gajsaryan, S.S.
    Samovarov, O.I.
    [J]. Programmirovanie, 2002, 28 (01): : 38 - 55
  • [8] Possibilities of optimal execution of parallel programs containing simple and iterated loops on heterogeneous parallel computational systems with distributed memory
    Avetisyan, AI
    Gaisaryan, SS
    Samovarov, OI
    [J]. PROGRAMMING AND COMPUTER SOFTWARE, 2002, 28 (01) : 28 - 40
  • [9] Models for Estimating the Execution Time of Software Loops in Parallel and Distributed Systems
    Wrobel, Magdalena
    [J]. THEORY AND ENGINEERING OF COMPLEX SYSTEMS AND DEPENDABILITY, 2015, 365 : 533 - 542
  • [10] Adaptively scheduling parallel loops in distributed shared-memory systems
    Yan, Y
    Jin, CM
    Zhang, XD
    [J]. IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 1997, 8 (01) : 70 - 81