Combining compile-time and run-time support for efficient software distributed shared memory

被引:6
|
作者
Dwarkadas, S [1 ]
Lu, HH
Cox, AL
Rajamony, R
Zwaenepoel, W
机构
[1] Univ Rochester, Dept Comp Sci, Rochester, NY 14627 USA
[2] Rice Univ, Dept Elect & Comp Engn, Houston, TX 77005 USA
[3] Rice Univ, Dept Comp Sci, Houston, TX 77005 USA
[4] IBM Corp, Austin Res Lab, Austin, TX 78758 USA
基金
美国国家科学基金会;
关键词
automatic aggregation of data transfer; distributed shared memory; integrated compiler and run-time system; optimizing computer for irregular parallel programs;
D O I
10.1109/5.747868
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
We describe an integrated compile-time and run-time system for efficient shared memory parallel computing on distributed memory machines. The combined system presents the user with a shared memory programming model, with its well-known benefits in terms of ease of use. The run-time system implements a consistent shared memory abstraction using memory access detection and automatic data caching. The compiler improves the efficiency of the shared memory implementation by directing the run-time system to exploit the message passing capabilities of the underlying hardware. To do so, the compiler analyzes shared memory accesses and transforms the code to insert calls to the run-time system that provide it with thew access information computed by the compiler. The run-time system is augmented with the appropriate entry points to use this information to implement bulk data transfer and to reduce the overhead of run-time consistency maintenance. In those cases where the compiler analysis succeeds for the entire program, we demonstrate that the combined system achieves performance comparable to that produced by compilers that directly target message passing. If the compiler analysis is successful only for parts of the program, for instance, because of irregular accesses to some of the arrays, the resulting optimizations can be applied to those parts for which the analysis succeeds. If the compiler analysis fails entirely, we rely on the run-time's maintenance of shared memory and thereby avoid the complexity and the limitations of compilers that directly target message passing. The result is a single system that combines efficient support for both regular and irregular memory access patterns.
引用
收藏
页码:476 / 486
页数:11
相关论文
共 50 条
  • [1] An integrated compile-time/run-time software distributed shared memory system
    Dwarkadas, S
    Cox, AL
    Zwaenepoel, W
    [J]. ACM SIGPLAN NOTICES, 1996, 31 (09) : 186 - 197
  • [2] Integrated compile-time/run-time software distributed shared memory system
    Rice Univ, United States
    [J]. Comput Archit News, Special Issu (186-197):
  • [4] The Interplay of Compile-time and Run-time Options for Performance Prediction
    Lesoil, Luc
    Acher, Mathieu
    Ternava, Xhevahire
    Blouin, Arnaud
    Jezequel, Jean-Marc
    [J]. SPLC '21: PROCEEDINGS OF THE 25TH ACM INTERNATIONAL SYSTEMS AND SOFTWARE PRODUCT LINE CONFERENCE, VOL A, 2021,
  • [5] Compile-time composition of run-time data and iteration reorderings
    Strout, MM
    Carter, L
    Ferrante, J
    [J]. ACM SIGPLAN NOTICES, 2003, 38 (05) : 91 - 102
  • [6] Compile/run-time support for threaded MPI execution on multiprogrammed shared memory machines
    Tang, H
    Shen, K
    Yang, T
    [J]. ACM SIGPLAN NOTICES, 1999, 34 (08) : 107 - 118
  • [7] Specialization of Run-time Configuration Space at Compile-time: An Exploratory Study
    Ternava, Xhevahire
    Acher, Mathieu
    Combemale, Benoit
    [J]. 38TH ANNUAL ACM SYMPOSIUM ON APPLIED COMPUTING, SAC 2023, 2023, : 1459 - 1468
  • [8] OBJECT DISTRIBUTION IN ORCA USING COMPILE-TIME AND RUN-TIME TECHNIQUES
    BAL, HE
    KAASHOEK, MF
    [J]. SIGPLAN NOTICES, 1993, 28 (10): : 162 - 177
  • [9] Combining Run-time Checks and Compile-time Analysis to Improve Control Flow Auto-Vectorization
    Liu, Bangtian
    Laird, Avery
    Tsang, Wai Hung
    Mahjour, Bardia
    Dehnavi, Maryam Mehri
    [J]. PROCEEDINGS OF THE 2022 31ST INTERNATIONAL CONFERENCE ON PARALLEL ARCHITECTURES AND COMPILATION TECHNIQUES, PACT 2022, 2022, : 439 - 450
  • [10] Compile-time support for efficient data race detection in shared-memory parallel programs
    Mellor-Crummey, John
    [J]. SIGPLAN Notices (ACM Special Interest Group on Programming Languages), 1993, 28 (12):