Optimizing Irregular Shared-Memory Applications for Clusters

被引:0
|
作者
Min, Seung-Jai [1 ]
Eigenmann, Rudolf [1 ]
机构
[1] Purdue Univ, Sch Elect & Comp Engn, W Lafayette, IN 47907 USA
关键词
Compiler Analysis; Runtime Techniques; OpenMP; MPI; Irregular Data Accesses; Performance;
D O I
暂无
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Irregular applications pose challenges in optimizing communication, due to the difficulty of analyzing irregular data accesses accurately and efficiently. This challenge is especially big when translating irregular shared-memory applications to message-passing form for clusters. The lack of effective irregular data analysis in the translation system results in unnecessary or redundant communication, which limits application scalability. In this paper, we present a Lean Distributed Shared Memory (LDSM) system, which features a fast and accurate irregular data access (IDA) analysis. The analysis uses a region-based diff method and makes use of a runtime library that is optimized for irregular applications. We describe three optimizations that improve the LDSM system performance. A parallel array reduction transformation reduces overheads in the analysis. A packed communication optimization and a differential communication optimization effectively eliminate unnecessary and redundant messages. We evaluate the performance of the optimized LDSM system on a set of representative irregular benchmarks. The optimized LDSM executes irregular applications on average 45% faster than the hand-tuned MPI applications.
引用
收藏
页码:256 / 265
页数:10
相关论文
共 50 条
  • [21] Shared-memory performance profiling
    Xu, ZC
    Larus, JR
    Miller, BP
    ACM SIGPLAN NOTICES, 1997, 32 (07) : 240 - 251
  • [22] SHARED-MEMORY AND MESSAGE QUEUES
    LAM, RB
    DR DOBBS JOURNAL, 1995, 20 (05): : 28 - &
  • [23] AND OR PARALLELISM ON SHARED-MEMORY MULTIPROCESSORS
    GUPTA, G
    JAYARAMAN, B
    JOURNAL OF LOGIC PROGRAMMING, 1993, 17 (01): : 59 - 89
  • [24] SMALL SHARED-MEMORY MULTIPROCESSORS
    BASKETT, F
    HENNESSY, JL
    SCIENCE, 1986, 231 (4741) : 963 - 967
  • [25] UNAMBIGUOUS SHARED-MEMORY SYSTEMS
    Morin, Remi
    INTERNATIONAL JOURNAL OF FOUNDATIONS OF COMPUTER SCIENCE, 2010, 21 (04) : 665 - 685
  • [26] ATOMIC SNAPSHOTS OF SHARED-MEMORY
    AFEK, Y
    ATTIYA, H
    DOLEV, D
    GAFNI, E
    MERRITT, M
    SHAVIT, N
    JOURNAL OF THE ACM, 1993, 40 (04) : 873 - 890
  • [27] Cellular Disco: resource management using virtual clusters on shared-memory multiprocessors
    Govil, K
    Teodosiu, D
    Huang, YQ
    Rosenblum, M
    OPERATING SYSTEMS REVIEW, VOL 33, NO 5, DECEMBER 1999, 1999, : 154 - 169
  • [28] Cellular Disco: Resource management using virtual clusters on shared-memory multiprocessors
    Govil, Kinshuk
    Teodosiu, Dan
    Huang, Yongqiang
    Rosenblum, Mendel
    Operating Systems Review (ACM), 1999, 33 (05): : 154 - 169
  • [29] Cellular disco: resource management using virtual clusters on shared-memory multiprocessors
    Govil, K
    Teodosiu, D
    Huang, YQ
    Rosenblum, M
    ACM TRANSACTIONS ON COMPUTER SYSTEMS, 2000, 18 (03): : 229 - 262
  • [30] Exploiting Distributed-Memory and Shared-Memory Parallelism on Clusters of SMPs with Data Parallel Programs
    Siegfried Benkner
    Viera Sipkova
    International Journal of Parallel Programming, 2003, 31 : 3 - 19