Optimizing Irregular Shared-Memory Applications for Clusters

被引:0
|
作者
Min, Seung-Jai [1 ]
Eigenmann, Rudolf [1 ]
机构
[1] Purdue Univ, Sch Elect & Comp Engn, W Lafayette, IN 47907 USA
关键词
Compiler Analysis; Runtime Techniques; OpenMP; MPI; Irregular Data Accesses; Performance;
D O I
暂无
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Irregular applications pose challenges in optimizing communication, due to the difficulty of analyzing irregular data accesses accurately and efficiently. This challenge is especially big when translating irregular shared-memory applications to message-passing form for clusters. The lack of effective irregular data analysis in the translation system results in unnecessary or redundant communication, which limits application scalability. In this paper, we present a Lean Distributed Shared Memory (LDSM) system, which features a fast and accurate irregular data access (IDA) analysis. The analysis uses a region-based diff method and makes use of a runtime library that is optimized for irregular applications. We describe three optimizations that improve the LDSM system performance. A parallel array reduction transformation reduces overheads in the analysis. A packed communication optimization and a differential communication optimization effectively eliminate unnecessary and redundant messages. We evaluate the performance of the optimized LDSM system on a set of representative irregular benchmarks. The optimized LDSM executes irregular applications on average 45% faster than the hand-tuned MPI applications.
引用
收藏
页码:256 / 265
页数:10
相关论文
共 50 条
  • [41] Deployment and Scheduling Synthesis for Mixed-Critical Shared-Memory Applications
    Voss, Sebastian
    Schaetz, Bernhard
    2013 20TH ANNUAL IEEE INTERNATIONAL CONFERENCE AND WORKSHOPS ON THE ENGINEERING OF COMPUTER BASED SYSTEMS (ECBS 2013), 2013, : 100 - 118
  • [42] GEM: A Framework for Developing Shared-Memory Parallel GEnomic Applications on Memory Constrained Architectures
    Kutlu, Mucahid
    Agrawal, Gagan
    2015 44TH INTERNATIONAL CONFERENCE ON PARALLEL PROCESSING (ICPP), 2015, : 829 - 838
  • [43] BOUNDS ON SHARED-MEMORY FOR MUTUAL EXCLUSION
    BURNS, JE
    LYNCH, NA
    INFORMATION AND COMPUTATION, 1993, 107 (02) : 171 - 184
  • [44] Compiler and software distributed shared memory support for irregular applications
    Lu, HH
    Cox, AL
    Dwarkadas, S
    Rajamony, R
    Zwaenepoel, W
    ACM SIGPLAN NOTICES, 1997, 32 (07) : 48 - 56
  • [45] Comparative evaluation of parallel genetic algorithms on message passing and software shared-memory clusters
    Tiamkaew, E
    Louis, SJ
    Kongmunvattana, A
    PARALLEL AND DISTRIBUTED COMPUTING SYSTEMS, 2002, : 443 - 448
  • [46] SHARED-MEMORY CONTROLLERS LINK PROCESSORS
    AOUIZERAT, R
    MINI-MICRO SYSTEMS, 1983, 16 (11): : 272 - 274
  • [47] APPLICATIVE PARALLELISM ON A SHARED-MEMORY MULTIPROCESSOR
    OLDEHOEFT, RR
    CANN, DC
    IEEE SOFTWARE, 1988, 5 (01) : 62 - 70
  • [48] A SHARED-MEMORY MULTIPROCESSOR LOGIC SIMULATOR
    BEIHL, G
    EIGHTH ANNUAL INTERNATIONAL PHOENIX CONFERENCE ON COMPUTERS AND COMMUNICATIONS: 1989 CONFERENCE PROCEEDINGS, 1989, : 26 - 28
  • [49] Shared-Memory Communication for Containerized Workflows
    Hobson, Tanner
    Yildiz, Orcun
    Nicolae, Bogdan
    Huang, Jian
    Peterka, Tom
    21ST IEEE/ACM INTERNATIONAL SYMPOSIUM ON CLUSTER, CLOUD AND INTERNET COMPUTING (CCGRID 2021), 2021, : 123 - 132
  • [50] REDUCING CONTENTION IN SHARED-MEMORY MULTIPROCESSORS
    STENSTROM, P
    COMPUTER, 1988, 21 (11) : 26 - 35