Data Partitioning Strategies for Stencil Computations on NUMA Systems

被引:0
|
作者
Feinbube, Frank [1 ]
Plauth, Max [1 ]
Knaust, Marius [1 ]
Polze, Andreas [1 ]
机构
[1] Univ Potsdam, Hasso Plattner Inst Software Syst Engn, Operating Syst & Middleware Grp, Potsdam, Germany
关键词
NUMA; Stencil computation; Data partitioning; OPTIMIZATION;
D O I
10.1007/978-3-319-75178-8_48
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Many scientific problems rely on the efficient execution of stencil computations, which are usually memory-bound. In this paper, stencils on two-dimensional data are executed on NUMA architectures. Each node of a NUMA system processes a distinct partition of the input data independent from other nodes. However, processors may need access to the memory of other nodes at the edges of the partitions. This paper demonstrates two techniques based on machine learning for identifying partitioning strategies that reduce the occurrence of remote memory access. One approach is generally applicable and is based on an uninformed search. The second approach caps the search space by employing geometric decomposition. The partitioning strategies obtained with these techniques are analyzed theoretically. Finally, an evaluation on a real NUMA machine is conducted, which demonstrates that the expected reduction of the remote memory accesses can be achieved.
引用
收藏
页码:597 / 609
页数:13
相关论文
共 50 条
  • [1] Automatic Partitioning of Stencil Computations on Heterogeneous Systems
    Pereira, Alyson D.
    Rocha, Rodrigo C. O.
    Ramos, Luiz
    Castro, Marcio
    Goes, Luis F. W.
    [J]. 2017 INTERNATIONAL SYMPOSIUM ON COMPUTER ARCHITECTURE AND HIGH PERFORMANCE COMPUTING WORKSHOPS (SBAC-PADW), 2017, : 43 - 48
  • [2] NUMA Aware Iterative Stencil Computations on Many-Core Systems
    Shaheen, Mohammed
    Strzodka, Robert
    [J]. 2012 IEEE 26TH INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM (IPDPS), 2012, : 461 - 473
  • [3] Multilevel parallelism optimization of stencil computations on SIMDlized NUMA architectures
    Zhang, Kaifang
    Su, Huayou
    Dou, Yong
    [J]. JOURNAL OF SUPERCOMPUTING, 2021, 77 (11): : 13584 - 13600
  • [4] Multilevel parallelism optimization of stencil computations on SIMDlized NUMA architectures
    Kaifang Zhang
    Huayou Su
    Yong Dou
    [J]. The Journal of Supercomputing, 2021, 77 : 13584 - 13600
  • [5] Automatically Optimizing Stencil Computations on Many-Core NUMA Architectures
    Lin, Pei-Hung
    Yi, Qing
    Quinlan, Daniel
    Liao, Chunhua
    Yan, Yongqing
    [J]. LANGUAGES AND COMPILERS FOR PARALLEL COMPUTING, LCPC 2016, 2017, 10136 : 137 - 152
  • [6] Islands-of-Cores Approach for Harnessing SMP/NUMA Architectures in Heterogeneous Stencil Computations
    Szustak, Lukasz
    Wyrzykowski, Roman
    Jakl, Ondrej
    [J]. PARALLEL COMPUTING TECHNOLOGIES (PACT 2017), 2017, 10421 : 351 - 364
  • [7] Data access collection and data partitioning for NUMA architectures
    Calidonna, CR
    Furnari, MM
    [J]. ADVANCES IN COMPUTATIONAL MECHANICS WITH HIGH PERFORMANCE COMPUTING, 1998, : 33 - 40
  • [8] Strategy for data-flow synchronizations in stencil parallel computations on multi-/manycore systems
    Szustak, Lukasz
    [J]. JOURNAL OF SUPERCOMPUTING, 2018, 74 (04): : 1534 - 1546
  • [9] Strategy for data-flow synchronizations in stencil parallel computations on multi-/manycore systems
    Lukasz Szustak
    [J]. The Journal of Supercomputing, 2018, 74 : 1534 - 1546
  • [10] PROGRAM PARTITIONING FOR NUMA MULTIPROCESSOR COMPUTER-SYSTEMS
    WOLSKI, RM
    FEO, JT
    [J]. JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 1993, 19 (03) : 203 - 218