Data Partitioning Strategies for Stencil Computations on NUMA Systems

被引：0

作者：

Feinbube, Frank ^{[1
]}

Plauth, Max ^{[1
]}

Knaust, Marius ^{[1
]}

Polze, Andreas ^{[1
]}

机构：

[1] Univ Potsdam, Hasso Plattner Inst Software Syst Engn, Operating Syst & Middleware Grp, Potsdam, Germany

来源：

EURO-PAR 2017: PARALLEL PROCESSING WORKSHOPS | 2018年 / 10659卷

关键词：

NUMA; Stencil computation; Data partitioning; OPTIMIZATION;

D O I：

10.1007/978-3-319-75178-8_48

中图分类号：

TP301 [理论、方法];

学科分类号：

081202 ;

摘要：

Many scientific problems rely on the efficient execution of stencil computations, which are usually memory-bound. In this paper, stencils on two-dimensional data are executed on NUMA architectures. Each node of a NUMA system processes a distinct partition of the input data independent from other nodes. However, processors may need access to the memory of other nodes at the edges of the partitions. This paper demonstrates two techniques based on machine learning for identifying partitioning strategies that reduce the occurrence of remote memory access. One approach is generally applicable and is based on an uninformed search. The second approach caps the search space by employing geometric decomposition. The partitioning strategies obtained with these techniques are analyzed theoretically. Finally, an evaluation on a real NUMA machine is conducted, which demonstrates that the expected reduction of the remote memory accesses can be achieved.

引用

页码：597 / 609

页数：13

共 50 条

[1] Automatic Partitioning of Stencil Computations on Heterogeneous Systems
Pereira, Alyson D.
Rocha, Rodrigo C. O.
Ramos, Luiz
Castro, Marcio
Goes, Luis F. W.
[J]. 2017 INTERNATIONAL SYMPOSIUM ON COMPUTER ARCHITECTURE AND HIGH PERFORMANCE COMPUTING WORKSHOPS (SBAC-PADW), 2017, : 43 - 48
[2] NUMA Aware Iterative Stencil Computations on Many-Core Systems
Shaheen, Mohammed
Strzodka, Robert
[J]. 2012 IEEE 26TH INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM (IPDPS), 2012, : 461 - 473
[3] Multilevel parallelism optimization of stencil computations on SIMDlized NUMA architectures
Zhang, Kaifang
Su, Huayou
Dou, Yong
[J]. JOURNAL OF SUPERCOMPUTING, 2021, 77 (11): : 13584 - 13600
[4] Multilevel parallelism optimization of stencil computations on SIMDlized NUMA architectures
Kaifang Zhang
Huayou Su
Yong Dou
[J]. The Journal of Supercomputing, 2021, 77 : 13584 - 13600
[5] Automatically Optimizing Stencil Computations on Many-Core NUMA Architectures
Lin, Pei-Hung
Yi, Qing
Quinlan, Daniel
Liao, Chunhua
Yan, Yongqing
[J]. LANGUAGES AND COMPILERS FOR PARALLEL COMPUTING, LCPC 2016, 2017, 10136 : 137 - 152
[6] Islands-of-Cores Approach for Harnessing SMP/NUMA Architectures in Heterogeneous Stencil Computations
Szustak, Lukasz
Wyrzykowski, Roman
Jakl, Ondrej
[J]. PARALLEL COMPUTING TECHNOLOGIES (PACT 2017), 2017, 10421 : 351 - 364
[7] Data access collection and data partitioning for NUMA architectures
Calidonna, CR
Furnari, MM
[J]. ADVANCES IN COMPUTATIONAL MECHANICS WITH HIGH PERFORMANCE COMPUTING, 1998, : 33 - 40
[8] Strategy for data-flow synchronizations in stencil parallel computations on multi-/manycore systems
Szustak, Lukasz
[J]. JOURNAL OF SUPERCOMPUTING, 2018, 74 (04): : 1534 - 1546
[9] Strategy for data-flow synchronizations in stencil parallel computations on multi-/manycore systems
Lukasz Szustak
[J]. The Journal of Supercomputing, 2018, 74 : 1534 - 1546
[10] PROGRAM PARTITIONING FOR NUMA MULTIPROCESSOR COMPUTER-SYSTEMS
WOLSKI, RM
FEO, JT
[J]. JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 1993, 19 (03) : 203 - 218

← 1 2 3 4 5 →