Using distributed memory parallel computers and GPU clusters for multidimensional Monte Carlo integration

被引:8
|
作者
Szalkowski, Dominik [1 ]
Stpiczynski, Przemyslaw [1 ]
机构
[1] Marie Curie Sklodowska Univ, Inst Math, PL-20031 Lublin, Poland
来源
关键词
multidimensional integration; Monte Carlo methods; parallelized pseudorandom number generators; linear congruential generator; lagged Fibonacci generator; distributed memory parallel computers; GPU clusters; efficiency; scalability; LINEAR CONGRUENTIAL GENERATORS; LIBRARY;
D O I
10.1002/cpe.3365
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
The aim of this paper is to show that the multidimensional Monte Carlo integration can be efficiently implemented on various distributed memory parallel computers and clusters of multicore nodes using recently developed parallel versions of linear congruential generator and lagged Fibonacci generator pseudorandom number generators. We show how to accelerate the overall performance by offloading some computations to Graphics Processing Units (GPUs), and we discuss how to transform Message Passing Interface (MPI)+OpenMP programs to MPI+OpenMP+CUDA model. We explain how to utilize multiple cores of CPUs together with multiple GPU accelerators within a single node and how to achieve reasonable load balancing of all computational resources of GPU-accelerated multicore nodes. We present and discuss the results of experiments performed on the following target architectures: IBM Blue Gene/Q parallel computer, a cluster of Intel Xeon E5-2660 servers, and a Tesla-based GPU cluster with Intel Xeon X5650 multicore processors. The results are presented from two points of view: strong scaling and weak scaling. We also compare the performance of all considered architectures. Copyright (c) 2014 John Wiley & Sons, Ltd.
引用
收藏
页码:923 / 936
页数:14
相关论文
共 50 条
  • [1] Multidimensional Monte Carlo Integration on Clusters with Hybrid GPU-Accelerated Nodes
    Szalkowski, Dominik
    Stpiczynski, Przemyslaw
    [J]. PARALLEL PROCESSING AND APPLIED MATHEMATICS (PPAM 2013), PT I, 2014, 8384 : 603 - 612
  • [2] Monte Carlo integration on GPU
    Kanzaki, J.
    [J]. EUROPEAN PHYSICAL JOURNAL C, 2011, 71 (02):
  • [3] Monte Carlo integration on GPU
    J. Kanzaki
    [J]. The European Physical Journal C, 2011, 71
  • [4] Monte Carlo on parallel and vector computers
    Stauffer, D
    [J]. ADVANCES IN COMPUTER SIMULATION, 1998, 501 : 82 - 87
  • [5] Vectorized algorithm for multidimensional Monte Carlo integration on modern GPU, CPU and MIC architectures
    Przemysław Stpiczyński
    [J]. The Journal of Supercomputing, 2018, 74 : 936 - 952
  • [6] Vectorized algorithm for multidimensional Monte Carlo integration on modern GPU, CPU and MIC architectures
    Stpiczynski, Przemyslaw
    [J]. JOURNAL OF SUPERCOMPUTING, 2018, 74 (02): : 936 - 952
  • [7] Piecewise cubic interpolation on distributed memory parallel computers and clusters of workstations
    Stpiczynski, Przemyslaw
    Potiopa, Joanna
    [J]. PAR ELEC 2006: INTERNATIONAL SYMPOSIUM ON PARALLEL COMPUTING IN ELECTRICAL ENGINEERING, PROCEEDINGS, 2006, : 284 - +
  • [8] Memory Reduction using a Ring Abstraction over GPU RDMA for Distributed Quantum Monte Carlo Solver
    Wei, Weile
    D'Azevedo, Eduardo
    Huck, Kevin
    Chatterjee, Arghya
    Hernandez, Oscar
    Kaiser, Hartmut
    [J]. PROCEEDINGS OF THE PLATFORM FOR ADVANCED SCIENTIFIC COMPUTING CONFERENCE (PASC '21), 2021,
  • [9] Parallel Monte Carlo Tree Search on GPU
    Rocki, Kamil
    Suda, Reiji
    [J]. ELEVENTH SCANDINAVIAN CONFERENCE ON ARTIFICIAL INTELLIGENCE (SCAI 2011), 2011, 227 : 80 - 89
  • [10] The Hybrid Monte Carlo algorithm on parallel computers
    Kennedy, AD
    [J]. PARALLEL COMPUTING, 1999, 25 (10-11) : 1311 - 1339