New multi-GPU implementation for smoothed particle hydrodynamics on heterogeneous clusters

被引:147
|
作者
Dominguez, J. M. [1 ]
Crespo, A. J. C. [1 ]
Valdez-Balderas, D. [2 ]
Rogers, B. D. [2 ]
Gomez-Gesteira, M. [1 ]
机构
[1] Univ Vigo, EPHYSLAB Environm Phys Lab, Vigo, Spain
[2] Univ Manchester, Sch Mech Aerosp & Civil Engn MACE, Modelling & Simulat Ctr MaSC, Manchester M13 9PL, Lancs, England
基金
英国工程与自然科学研究理事会;
关键词
HPC; GPU; Multi-GPU; MPI; CUDA; SPH; Meshfree methods; SURFACE FLUID SOLVER; WAVE IMPACT; SIMULATION; SPHYSICS;
D O I
10.1016/j.cpc.2013.03.008
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
A massively parallel SPH scheme using heterogeneous clusters of Central Processing Units (CPUs) and Graphics Processing Units (GPUs) has been developed. The new implementation originates from the single-GPU DualSPHysics code previously demonstrated to be powerful, stable and accurate. A combination of different parallel programming languages is combined to exploit not only one device (CPU or GPU) but also the combination of different machines. Communication among devices uses an improved Message Passing Interface (MPI) implementation which addresses some of the well-known drawbacks of MPI such as including a dynamic load balancing and overlapping data communications and computation tasks. The efficiency and scalability (strong and weak scaling) obtained with the new DualSPHysics code are analysed for different numbers of particles and different number of GPUs. Last, an application with more than 10(9) particles is presented to show the capability of the code to handle simulations that otherwise require large CPU clusters or supercomputers. (C) 2013 Elsevier B.V. All rights reserved.
引用
收藏
页码:1848 / 1860
页数:13
相关论文
共 50 条
  • [1] Advances in Multi-GPU Smoothed Particle Hydrodynamics Simulations
    Rustico, Eugenio
    Bilotta, Giuseppe
    Herault, Alexis
    Del Negro, Ciro
    Gallo, Giovanni
    [J]. IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2014, 25 (01) : 43 - 52
  • [2] Towards accelerating smoothed particle hydrodynamics simulations for free-surface flows on multi-GPU clusters
    Valdez-Balderas, Daniel
    Dominguez, Jose M.
    Rogers, Benedict D.
    Crespo, Alejandro J. C.
    [J]. JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 2013, 73 (11) : 1483 - 1493
  • [3] Explicit incompressible smoothed particle hydrodynamics in a multi-GPU environment for large-scale simulations
    Morikawa, Daniel
    Senadheera, Harini
    Asai, Mitsuteru
    [J]. COMPUTATIONAL PARTICLE MECHANICS, 2021, 8 (03) : 493 - 510
  • [4] Explicit incompressible smoothed particle hydrodynamics in a multi-GPU environment for large-scale simulations
    Daniel Morikawa
    Harini Senadheera
    Mitsuteru Asai
    [J]. Computational Particle Mechanics, 2021, 8 : 493 - 510
  • [5] Large scale water entry simulation with smoothed particle hydrodynamics on single- and multi-GPU systems
    Ji, Zhe
    Xu, Fei
    Takahashi, Akiyuki
    Sun, Yu
    [J]. COMPUTER PHYSICS COMMUNICATIONS, 2016, 209 : 1 - 12
  • [6] GPU-Centered Parallel Model on Heterogeneous Multi-GPU Clusters
    Wang, Feng
    [J]. PROCEEDINGS OF 2012 2ND INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND NETWORK TECHNOLOGY (ICCSNT 2012), 2012, : 1865 - 1868
  • [7] SMOOTHED PARTICLE HYDRODYNAMICS ON GPU COMPUTING
    Crespo, A. J. C.
    Dominguez, J. M.
    Valdez-Balderas, D.
    Rogers, B. D.
    Gomez-Gesteira, M.
    [J]. PARTICLE-BASED METHODS II: FUNDAMENTALS AND APPLICATIONS, 2011, : 922 - 929
  • [8] Efficient implementation of data flow graphs on multi-gpu clusters
    Vincent Boulos
    Sylvain Huet
    Vincent Fristot
    Luc Salvo
    Dominique Houzet
    [J]. Journal of Real-Time Image Processing, 2014, 9 : 217 - 232
  • [9] Efficient implementation of data flow graphs on multi-gpu clusters
    Boulos, Vincent
    Huet, Sylvain
    Fristot, Vincent
    Salvo, Luc
    Houzet, Dominique
    [J]. JOURNAL OF REAL-TIME IMAGE PROCESSING, 2014, 9 (01) : 217 - 232
  • [10] MAPREDUCE IMPLEMENTATION WITH MULTI-GPU
    Chen, Yi
    Chen, Su
    Jiang, Hai
    [J]. INTERNATIONAL SYMPOSIUM ON COMPUTER SCIENCE & TECHNOLOGY: PROCEEDINGS, 2012, : 21 - 25