A Multi-GPU PCISPH Implementation with Efficient Memory Transfers

被引:0
|
作者
Verma, Kevin [1 ,2 ]
Peng, Chong [1 ]
Szewc, Kamil [1 ]
Wille, Robert [2 ]
机构
[1] ESS Engn Software Steyr GmbH, Steyr, Austria
[2] Johannes Kepler Univ Linz, Inst Integrated Circuits, Linz, Austria
关键词
SMOOTHED PARTICLE HYDRODYNAMICS; SPH;
D O I
暂无
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Smoothed Particle Hydrodynamics (SPH) is a particle-based method for fluid flow modeling. One promising variant of SPH is Predictive-Corrective Incompressible SPH (PCISPH), which employs a dedicate prediction-correction scheme and, by this, outperforms other SPH variants by almost one order of magnitude. However, similar to other particle-based methods, it suffers from a huge numerical complexity. In order to simulate real world phenomena, several millions of particles need to be considered. To make SPH applicable to real world engineering problems, it is hence common to exploit massive parallelism of multi-GPU architectures. However, certain algorithmic characteristics of PCISPH make it a non-trivial task to efficiently parallelize this method on multi-GPUs. In this work, we are, for the first time, proposing a multi-GPU implementation for PCISPH. To this end, we are proposing a scheme which allows to overlap the memory transfers between GPUs by actual computations and, by this, avoids the drawbacks caused by the mentioned algorithmic characteristics of PCISPH. Experimental evaluations confirm the efficiency of the proposed methods.
引用
收藏
页数:7
相关论文
共 50 条
  • [1] Efficient Implementation of MrBayes on Multi-GPU
    Bao, Jie
    Xia, Hongju
    Zhou, Jianfu
    Liu, Xiaoguang
    Wang, Gang
    [J]. MOLECULAR BIOLOGY AND EVOLUTION, 2013, 30 (06) : 1471 - 1479
  • [2] A PCISPH implementation using distributed multi-GPU acceleration for simulating industrial engineering applications
    Verma, Kevin
    McCabe, Christopher
    Peng, Chong
    Wille, Robert
    [J]. INTERNATIONAL JOURNAL OF HIGH PERFORMANCE COMPUTING APPLICATIONS, 2020, 34 (04): : 450 - 464
  • [3] Efficient Multi-GPU Shared Memory via Automatic Optimization of Fine-Grained Transfers
    Muthukrishnan, Harini
    Nellans, David
    Lustig, Daniel
    Fessler, Jeffrey A.
    Wenisch, Thomas F.
    [J]. 2021 ACM/IEEE 48TH ANNUAL INTERNATIONAL SYMPOSIUM ON COMPUTER ARCHITECTURE (ISCA 2021), 2021, : 139 - 152
  • [4] MAPREDUCE IMPLEMENTATION WITH MULTI-GPU
    Chen, Yi
    Chen, Su
    Jiang, Hai
    [J]. INTERNATIONAL SYMPOSIUM ON COMPUTER SCIENCE & TECHNOLOGY: PROCEEDINGS, 2012, : 21 - 25
  • [5] Efficient implementation of data flow graphs on multi-gpu clusters
    Boulos, Vincent
    Huet, Sylvain
    Fristot, Vincent
    Salvo, Luc
    Houzet, Dominique
    [J]. JOURNAL OF REAL-TIME IMAGE PROCESSING, 2014, 9 (01) : 217 - 232
  • [6] Efficient implementation of data flow graphs on multi-gpu clusters
    Vincent Boulos
    Sylvain Huet
    Vincent Fristot
    Luc Salvo
    Dominique Houzet
    [J]. Journal of Real-Time Image Processing, 2014, 9 : 217 - 232
  • [7] Efficient Multi-GPU Memory Management for Deep Learning Acceleration
    Kim, Youngrang
    Lee, Jaehwan
    Kim, Jik-Soo
    Jei, Hyunseung
    Roh, Hongchan
    [J]. 2018 IEEE 3RD INTERNATIONAL WORKSHOPS ON FOUNDATIONS AND APPLICATIONS OF SELF* SYSTEMS (FAS*W), 2018, : 37 - 43
  • [8] Multi-GPU Implementation of LU Factorization
    Jia, Yulu
    Luszczek, Piotr
    Dongarra, Jack
    [J]. PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE, ICCS 2012, 2012, 9 : 106 - 115
  • [9] Scalable multi-GPU implementation of the MAGFLOW simulator
    Rustico, Eugenio
    Bilotta, Giuseppe
    Herault, Alexis
    Del Negro, Ciro
    Gallo, Giovanni
    [J]. ANNALS OF GEOPHYSICS, 2011, 54 (05) : 592 - 599
  • [10] Towards a Multi-GPU Implementation of a Seismic Application
    Rigon, Pedro H. C.
    Schussler, Brenda S.
    Padoin, Edson L.
    Lorenzon, Arthur F.
    Carissimi, Alexandre
    Navaux, Philippe O. A.
    [J]. HIGH PERFORMANCE COMPUTING, CARLA 2023, 2024, 1887 : 146 - 159