A Multi-GPU PCISPH Implementation with Efficient Memory Transfers

被引：0

作者：

Verma, Kevin ^{[1
,2
]}

Peng, Chong ^{[1
]}

Szewc, Kamil ^{[1
]}

Wille, Robert ^{[2
]}

机构：

[1] ESS Engn Software Steyr GmbH, Steyr, Austria

[2] Johannes Kepler Univ Linz, Inst Integrated Circuits, Linz, Austria

来源：

2018 IEEE HIGH PERFORMANCE EXTREME COMPUTING CONFERENCE (HPEC) | 2018年

关键词：

SMOOTHED PARTICLE HYDRODYNAMICS; SPH;

D O I：

暂无

中图分类号：

TP3 [计算技术、计算机技术];

学科分类号：

0812 ;

摘要：

Smoothed Particle Hydrodynamics (SPH) is a particle-based method for fluid flow modeling. One promising variant of SPH is Predictive-Corrective Incompressible SPH (PCISPH), which employs a dedicate prediction-correction scheme and, by this, outperforms other SPH variants by almost one order of magnitude. However, similar to other particle-based methods, it suffers from a huge numerical complexity. In order to simulate real world phenomena, several millions of particles need to be considered. To make SPH applicable to real world engineering problems, it is hence common to exploit massive parallelism of multi-GPU architectures. However, certain algorithmic characteristics of PCISPH make it a non-trivial task to efficiently parallelize this method on multi-GPUs. In this work, we are, for the first time, proposing a multi-GPU implementation for PCISPH. To this end, we are proposing a scheme which allows to overlap the memory transfers between GPUs by actual computations and, by this, avoids the drawbacks caused by the mentioned algorithmic characteristics of PCISPH. Experimental evaluations confirm the efficiency of the proposed methods.

引用

页数：7

共 50 条

[1] Efficient Implementation of MrBayes on Multi-GPU
Bao, Jie
Xia, Hongju
Zhou, Jianfu
Liu, Xiaoguang
Wang, Gang
[J]. MOLECULAR BIOLOGY AND EVOLUTION, 2013, 30 (06) : 1471 - 1479
[2] A PCISPH implementation using distributed multi-GPU acceleration for simulating industrial engineering applications
Verma, Kevin
McCabe, Christopher
Peng, Chong
Wille, Robert
[J]. INTERNATIONAL JOURNAL OF HIGH PERFORMANCE COMPUTING APPLICATIONS, 2020, 34 (04): : 450 - 464
[3] Efficient Multi-GPU Shared Memory via Automatic Optimization of Fine-Grained Transfers
Muthukrishnan, Harini
Nellans, David
Lustig, Daniel
Fessler, Jeffrey A.
Wenisch, Thomas F.
[J]. 2021 ACM/IEEE 48TH ANNUAL INTERNATIONAL SYMPOSIUM ON COMPUTER ARCHITECTURE (ISCA 2021), 2021, : 139 - 152
[4] MAPREDUCE IMPLEMENTATION WITH MULTI-GPU
Chen, Yi
Chen, Su
Jiang, Hai
[J]. INTERNATIONAL SYMPOSIUM ON COMPUTER SCIENCE & TECHNOLOGY: PROCEEDINGS, 2012, : 21 - 25
[5] Efficient implementation of data flow graphs on multi-gpu clusters
Boulos, Vincent
Huet, Sylvain
Fristot, Vincent
Salvo, Luc
Houzet, Dominique
[J]. JOURNAL OF REAL-TIME IMAGE PROCESSING, 2014, 9 (01) : 217 - 232
[6] Efficient implementation of data flow graphs on multi-gpu clusters
Vincent Boulos
Sylvain Huet
Vincent Fristot
Luc Salvo
Dominique Houzet
[J]. Journal of Real-Time Image Processing, 2014, 9 : 217 - 232
[7] Efficient Multi-GPU Memory Management for Deep Learning Acceleration
Kim, Youngrang
Lee, Jaehwan
Kim, Jik-Soo
Jei, Hyunseung
Roh, Hongchan
[J]. 2018 IEEE 3RD INTERNATIONAL WORKSHOPS ON FOUNDATIONS AND APPLICATIONS OF SELF* SYSTEMS (FAS*W), 2018, : 37 - 43
[8] Multi-GPU Implementation of LU Factorization
Jia, Yulu
Luszczek, Piotr
Dongarra, Jack
[J]. PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE, ICCS 2012, 2012, 9 : 106 - 115
[9] Scalable multi-GPU implementation of the MAGFLOW simulator
Rustico, Eugenio
Bilotta, Giuseppe
Herault, Alexis
Del Negro, Ciro
Gallo, Giovanni
[J]. ANNALS OF GEOPHYSICS, 2011, 54 (05) : 592 - 599
[10] Towards a Multi-GPU Implementation of a Seismic Application
Rigon, Pedro H. C.
Schussler, Brenda S.
Padoin, Edson L.
Lorenzon, Arthur F.
Carissimi, Alexandre
Navaux, Philippe O. A.
[J]. HIGH PERFORMANCE COMPUTING, CARLA 2023, 2024, 1887 : 146 - 159

← 1 2 3 4 5 →