Multi-GPU Kinetic Solvers using MPI and CUDA

被引:2
|
作者
Zabelok, Sergey [1 ]
Arslanbekov, Robert [1 ]
Kolobov, Vladimir [1 ]
机构
[1] Russian Acad Sci, Ctr Comp, Moscow 119333, Russia
关键词
unified flow solver; adaptive mesh refinement; discrete velocity method; boltzmann kinetic equation; direct simulation monte carlo; lattice boltzmann method; graphics processing units; CUDA; MPI; ALGORITHM;
D O I
10.1063/1.4902640
中图分类号
O59 [应用物理学];
学科分类号
摘要
This paper describes recent progress towards porting a Unified Flow Solver OHS to heterogeneous parallel computing. The main challenge of porting CIF'S to graphics processing units (GPUs) comes from the dynamically adapted mesh, which causes irregular data access. We describe the implementation of CUDA kernels for three modules in UFS: the direct Boltzmann solver using discrete velocity method (DVM), the DSMC module, and the Lattice Boltzmann Method (LBM) solver, all using octree Cartesian mesh with adaptive Mesh Refinement (AMR). Double digit speedup on single GPU and good scaling for multi-GPU has been demonstrated.
引用
收藏
页码:539 / 546
页数:8
相关论文
共 50 条
  • [1] Parallel QR Factorization using Givens Rotations in MPI-CUDA for Multi-GPU
    Tapia-Romero, Miguel
    Meneses-Viveros, Amilcar
    Hernandez-Rubio, Erika
    [J]. INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2020, 11 (05) : 636 - 645
  • [2] Optimizing MPI Communication on Multi-GPU Systems using CUDA Inter-Process Communication
    Potluri, S.
    Wang, H.
    Bureddy, D.
    Singh, A. K.
    Rosales, C.
    Panda, D. K.
    [J]. 2012 IEEE 26TH INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM WORKSHOPS & PHD FORUM (IPDPSW), 2012, : 1848 - 1857
  • [3] Hybrid MPI and CUDA Parallelization for CFD Applications on Multi-GPU HPC Clusters
    Lai, Jianqi
    Yu, Hang
    Tian, Zhengyu
    Li, Hua
    [J]. SCIENTIFIC PROGRAMMING, 2020, 2020
  • [4] Simulation and reconstruction for 3D elastic wave using multi-GPU and CUDA-aware MPI
    Cai, Wei
    Zhu, Peimin
    Li, Ziang
    [J]. COMPUTERS & GEOSCIENCES, 2024, 190
  • [5] Hybrid MPI and CUDA paralleled finite volume unstructured CFD simulations on a multi-GPU system
    Zhang, Xi
    Guo, Xiaohu
    Weng, Yue
    Zhang, Xianwei
    Lu, Yutong
    Zhao, Zhong
    [J]. FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2023, 139 : 1 - 16
  • [6] Effective Multi-GPU Communication Using Multiple CUDA Streams and Threads
    Sourouri, Mohammed
    Gillberg, Tor
    Baden, Scott B.
    Cai, Xing
    [J]. 2014 20TH IEEE INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED SYSTEMS (ICPADS), 2014, : 981 - 986
  • [7] Multi-GPU thermal lattice Boltzmann simulations using OpenACC and MPI
    Xu, Ao
    Li, Bo -Tao
    [J]. INTERNATIONAL JOURNAL OF HEAT AND MASS TRANSFER, 2023, 201
  • [8] Multi-GPU thermal lattice Boltzmann simulations using OpenACC and MPI
    Xu, Ao
    Li, Bo-Tao
    [J]. INTERNATIONAL JOURNAL OF HEAT AND MASS TRANSFER, 2023, 201
  • [9] Massively parallel modeling of electromagnetic field in conductive media: An MPI-CUDA implementation on Multi-GPU computers
    Tu, Xiaolei
    Bowles-Martinez, Esteban Jeremy
    Schultz, Adam
    [J]. COMPUTERS & GEOSCIENCES, 2024, 192
  • [10] A multi-GPU and CUDA-aware MPI-based spectral element formulation for ultrasonic wave propagation in solid media
    Li, Feilong
    Zou, Fangxin
    Rao, Jing
    [J]. ULTRASONICS, 2023, 134