Towards Efficient Decomposition and Parallelization of MPDATA on Hybrid CPU-GPU Cluster

被引:10
|
作者
Wyrzykowski, Roman [1 ]
Szustak, Lukasz [1 ]
Rojek, Krzysztof [1 ]
Tomas, Adam [1 ]
机构
[1] Czestochowa Tech Univ, PL-42201 Czestochowa, Poland
关键词
D O I
10.1007/978-3-662-43880-0_52
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
EULAG (Eulerian/semi-Lagrangian fluid solver) is an established computational model for simulating thermo-fluid flows across a wide range of scales and physical scenarios. The multidimensional positive definite advection transport algorithm (MPDATA) is among the most time-consuming components of EULAG. New supercomputing architectures based on multi-and many-core processors, such as hybrid CPU-GPU platforms, offer notable advantages over traditional supercomputers. In our previous works we considered adaptation of 2-dimensional (2D) MPDATA computations to a single CPU-GPU node. The main goal of this paper is to study tenets of optimal parallel formulation of 3D MPDATA on heterogeneous CPU-GPU cluster. Such supercomputer architecture requires not only a different philosophy of memory management than traditional massively parallel supercomputers, but also a comprehensive look at load balancing in the heterogeneous co-processing computing model. In this paper we propose an approach to implementation of 3D MPDATA algorithm on hybrid CPU-GPU cluster, using a mixture of MPI, OpenMP, and CUDA programming standards. This approach focuses on the donor-cell numerical scheme, and is based on a hierarchical decomposition including level of cluster, as well as distribution of computations between CPU and GPU components of each node, and within CPU and GPU devices. We discuss preliminary performance results for the proposed approach running on a single cluster node consisting of two AMD Opteron Interlagos CPUs and one or two NVIDIA Fermi GPUs.
引用
收藏
页码:457 / 464
页数:8
相关论文
共 50 条
  • [41] CPU-GPU Hybrid Parallel Binomial American Option Pricing
    Zhang, Nan
    Lim, Eng Gee
    Man, Ka Lok
    Lei, Chi-Un
    INTERNATIONAL MULTICONFERENCE OF ENGINEERS AND COMPUTER SCIENTIST, IMECS 2012, VOL II, 2012, : 1157 - 1162
  • [42] Optimizing tensor contraction expressions for hybrid CPU-GPU execution
    Ma, Wenjing
    Krishnamoorthy, Sriram
    Villa, Oreste
    Kowalski, Karol
    Agrawal, Gagan
    CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2013, 16 (01): : 131 - 155
  • [43] Heterogeneous CPU-GPU parallelization for modeling supersonic reacting flows with detailed chemical kinetics
    Rao, Sihang
    Chen, Bing
    Xu, Xu
    COMPUTER PHYSICS COMMUNICATIONS, 2024, 300
  • [44] Hybrid-Smash: A Heterogeneous CPU-GPU Compression Library
    Penaranda, Cristian
    Reano, Carlos
    Silla, Federico
    IEEE ACCESS, 2024, 12 : 32706 - 32723
  • [45] Fast Snippet Generation Based On CPU-GPU Hybrid System
    Liu, Ding
    Li, Ruixuan
    Gu, Xiwu
    Wen, Kunmei
    He, Heng
    Gao, Guoqiang
    2011 IEEE 17TH INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED SYSTEMS (ICPADS), 2011, : 252 - 259
  • [46] A user mode CPU-GPU scheduling framework for hybrid workloads
    Wang, Bin
    Ma, Ruhui
    Qi, Zhengwei
    Yao, Jianguo
    Guan, Haibing
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2016, 63 : 25 - 36
  • [47] PARALLEL SOLVER FOR SHIFTED SYSTEMS IN A HYBRID CPU-GPU FRAMEWORK
    Bosnery, Nela
    Bujanovic, Zvonimir
    Drmac, Zlatko
    SIAM JOURNAL ON SCIENTIFIC COMPUTING, 2018, 40 (04): : C605 - C633
  • [48] Noniterative Multireference Coupled Cluster Methods on Heterogeneous CPU-GPU Systems
    Bhaskaran-Nair, Kiran
    Ma, Wenjing
    Krishnamoorthy, Sriram
    Villa, Oreste
    van Dam, Hubertus J. J.
    Apra, Edoardo
    Kowalski, Karol
    JOURNAL OF CHEMICAL THEORY AND COMPUTATION, 2013, 9 (04) : 1949 - 1957
  • [49] HETEROGENEOUS DESIGN AND EFFICIENT CPU-GPU IMPLEMENTATION OF COLLISION DETECTION
    Tayyub, Mohid
    Khan, Gul N.
    IADIS-INTERNATIONAL JOURNAL ON COMPUTER SCIENCE AND INFORMATION SYSTEMS, 2019, 14 (02): : 25 - 40
  • [50] Comparative Study of Massively Parallel Cryptalysis and Cryptography on CPU-GPU Cluster
    Niewiadomska-Szynkiewicz, Ewa
    Marks, Michal
    Jantura, Jaroslaw
    Podbielski, Mikolaj
    Strzelczyk, Przemyslaw
    2013 MILITARY COMMUNICATIONS AND INFORMATION SYSTEMS CONFERENCE (MCC), 2013,