Towards Efficient Decomposition and Parallelization of MPDATA on Hybrid CPU-GPU Cluster

Cited by: 10
Authors
Wyrzykowski, Roman [1 ]
Szustak, Lukasz [1 ]
Rojek, Krzysztof [1 ]
Tomas, Adam [1 ]
Affiliations
[1] Czestochowa Tech Univ, PL-42201 Czestochowa, Poland
DOI
10.1007/978-3-662-43880-0_52
Chinese Library Classification (CLC) number
TP301 [Theory, Methods]
Discipline classification code
081202
Abstract
EULAG (Eulerian/semi-Lagrangian fluid solver) is an established computational model for simulating thermo-fluid flows across a wide range of scales and physical scenarios. The multidimensional positive definite advection transport algorithm (MPDATA) is among the most time-consuming components of EULAG. New supercomputing architectures based on multi- and many-core processors, such as hybrid CPU-GPU platforms, offer notable advantages over traditional supercomputers. In our previous work we considered the adaptation of 2-dimensional (2D) MPDATA computations to a single CPU-GPU node. The main goal of this paper is to study the principles of an optimal parallel formulation of 3D MPDATA on a heterogeneous CPU-GPU cluster. Such a supercomputer architecture requires not only a philosophy of memory management different from that of traditional massively parallel supercomputers, but also a comprehensive look at load balancing in the heterogeneous co-processing computing model. In this paper we propose an approach to implementing the 3D MPDATA algorithm on a hybrid CPU-GPU cluster, using a mixture of the MPI, OpenMP, and CUDA programming standards. This approach focuses on the donor-cell numerical scheme and is based on a hierarchical decomposition that covers the cluster level, the distribution of computations between the CPU and GPU components of each node, and the distribution of work within the CPU and GPU devices. We discuss preliminary performance results for the proposed approach running on a single cluster node consisting of two AMD Opteron Interlagos CPUs and one or two NVIDIA Fermi GPUs.
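For readers unfamiliar with the donor-cell scheme named in the abstract, the following is a minimal sketch of one first-order upwind (donor-cell) step. It is not the authors' implementation: it assumes a 1D periodic grid and a constant Courant number, whereas the paper treats 3D MPDATA with MPI, OpenMP, and CUDA. The function names `donor_cell_flux` and `donor_cell_step` are hypothetical and chosen only for illustration.

```python
def donor_cell_flux(psi_left, psi_right, courant):
    """Upwind flux through a cell face.

    The value is taken from the "donor" cell: the left cell when the
    flow is directed to the right (courant > 0), the right cell otherwise.
    """
    return max(courant, 0.0) * psi_left + min(courant, 0.0) * psi_right


def donor_cell_step(psi, courant):
    """Advance a 1D periodic field by one donor-cell time step.

    Assumption for this sketch: `courant` (u * dt / dx) is constant
    across the grid and satisfies |courant| <= 1.
    """
    n = len(psi)
    # flux[i] is the flux through the face between cell i-1 and cell i;
    # psi[-1] wraps around, which gives the periodic boundary.
    flux = [donor_cell_flux(psi[i - 1], psi[i], courant) for i in range(n)]
    # Each cell loses the flux leaving through its right face and gains
    # the flux entering through its left face.
    return [psi[i] - (flux[(i + 1) % n] - flux[i]) for i in range(n)]


if __name__ == "__main__":
    field = [0.0, 1.0, 0.0, 0.0]
    print(donor_cell_step(field, 0.5))
```

Under the stated Courant-number assumption, the update conserves the sum of the field and keeps non-negative input non-negative, which is the sign-preserving property referred to by "positive definite" in the algorithm's name.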
Pages: 457-464
Number of pages: 8
Related papers
50 in total
  • [1] Hybrid CPU-GPU constraint checking: Towards efficient context consistency
    Sui, Jun
    Xu, Chang
    Cheung, S. C.
    Xi, Wang
    Jiang, Yanyan
    Cao, Chun
    Ma, Xiaoxing
    Lu, Jian
    INFORMATION AND SOFTWARE TECHNOLOGY, 2016, 74 : 230 - 242
  • [2] Learning Driven Parallelization for Large-Scale Video Workload in Hybrid CPU-GPU Cluster
    Zhang, Haitao
    Tang, Bingchang
    Geng, Xin
    Ma, Huadong
    PROCEEDINGS OF THE 47TH INTERNATIONAL CONFERENCE ON PARALLEL PROCESSING, 2018
  • [3] GSched: An efficient scheduler for hybrid CPU-GPU HPC systems
    Mateos, Mariano Raboso
    Robles, Juan Antonio Cotobal
    1600, Springer Verlag (217): : 179 - 185
  • [4] Learning Based Performance and Power Efficient Cluster Resource Manager for CPU-GPU Cluster
    Das, Soumen Kumar
    Sudhakaran, G.
    Ashok, V.
    2014 FOURTH INTERNATIONAL CONFERENCE OF EMERGING APPLICATIONS OF INFORMATION TECHNOLOGY (EAIT), 2014 : 161 - 166
  • [5] Efficient irregular wavefront propagation algorithms on hybrid CPU-GPU machines
    Teodoro, George
    Pan, Tony
    Kurc, Tahsin M.
    Kong, Jun
    Cooper, Lee A. D.
    Saltz, Joel H.
    PARALLEL COMPUTING, 2013, 39 (4-5) : 189 - 211
  • [6] Towards a parallelization and performance optimization of Viola and Jones algorithm in heterogeneous CPU-GPU mobile system
    Ghorbel, Agnes
    Ben Amor, Nader
    Jallouli, Mohamed
    2015 15TH INTERNATIONAL CONFERENCE ON INTELLIGENT SYSTEMS DESIGN AND APPLICATIONS (ISDA), 2015 : 528 - 532
  • [7] High efficient sedimentary basin simulations on hybrid CPU-GPU clusters
    Mei Wen
    Huayou Su
    Wenjie Wei
    Nan Wu
    Xing Cai
    Chunyuan Zhang
    Cluster Computing, 2014, 17 : 359 - 369
  • [8] Efficient Computation of Galois Field Expressions on Hybrid CPU-GPU Platforms
    Radmanovic, Milos M.
    Gajic, Dusan B.
    Stankovic, Radomir S.
    JOURNAL OF MULTIPLE-VALUED LOGIC AND SOFT COMPUTING, 2016, 26 (3-5) : 417 - 438
  • [9] High efficient sedimentary basin simulations on hybrid CPU-GPU clusters
    Wen, Mei
    Su, Huayou
    Wei, Wenjie
    Wu, Nan
    Cai, Xing
    Zhang, Chunyuan
    CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2014, 17 (02) : 359 - 369
  • [10] A Hybrid Parallel Algorithm for Computer Simulation of Electrocardiogram Based on a CPU-GPU Cluster
    Shen, Wenfeng
    Sun, Lianqiang
    Wei, Daming
    Xu, Weimin
    Wang, Hui
    Zhu, Xin
    2013 IEEE/ACIS 12TH INTERNATIONAL CONFERENCE ON COMPUTER AND INFORMATION SCIENCE (ICIS), 2013 : 167 - 171