Data Partitioning on Multicore and Multi-GPU Platforms Using Functional Performance Models

被引:52
|
作者
Zhong, Ziming [1 ]
Rychkov, Vladimir [1 ]
Lastovetsky, Alexey [1 ]
机构
[1] Univ Coll Dublin, Sch Comp Sci & Informat, Dublin 4, Ireland
基金
爱尔兰科学基金会;
关键词
HPC; heterogeneous computing; GPU-accelerated multicore system; performance modeling; data partitioning; HETEROGENEOUS MULTICORE; EQUATIONS; SYSTEMS;
D O I
10.1109/TC.2014.2375202
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Heterogeneous multiprocessor systems, which are composed of a mix of processing elements, such as commodity multicore processors, graphics processing units (GPUs), and others, have been widely used in scientific computing community. Software applications incorporate the code designed and optimized for different types of processing elements in order to exploit the computing power of such heterogeneous computing systems. In this paper, we consider the problem of optimal distribution of the workload of data-parallel scientific applications between processing elements of such heterogeneous computing systems. We present a solution that uses functional performance models (FPMs) of processing elements and FPM-based data partitioning algorithms. Efficiency of this approach is demonstrated by experiments with parallel matrix multiplication and numerical simulation of lid-driven cavity flow on hybrid servers and clusters.
引用
收藏
页码:2506 / 2518
页数:13
相关论文
共 50 条
  • [31] Two-Stage Least Squares algorithms with QR decomposition for Simultaneous Equations Models on heterogeneous multicore and multi-GPU systems
    Ramiro, Carla
    Lopez-Espin, Jose J.
    Gimenez, Domingo
    Vidal, Antonio M.
    PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE, ICCS 2012, 2012, 9 : 2004 - 2007
  • [32] Hybrid CPU- and GPU-based Implementation for Particle-in-Cell Simulation on Multicore and Multi-GPU Systems
    Wang, Pan
    Zhu, Xiangqin
    2021 PHOTONICS & ELECTROMAGNETICS RESEARCH SYMPOSIUM (PIERS 2021), 2021, : 155 - 161
  • [33] Multi-GPU Kinetic Solvers using MPI and CUDA
    Zabelok, Sergey
    Arslanbekov, Robert
    Kolobov, Vladimir
    PROCEEDINGS OF THE 29TH INTERNATIONAL SYMPOSIUM ON RAREFIED GAS DYNAMICS, 2014, 1628 : 539 - 546
  • [34] ScaleDNN: Data Movement Aware DNN Training on Multi-GPU
    Xu, Weizheng
    Pattnaik, Ashutosh
    Yuan, Geng
    Wang, Yanzhi
    Zhang, Youtao
    Tang, Xulong
    2021 IEEE/ACM INTERNATIONAL CONFERENCE ON COMPUTER AIDED DESIGN (ICCAD), 2021,
  • [35] A Multi-GPU Framework for In-Memory Text Data Analytics
    Chong, Poh Kit
    Karuppiah, Ettikan K.
    Yong, Keh Kok
    2013 IEEE 27TH INTERNATIONAL CONFERENCE ON ADVANCED INFORMATION NETWORKING AND APPLICATIONS WORKSHOPS (WAINA), 2013, : 1411 - 1416
  • [36] Automatic Data Allocation and Buffer Management for Multi-GPU Machines
    Ramashekar, Thejas
    Bondhugula, Uday
    ACM TRANSACTIONS ON ARCHITECTURE AND CODE OPTIMIZATION, 2013, 10 (04)
  • [37] High performance MRI simulations of motion on multi-GPU systems
    Xanthis, Christos G.
    Venetis, Ioannis E.
    Aletras, Anthony H.
    JOURNAL OF CARDIOVASCULAR MAGNETIC RESONANCE, 2014, 16
  • [38] SysCellC: a data-flow programming model on multi-GPU
    Houzet, Dominique
    Huet, Sylvain
    Rahman, Anis
    ICCS 2010 - INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE, PROCEEDINGS, 2010, 1 (01): : 1029 - 1038
  • [39] Efficient implementation of data flow graphs on multi-gpu clusters
    Vincent Boulos
    Sylvain Huet
    Vincent Fristot
    Luc Salvo
    Dominique Houzet
    Journal of Real-Time Image Processing, 2014, 9 : 217 - 232
  • [40] Efficient implementation of data flow graphs on multi-gpu clusters
    Boulos, Vincent
    Huet, Sylvain
    Fristot, Vincent
    Salvo, Luc
    Houzet, Dominique
    JOURNAL OF REAL-TIME IMAGE PROCESSING, 2014, 9 (01) : 217 - 232