Data Partitioning on Multicore and Multi-GPU Platforms Using Functional Performance Models

Cited by: 52
Authors
Zhong, Ziming [1]
Rychkov, Vladimir [1]
Lastovetsky, Alexey [1]
Affiliations
[1] Univ Coll Dublin, Sch Comp Sci & Informat, Dublin 4, Ireland
Funding
Science Foundation Ireland
Keywords
HPC; heterogeneous computing; GPU-accelerated multicore system; performance modeling; data partitioning; HETEROGENEOUS MULTICORE; EQUATIONS; SYSTEMS;
DOI
10.1109/TC.2014.2375202
Chinese Library Classification number
TP3 [Computing technology, computer technology]
Discipline classification code
0812
Abstract
Heterogeneous multiprocessor systems, which combine processing elements such as commodity multicore processors, graphics processing units (GPUs), and others, are widely used in the scientific computing community. To exploit the computing power of these systems, software applications incorporate code designed and optimized for each type of processing element. In this paper, we consider the problem of optimally distributing the workload of data-parallel scientific applications between the processing elements of such heterogeneous computing systems. We present a solution that uses functional performance models (FPMs) of processing elements and FPM-based data partitioning algorithms. The efficiency of this approach is demonstrated by experiments with parallel matrix multiplication and numerical simulation of lid-driven cavity flow on hybrid servers and clusters.
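The abstract does not spell out the partitioning algorithms, so the following is only an illustrative sketch of the general idea, not the authors' method. It assumes each processing element is described by a speed function s(x) (work units per second at problem size x, standing in for an FPM), that the execution time x / s(x) is non-decreasing in x, and it balances the load with a plain bisection on the target time. All names (`SpeedFn`, `exec_time`, `max_load_within`, `partition`) are hypothetical.

```python
from typing import Callable, List, Sequence

# Hypothetical stand-in for an FPM: speed (units/second) at problem size x.
SpeedFn = Callable[[int], float]


def exec_time(speed: SpeedFn, x: int) -> float:
    """Time for one device to process x units under its speed function."""
    return 0.0 if x == 0 else x / speed(x)


def max_load_within(speed: SpeedFn, t_limit: float, n: int) -> int:
    """Largest x in [0, n] with exec_time <= t_limit.

    Assumes exec_time is non-decreasing in x (an assumption of this sketch).
    """
    lo, hi = 0, n
    while lo < hi:
        mid = (lo + hi + 1) // 2
        if exec_time(speed, mid) <= t_limit:
            lo = mid
        else:
            hi = mid - 1
    return lo


def partition(speeds: Sequence[SpeedFn], n: int, iters: int = 60) -> List[int]:
    """Split n units so that all devices finish within (nearly) the same time."""
    # Upper bound: the quickest single device processing everything alone.
    t_lo, t_hi = 0.0, min(exec_time(s, n) for s in speeds)
    for _ in range(iters):
        t_mid = (t_lo + t_hi) / 2
        if sum(max_load_within(s, t_mid, n) for s in speeds) >= n:
            t_hi = t_mid  # enough capacity: try a shorter time
        else:
            t_lo = t_mid  # not enough capacity: allow more time
    parts = [max_load_within(s, t_hi, n) for s in speeds]
    # Trim any surplus; removing work never lengthens a device's time.
    surplus = sum(parts) - n
    for i in range(len(parts)):
        cut = min(surplus, parts[i])
        parts[i] -= cut
        surplus -= cut
    return parts


if __name__ == "__main__":
    cpu = lambda x: 1.0                       # constant speed
    gpu = lambda x: 8.0 if x <= 40 else 2.0   # slows once data exceeds memory
    print(partition([cpu, gpu], 100))
```

With constant speeds this reduces to the usual proportional split; the size-dependent `gpu` function shows why a single speed constant per device can be misleading when a device's performance changes with the amount of data assigned to it.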
Pages: 2506-2518
Number of pages: 13
Related papers
50 in total
  • [1] Data Partitioning on Heterogeneous Multicore and Multi-GPU Systems Using Functional Performance Models of Data-Parallel Applications
    Zhong, Ziming
    Rychkov, Vladimir
    Lastovetsky, Alexey
    2012 IEEE INTERNATIONAL CONFERENCE ON CLUSTER COMPUTING (CLUSTER), 2012, : 191 - 199
  • [2] High performance conjugate gradient solver on multi-GPU clusters using hypergraph partitioning
    Cevahir, Ali
    Nukada, Akira
    Matsuoka, Satoshi
    COMPUTER SCIENCE-RESEARCH AND DEVELOPMENT, 2010, 25 (1-2): : 83 - 91
  • [3] Automatic tuning to performance modelling of matrix polynomials on multicore and multi-GPU systems
    Boratto, Murilo
    Alonso, Pedro
    Gimenez, Domingo
    Lastovetsky, Alexey
    JOURNAL OF SUPERCOMPUTING, 2017, 73 (01): : 227 - 239
  • [4] Automatic tuning to performance modelling of matrix polynomials on multicore and multi-GPU systems
    Murilo Boratto
    Pedro Alonso
    Domingo Giménez
    Alexey Lastovetsky
    The Journal of Supercomputing, 2017, 73 : 227 - 239
  • [5] Data Partitioning on Heterogeneous Multicore Platforms
    Zhong, Ziming
    Rychkov, Vladimir
    Lastovetsky, Alexey
    2011 IEEE INTERNATIONAL CONFERENCE ON CLUSTER COMPUTING (CLUSTER), 2011, : 580 - 584
  • [6] Simulations of Complex and Microscopic Models of Cardiac Electrophysiology Powered by Multi-GPU Platforms
    de Barros, Bruno Gouvea
    Oliveira, Rafael Sachetto
    Meira, Wagner, Jr.
    Lobosco, Marcelo
    dos Santos, Rodrigo Weber
    COMPUTATIONAL AND MATHEMATICAL METHODS IN MEDICINE, 2012, 2012
  • [7] Towards Universal Performance Modeling for Machine Learning Training on Multi-GPU Platforms
    Lin, Zhongyi
    Sun, Ning
    Bhattacharya, Pallab
    Feng, Xizhou
    Feng, Louis
    Owens, John D.
    IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2025, 36 (02) : 226 - 238
  • [8] Greedily Using GPU Capacity for Data List Processing in Multicore-GPU Platforms
    Alberto Martinez-Angeles, Carlos
    Buenabad-Chavez, Jorge
    Alfonso Castro-Garcia, Miguel
    Luis Quiroz-Fabian, Jose
    2013 10TH INTERNATIONAL CONFERENCE ON ELECTRICAL ENGINEERING, COMPUTING SCIENCE AND AUTOMATIC CONTROL (CCE), 2013, : 195 - 200
  • [9] Parallel Algorithm for Landform Attributes Representation on Multicore and Multi-GPU Systems
    Boratto, Murilo
    Alonso, Pedro
    Ramiro, Carla
    Barreto, Marcos
    Coelho, Leandro
    COMPUTATIONAL SCIENCE AND ITS APPLICATIONS - ICCSA 2012, PT I, 2012, 7333 : 29 - 43
  • [10] MULTICORE/MULTI-GPU ACCELERATED SIMULATIONS OF MULTIPHASE COMPRESSIBLE FLOWS USING WAVELET ADAPTED GRIDS
    Rossinelli, Diego
    Hejazialhosseini, Babak
    Spampinato, Daniele G.
    Koumoutsakos, Petros
    SIAM JOURNAL ON SCIENTIFIC COMPUTING, 2011, 33 (02): : 512 - 540