Data Partitioning on Multicore and Multi-GPU Platforms Using Functional Performance Models

被引:52
|
作者
Zhong, Ziming [1 ]
Rychkov, Vladimir [1 ]
Lastovetsky, Alexey [1 ]
机构
[1] Univ Coll Dublin, Sch Comp Sci & Informat, Dublin 4, Ireland
基金
爱尔兰科学基金会;
关键词
HPC; heterogeneous computing; GPU-accelerated multicore system; performance modeling; data partitioning; HETEROGENEOUS MULTICORE; EQUATIONS; SYSTEMS;
D O I
10.1109/TC.2014.2375202
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Heterogeneous multiprocessor systems, which are composed of a mix of processing elements, such as commodity multicore processors, graphics processing units (GPUs), and others, have been widely used in scientific computing community. Software applications incorporate the code designed and optimized for different types of processing elements in order to exploit the computing power of such heterogeneous computing systems. In this paper, we consider the problem of optimal distribution of the workload of data-parallel scientific applications between processing elements of such heterogeneous computing systems. We present a solution that uses functional performance models (FPMs) of processing elements and FPM-based data partitioning algorithms. Efficiency of this approach is demonstrated by experiments with parallel matrix multiplication and numerical simulation of lid-driven cavity flow on hybrid servers and clusters.
引用
收藏
页码:2506 / 2518
页数:13
相关论文
共 50 条
  • [21] Parallel Singular Value Decomposition on Heterogeneous Multi-core and Multi-GPU Platforms
    Feng, Xiaowen
    Jin, Hai
    Zheng, Ran
    Zhu, Lei
    2014 NINTH INTERNATIONAL CONFERENCE ON DIGITAL INFORMATION MANAGEMENT (ICDIM), 2014, : 45 - 50
  • [22] Time Performance Analysis of Multi-CPU and Multi-GPU in Big Data Clustering Computation
    Adiyoso, Widiarto
    Krisnadhi, Adila
    Wibisono, Ari
    Purbarani, Sumarsih Condroayu
    Saraswati, Anindhita Dwi
    Putri, Annissa Fildzah Rafi
    Saladdin, Ibad Rahadian
    Anwar, S. Reyneta Carissa
    2018 INTERNATIONAL WORKSHOP ON BIG DATA AND INFORMATION SECURITY (IWBIS), 2018, : 113 - 116
  • [23] Multi-GPU Acceleration for Smart Grid Data Compression
    Chen, Zhi-Hung
    Lee, Che-Rung
    2013 IEEE 15TH INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING AND COMMUNICATIONS & 2013 IEEE INTERNATIONAL CONFERENCE ON EMBEDDED AND UBIQUITOUS COMPUTING (HPCC_EUC), 2013, : 1808 - 1813
  • [24] Data Parallel Multi-GPU Path Tracing using Ray Queue Cycling
    Wald, Ingo
    Jaros, Milan
    Zellmann, Stefan
    COMPUTER GRAPHICS FORUM, 2023, 42 (08)
  • [25] Performance Optimization of Allreduce Operation for Multi-GPU Systems
    Nukada, Akira
    2021 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2021, : 3107 - 3112
  • [26] Multi-GPU performance optimization of a computational fluid dynamics code using OpenACC
    Xue, Weicheng
    Roy, Christoper J.
    CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2021, 33 (05):
  • [27] MGPUSim: Enabling Multi-GPU Performance Modeling and Optimization
    Sun, Yifan
    Baruah, Trinayan
    Mojumder, Saiful A.
    Dong, Shi
    Gong, Xiang
    Treadway, Shane
    Bao, Yuhui
    Hance, Spencer
    McCardwell, Carter
    Zhao, Vincent
    Barclay, Harrison
    Ziabari, Amir Kavyan
    Chen, Zhongliang
    Ubal, Rafael
    Abelian, Jose L.
    Kim, John
    Joshi, Ajay
    Kaeli, David
    PROCEEDINGS OF THE 2019 46TH INTERNATIONAL SYMPOSIUM ON COMPUTER ARCHITECTURE (ISCA '19), 2019, : 197 - 209
  • [28] Multi-GPU performance optimization of a computational fluid dynamics code using OpenACC
    Xue, Weicheng
    Roy, Christoper J.
    Concurrency and Computation: Practice and Experience, 2021, 33 (05)
  • [29] Performance Optimization for SpMV on Multi-GPU Systems Using Threads and Multiple Streams
    Guo, Ping
    Zhang, Changjiang
    2016 28TH IEEE INTERNATIONAL SYMPOSIUM ON COMPUTER ARCHITECTURE AND HIGH PERFORMANCE COMPUTING WORKSHOPS (SBAC-PADW), 2016, : 67 - 72
  • [30] Multi-GPU Implementation of the Uniformization Method for Solving Markov Models
    Karwacki, Marek
    Bylina, Beata
    Bylina, Jaroslaw
    2012 FEDERATED CONFERENCE ON COMPUTER SCIENCE AND INFORMATION SYSTEMS (FEDCSIS), 2012, : 533 - 537