Co-processing SPMD computation on CPUs and GPUs cluster

被引:0
|
作者
Li, Hui [1 ]
Fox, Geoffrey [1 ]
von Laszewski, Gregor [1 ]
Chauhan, Arun [1 ]
机构
[1] Indiana Univ, Pervas Technol Inst, Sch Informat & Comp, Bloomington, IN 47405 USA
关键词
SPMD; GPU; CUDA; Multi-Level-Scheduler;
D O I
暂无
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Heterogeneous parallel systems with multi processors and accelerators are becoming ubiquitous due to better cost-performance and energy-efficiency. These heterogeneous processor architectures have different instruction sets and are optimized for either task-latency or throughput purposes. Challenges occur in regard to programmability and performance when running SPMD tasks on heterogeneous devices. In order to meet these challenges, we implemented a parallel runtime system that used to co-process SPMD computation on CPUs and GPUs clusters. Furthermore, we are proposing an analytic model to automatically schedule SPMD tasks on heterogeneous clusters. Our analytic model is derived from the roofline model, and therefore it can be applied to a wider range of SPMD applications and hardware devices. The experimental results of the C-means, GMM, and GEMV show good speedup in practical heterogeneous cluster environments.
引用
收藏
页数:10
相关论文
共 50 条
  • [1] Parallel cube computation on modern CPUs and GPUs
    Zhou, Guoliang
    Chen, Hong
    [J]. JOURNAL OF SUPERCOMPUTING, 2012, 61 (03): : 394 - 417
  • [2] Parallel cube computation on modern CPUs and GPUs
    Guoliang Zhou
    Hong Chen
    [J]. The Journal of Supercomputing, 2012, 61 : 394 - 417
  • [3] Co-Processing Parallel Computation for Distributed Optical Fiber Vibration Sensing
    Wang, Yu
    Lv, Yuejuan
    Jin, Baoquan
    Xu, Yuelin
    Chen, Yu
    Liu, Xin
    Bai, Qing
    [J]. APPLIED SCIENCES-BASEL, 2020, 10 (05):
  • [4] Co-processing in Brazil
    [J]. 2005, Palladian Publications (36):
  • [5] Experiences Using CPUs and GPUs for Cooperative Computation in a Multi-Physics Simulation
    Pearce, Olga
    [J]. 47TH INTERNATIONAL CONFERENCE ON PARALLEL PROCESSING (ICPP '18), 2018,
  • [6] Hybrid Cluster of Multicore CPUs and GPUs for Accelerating Hyperspectral Image Hierarchical Segmentation
    Hossam, Mahmoud A.
    Ebied, Hala M.
    Abdel-Aziz, Mohamed H.
    [J]. 2013 8TH INTERNATIONAL CONFERENCE ON COMPUTER ENGINEERING & SYSTEMS (ICCES), 2013, : 262 - 267
  • [7] Heterogeneous parallel 3D image deconvolution on a cluster of GPUs and CPUs
    Domanski, L.
    Bednarz, T.
    Vallotton, P.
    Taylor, J.
    [J]. 19TH INTERNATIONAL CONGRESS ON MODELLING AND SIMULATION (MODSIM2011), 2011, : 613 - 619
  • [8] Hierarchical parallel processing of large scale data clustering on a PC cluster with GPU co-processing
    Takizawa, Hiroyuki
    Kobayashi, Hiroaki
    [J]. JOURNAL OF SUPERCOMPUTING, 2006, 36 (03): : 219 - 234
  • [9] Hierarchical parallel processing of large scale data clustering on a PC cluster with GPU co-processing
    Hiroyuki Takizawa
    Hiroaki Kobayashi
    [J]. The Journal of Supercomputing, 2006, 36 : 219 - 234
  • [10] On reconfigurable co-processing units
    Hartenstein, RW
    Herz, M
    Hoffmann, T
    Nageldinger, U
    [J]. PARALLEL AND DISTRIBUTED PROCESSING, 1998, 1388 : 67 - 72