Simultaneous CPU-GPU Execution of Data Parallel Algorithmic Skeletons

被引:5
|
作者
Wrede, Fabian [1 ]
Ernsting, Steffen [1 ]
机构
[1] Leonardo Campus 3, D-48149 Munster, Germany
关键词
High-level parallel programming; Data parallel algorithmic skeletons; Simultaneous CPU-GPU execution;
D O I
10.1007/s10766-016-0483-9
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Parallel programming has become ubiquitous; however, it is still a low-level and error-prone task, especially when accelerators such as GPUs are used. Thus, algorithmic skeletons have been proposed to provide well-defined programming patterns in order to assist programmers and shield them from low-level aspects. As the complexity of problems, and consequently the need for computing capacity, grows, we have directed our research toward simultaneous CPU-GPU execution of data parallel skeletons to achieve a performance gain. GPUs are optimized with respect to throughput and designed for massively parallel computations. Nevertheless, we analyze whether the additional utilization of the CPU for data parallel skeletons in the Muenster Skeleton Library leads to speedups or causes a reduced performance, because of the smaller computational capacity of CPUs compared to GPUs. We present a C implementation based on a static distribution approach. In order to evaluate the implementation, four different benchmarks, including matrix multiplication, N-body simulation, Frobenius norm, and ray tracing, have been conducted. The ratio of CPU and GPU execution has been varied manually to observe the effects of different distributions. The results show that a speedup can be achieved by distributing the execution among CPUs and GPUs. However, both the results and the optimal distribution highly depend on the available hardware and the specific algorithm.
引用
收藏
页码:42 / 61
页数:20
相关论文
共 50 条
  • [31] Optimization of Parallel Algorithm for Kalman Filter on CPU-GPU Heterogeneous System
    Xu, Dandan
    Xiao, Zheng
    Li, Dapu
    Wu, Fan
    [J]. 2016 12TH INTERNATIONAL CONFERENCE ON NATURAL COMPUTATION, FUZZY SYSTEMS AND KNOWLEDGE DISCOVERY (ICNC-FSKD), 2016, : 2165 - 2172
  • [32] Simeuro: A Hybrid CPU-GPU Parallel Simulator for Neuromorphic Computing Chips
    Zhang, Huaipeng
    Ho, Nhut-Minh
    Polat, Dogukan Yigit
    Chen, Peng
    Wahib, Mohamed
    Nguyen, Truong Thao
    Meng, Jintao
    Goh, Rick Siow Mong
    Matsuoka, Satoshi
    Luo, Tao
    Wong, Weng-Fai
    [J]. IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2023, 34 (10) : 2767 - 2782
  • [33] Parabolic Radon transform parallel algorithm for CPU-GPU heterogeneous platform
    Zhang, Quan
    Lin, Baiyue
    Yang, Bo
    Peng, Bo
    Zhang, Wei
    Tu, Ran
    [J]. Shiyou Diqiu Wuli Kantan/Oil Geophysical Prospecting, 2020, 55 (06): : 1263 - 1270
  • [34] An industrial defect detection algorithm based on CPU-GPU parallel call
    Zhu Li
    Hong-wei Lin
    Yuan-yuan Liu
    Chong Chen
    Yun-fei Xia
    [J]. Multimedia Tools and Applications, 2023, 82 : 44191 - 44207
  • [35] An industrial defect detection algorithm based on CPU-GPU parallel call
    Li, Zhu
    Lin, Hong-wei
    Liu, Yuan-yuan
    Chen, Chong
    Xia, Yun-fei
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 82 (28) : 44191 - 44207
  • [36] Development of a CPU-GPU heterogeneous platform based on a nonlinear parallel algorithm
    Ma, Haifeng
    [J]. NONLINEAR ENGINEERING - MODELING AND APPLICATION, 2022, 11 (01): : 215 - 222
  • [37] Developing a CPU-GPU LES Parallel Solver for Canonical Turbulent Flows
    ZendehAli, Nafiseh
    Emdad, Homayoun
    Abouali, Omid
    [J]. IRANIAN JOURNAL OF SCIENCE AND TECHNOLOGY-TRANSACTIONS OF MECHANICAL ENGINEERING, 2023, 47 (04) : 1535 - 1551
  • [38] PARALLEL BINOMIAL AMERICAN OPTION PRICING ON CPU-GPU HYBRID PLATFORM
    Zhang, Nan
    Lei, Chi-Un
    Man, Ka Lok
    [J]. IAENG TRANSACTIONS ON ELECTRICAL ENGINEERING, VOL 1, 2012, : 161 - 174
  • [39] Data Parallel Algorithmic Skeletons with Accelerator Support
    Ernsting, Steffen
    Kuchen, Herbert
    [J]. INTERNATIONAL JOURNAL OF PARALLEL PROGRAMMING, 2017, 45 (02) : 283 - 299
  • [40] Big data simulation for surface reconstruction on CPU-GPU platform
    Hadi, N. A.
    [J]. 2ND INTERNATIONAL CONFERENCE ON DATA AND INFORMATION SCIENCE, 2019, 1192