Portable Mapping of Data Parallel Programs to OpenCL for Heterogeneous Systems

被引:0
|
作者
Grewe, Dominik [1 ]
Wang, Zheng [1 ]
O'Boyle, Michael F. P. [1 ]
机构
[1] Univ Edinburgh, Sch Informat, Edinburgh EH8 9YL, Midlothian, Scotland
关键词
GPU; OpenCL; Machine-Learning Mapping;
D O I
暂无
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
General purpose GPU based systems are highly attractive as they give potentially massive performance at little cost. Realizing such potential is challenging due to the complexity of programming. This paper presents a compiler based approach to automatically generate optimized OpenCL code from data-parallel OpenMP programs for GPUs. Such an approach brings together the benefits of a clear high level language (OpenMP) and an emerging standard (OpenCL) for heterogeneous multi-cores. A key feature of our scheme is that it leverages existing transformations, especially data transformations, to improve performance on GPU architectures and uses predictive modeling to automatically determine if it is worthwhile running the OpenCL code on the GPU or OpenMP code on the multi-core host. We applied our approach to the entire NAS parallel benchmark suite and evaluated it on two distinct GPU based systems: Core i7/NVIDIA GeForce GTX 580 and Core i7/AMD Radeon 7970. We achieved average (up to) speedups of 4.51x and 4.20x (143x and 67x) respectively over a sequential baseline. This is, on average, a factor 1.63 and 1.56 times faster than a hand-coded, GPU-specific OpenCL implementation developed by independent expert programmers.
引用
收藏
页码:161 / 170
页数:10
相关论文
共 50 条
  • [1] Automatic and Portable Mapping of Data Parallel Programs to OpenCL for GPU-Based Heterogeneous Systems
    Wang, Zheng
    Grewe, Dominik
    O'Boyle, Michael F. P.
    [J]. ACM TRANSACTIONS ON ARCHITECTURE AND CODE OPTIMIZATION, 2014, 11 (04)
  • [2] Portable Parallel Programs with Python']Python and OpenCL
    Di Pierro, Massimo
    [J]. COMPUTING IN SCIENCE & ENGINEERING, 2014, 16 (01) : 34 - 40
  • [3] Adaptive Optimization for OpenCL Programs on Embedded Heterogeneous Systems
    Taylor, Ben
    Marco, Vicent Sanz
    Wang, Zheng
    [J]. ACM SIGPLAN NOTICES, 2017, 52 (05) : 11 - 20
  • [4] Fuzzy classification of OpenCL programs targeting heterogeneous systems
    Al-Zoubi, Ahmad
    Tatas, Konstantinos
    Kyriacou, Costas
    [J]. JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2020, 39 (05) : 7189 - 7202
  • [5] OPENCL: A PARALLEL PROGRAMMING STANDARD FOR HETEROGENEOUS COMPUTING SYSTEMS
    Stone, John E.
    Gohara, David
    Shi, Guochun
    [J]. COMPUTING IN SCIENCE & ENGINEERING, 2010, 12 (03) : 66 - 72
  • [6] Automatic Mapping for OpenCL-Programs on CPU/GPU Heterogeneous Platforms
    Moren, Konrad
    Goehringer, Diana
    [J]. COMPUTATIONAL SCIENCE - ICCS 2018, PT II, 2018, 10861 : 301 - 314
  • [7] A Compiler Framework for Automatically Mapping Data Parallel Programs to Heterogeneous MPSoCs
    Chandramohan, Kiran
    O'Boyle, Michael F. P.
    [J]. 2014 INTERNATIONAL CONFERENCE ON COMPILERS, ARCHITECTURE AND SYNTHESIS FOR EMBEDDED SYSTEMS (CASES), 2014,
  • [8] OpenCL Implementation of a Parallel Universal Kriging Algorithm for Massive Spatial Data Interpolation on Heterogeneous Systems
    Huang, Fang
    Bu, Shuanshuan
    Tao, Jian
    Tan, Xicheng
    [J]. ISPRS INTERNATIONAL JOURNAL OF GEO-INFORMATION, 2016, 5 (06)
  • [9] Data Integration Tasks on Heterogeneous Systems Using OpenCL
    Faber, Clayton J.
    Cabrera, Anthony M.
    Booker, Oronde
    Maayan, Gabe
    Chamberlain, Roger D.
    [J]. PROCEEDINGS OF THE INTERNATIONAL WORKSHOP ON OPENCL (IWOCL'19), 2019,
  • [10] Mapping Estimator for OpenCL Heterogeneous Accelerators
    Perina, Andre Bannwart
    Bonato, Vanderlei
    [J]. 2018 INTERNATIONAL CONFERENCE ON FIELD-PROGRAMMABLE TECHNOLOGY (FPT 2018), 2018, : 297 - 300