accULL: An OpenACC Implementation with CUDA and OpenCL Support

被引:0
|
作者
Reyes, Ruyman [1 ]
Lopez-Rodriguez, Ivan [1 ]
Fumero, Juan J. [1 ]
de Sande, Francisco [1 ]
机构
[1] Univ La Laguna, Dept EIO & Comp, San Cristobal la Laguna 38271, Spain
来源
关键词
D O I
暂无
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
The irruption in the HPC scene of hardware accelerators, like GPUs, has made available unprecedented performance to developers. However, even expert developers may not be ready to exploit the new complex processor hierarchies. We need to find a way to leverage the programming effort in these devices at programming language level, otherwise, developers will spend most of their time focusing on device-specific code instead of implementing algorithmic enhancements. The recent advent of the OpenACC standard for heterogeneous computing represents an effort in this direction. This initiative, combined with future releases of the OpenMP standard, will converge into a fully heterogeneous framework that will cope the programming requirements of future computer architectures. In this work we present accULL, a novel implementation of the OpenACC standard, based on the combination of a source to source compiler and a runtime library. To our knowledge, our approach is the first providing support for both OpenCL and CUDA platforms under this new standard.
引用
收藏
页码:871 / 882
页数:12
相关论文
共 50 条
  • [1] Accelerating Hydrocodes with OpenACC, OpenCL and CUDA
    Herdman, J. A.
    Gaudin, W. P.
    McIntosh-Smith, S.
    Boulton, M.
    Beckingsale, D. A.
    Mallinson, A. C.
    Jarvis, S. A.
    [J]. 2012 SC COMPANION: HIGH PERFORMANCE COMPUTING, NETWORKING, STORAGE AND ANALYSIS (SCC), 2012, : 465 - 471
  • [2] Evaluating the Performance and Cost of Accelerating Seismic Processing with CUDA, OpenCL, OpenACC, and OpenMP
    Gimenes, Tiago L.
    Pisani, Flavia
    Borin, Edson
    [J]. 2018 32ND IEEE INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM (IPDPS), 2018, : 399 - 408
  • [3] A heterogeneous computing accelerated SCE-UA global optimization method using OpenMP, OpenCL, CUDA, and OpenACC
    Kan, Guangyuan
    He, Xiaoyan
    Ding, Liuqian
    Li, Jiren
    Liang, Ke
    Hong, Yang
    [J]. WATER SCIENCE AND TECHNOLOGY, 2017, 76 (07) : 1640 - 1651
  • [4] An Early Performance Comparison of CUDA and OpenACC
    Li, Xuechao
    Shih, Po-Chou
    [J]. 2018 3RD INTERNATIONAL CONFERENCE ON MEASUREMENT INSTRUMENTATION AND ELECTRONICS (ICMIE 2018), 2018, 208
  • [5] Performance Comparison of CUDA and OpenACC Based on Optimizations
    Li, Xuechao
    Shih, Po-Chou
    [J]. PROCEEDINGS OF THE 2018 2ND HIGH PERFORMANCE COMPUTING AND CLUSTER TECHNOLOGIES CONFERENCE (HPCCT 2018), 2018, : 53 - 57
  • [6] A Source-to-Source OpenACC Compiler for CUDA
    Tabuchi, Akihiro
    Nakao, Masahiro
    Sato, Mitsuhisa
    [J]. EURO-PAR 2013: PARALLEL PROCESSING WORKSHOPS, 2014, 8374 : 178 - 187
  • [7] Nekbone performance on GPUs with OpenACC and CUDA Fortran implementations
    Jing Gong
    Stefano Markidis
    Erwin Laure
    Matthew Otten
    Paul Fischer
    Misun Min
    [J]. The Journal of Supercomputing, 2016, 72 : 4160 - 4180
  • [8] Nekbone performance on GPUs with OpenACC and CUDA Fortran implementations
    Gong, Jing
    Markidis, Stefano
    Laure, Erwin
    Otten, Matthew
    Fischer, Paul
    Min, Misun
    [J]. JOURNAL OF SUPERCOMPUTING, 2016, 72 (11): : 4160 - 4180
  • [9] Accelerating Phylogenetic Inference on GPUs: an OpenACC and CUDA comparison
    Kuan, Lidia
    Neves, Joao
    Pratas, Frederico
    Tomas, Pedro
    Sousa, Leonel
    [J]. PROCEEDINGS IWBBIO 2014: INTERNATIONAL WORK-CONFERENCE ON BIOINFORMATICS AND BIOMEDICAL ENGINEERING, VOLS 1 AND 2, 2014, : 589 - 600
  • [10] Swan: A tool for porting CUDA programs to OpenCL
    Harvey, M. J.
    De Fabritiis, G.
    [J]. COMPUTER PHYSICS COMMUNICATIONS, 2011, 182 (04) : 1093 - 1099