accULL: An OpenACC Implementation with CUDA and OpenCL Support

被引：0

作者：

Reyes, Ruyman ^{[1
]}

Lopez-Rodriguez, Ivan ^{[1
]}

Fumero, Juan J. ^{[1
]}

de Sande, Francisco ^{[1
]}

机构：

[1] Univ La Laguna, Dept EIO & Comp, San Cristobal la Laguna 38271, Spain

来源：

EURO-PAR 2012 PARALLEL PROCESSING | 2012年 / 7484卷

关键词：

D O I：

暂无

中图分类号：

TP31 [计算机软件];

学科分类号：

081202 ; 0835 ;

摘要：

The irruption in the HPC scene of hardware accelerators, like GPUs, has made available unprecedented performance to developers. However, even expert developers may not be ready to exploit the new complex processor hierarchies. We need to find a way to leverage the programming effort in these devices at programming language level, otherwise, developers will spend most of their time focusing on device-specific code instead of implementing algorithmic enhancements. The recent advent of the OpenACC standard for heterogeneous computing represents an effort in this direction. This initiative, combined with future releases of the OpenMP standard, will converge into a fully heterogeneous framework that will cope the programming requirements of future computer architectures. In this work we present accULL, a novel implementation of the OpenACC standard, based on the combination of a source to source compiler and a runtime library. To our knowledge, our approach is the first providing support for both OpenCL and CUDA platforms under this new standard.

引用

页码：871 / 882

页数：12

共 50 条

[1] Accelerating Hydrocodes with OpenACC, OpenCL and CUDA
Herdman, J. A.
Gaudin, W. P.
McIntosh-Smith, S.
Boulton, M.
Beckingsale, D. A.
Mallinson, A. C.
Jarvis, S. A.
[J]. 2012 SC COMPANION: HIGH PERFORMANCE COMPUTING, NETWORKING, STORAGE AND ANALYSIS (SCC), 2012, : 465 - 471
[2] Evaluating the Performance and Cost of Accelerating Seismic Processing with CUDA, OpenCL, OpenACC, and OpenMP
Gimenes, Tiago L.
Pisani, Flavia
Borin, Edson
[J]. 2018 32ND IEEE INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM (IPDPS), 2018, : 399 - 408
[3] A heterogeneous computing accelerated SCE-UA global optimization method using OpenMP, OpenCL, CUDA, and OpenACC
Kan, Guangyuan
He, Xiaoyan
Ding, Liuqian
Li, Jiren
Liang, Ke
Hong, Yang
[J]. WATER SCIENCE AND TECHNOLOGY, 2017, 76 (07) : 1640 - 1651
[4] An Early Performance Comparison of CUDA and OpenACC
Li, Xuechao
Shih, Po-Chou
[J]. 2018 3RD INTERNATIONAL CONFERENCE ON MEASUREMENT INSTRUMENTATION AND ELECTRONICS (ICMIE 2018), 2018, 208
[5] Performance Comparison of CUDA and OpenACC Based on Optimizations
Li, Xuechao
Shih, Po-Chou
[J]. PROCEEDINGS OF THE 2018 2ND HIGH PERFORMANCE COMPUTING AND CLUSTER TECHNOLOGIES CONFERENCE (HPCCT 2018), 2018, : 53 - 57
[6] A Source-to-Source OpenACC Compiler for CUDA
Tabuchi, Akihiro
Nakao, Masahiro
Sato, Mitsuhisa
[J]. EURO-PAR 2013: PARALLEL PROCESSING WORKSHOPS, 2014, 8374 : 178 - 187
[7] Nekbone performance on GPUs with OpenACC and CUDA Fortran implementations
Jing Gong
Stefano Markidis
Erwin Laure
Matthew Otten
Paul Fischer
Misun Min
[J]. The Journal of Supercomputing, 2016, 72 : 4160 - 4180
[8] Nekbone performance on GPUs with OpenACC and CUDA Fortran implementations
Gong, Jing
Markidis, Stefano
Laure, Erwin
Otten, Matthew
Fischer, Paul
Min, Misun
[J]. JOURNAL OF SUPERCOMPUTING, 2016, 72 (11): : 4160 - 4180
[9] Accelerating Phylogenetic Inference on GPUs: an OpenACC and CUDA comparison
Kuan, Lidia
Neves, Joao
Pratas, Frederico
Tomas, Pedro
Sousa, Leonel
[J]. PROCEEDINGS IWBBIO 2014: INTERNATIONAL WORK-CONFERENCE ON BIOINFORMATICS AND BIOMEDICAL ENGINEERING, VOLS 1 AND 2, 2014, : 589 - 600
[10] Swan: A tool for porting CUDA programs to OpenCL
Harvey, M. J.
De Fabritiis, G.
[J]. COMPUTER PHYSICS COMMUNICATIONS, 2011, 182 (04) : 1093 - 1099

← 1 2 3 4 5 →