Unified Schemes for Directive-Based GPU Offloading

被引：0

作者：

Miki, Yohei ^{[1
]}

Hanawa, Toshihiro ^{[1
]}

机构：

[1] Univ Tokyo, Informat Technol Ctr, Chiba 2770882, Japan

来源：

IEEE ACCESS | 2024年 / 12卷

基金：

日本学术振兴会;

关键词：

Graphics processing units; Codes; Kernel; Costs; Multicore processing; Switches; Supercomputers; Programming; Libraries; User interfaces; Directive; GPU; OpenACC; OpenMP target; preprocessor macro; vendor lock-in;

D O I：

10.1109/ACCESS.2024.3509380

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

GPU is the dominant accelerator device due to its high performance and energy efficiency. Directive-based GPU offloading using OpenACC or OpenMP target is a convenient way to port existing codes originally developed for multicore CPUs. Although OpenACC and OpenMP target provide similar features, both methods have pros and cons. OpenACC has better functions and an abundance of documents, but it is virtually for NVIDIA GPUs. OpenMP target supports NVIDIA/AMD/Intel GPUs but has fewer functions than OpenACC. Here, we have developed a header-only library, Solomon (Simple Off-LOading Macros Orchestrating multiple Notations), to unify the interface for GPU offloading with the support of both OpenACC and OpenMP target. Solomon provides three types of notations to reduce users' implementation and learning costs: intuitive notation for beginners and OpenACC/OpenMP-like notations for experienced developers. This manuscript denotes Solomon's implementation and usage and demonstrates the GPU-offloading in N-body simulation and the three-dimensional diffusion equation. The library and sample codes are provided as open-source software and publicly and freely available at https://github.com/ymikirepo/solomon.

引用

页码：181644 / 181665

页数：22

共 50 条

[41] Evaluation of Directive-Based Programming Models for Stencil Computation on Current GPGPU Architectures
Shan, Baodi
Araya-Polo, Mauricio
Chapman, Barbara
ADVANCING OPENMP FOR FUTURE ACCELERATORS, IWOMP 2024, 2024, 15195 : 126 - 140
[42] Directive-based Auto-tuning for the Finite Difference Method on the Xeon Phi
Katagiri, Takahiro
Ohshima, Satoshi
Matsumoto, Masaharu
2015 IEEE 29TH INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM WORKSHOPS, 2015, : 1221 - 1230
[43] Effectiveness of strategic environmental assessment in Canada under directive-based and informal practice
Noble, Bram
Gibson, Robert
White, Lisa
Blakley, Jill
Croal, Peter
Nwanekezie, Kelechi
Doelle, Meinhard
IMPACT ASSESSMENT AND PROJECT APPRAISAL, 2019, 37 (3-4) : 344 - 355
[44] OpenACC to FPGA: A Framework for Directive-based High-Performance Reconfigurable Computing
Lee, Seyong
Kim, Jungwon
Vetter, Jeffrey S.
2016 IEEE 30TH INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM (IPDPS 2016), 2016, : 544 - 554
[45] Exploring the interoperability of remote GPGPU virtualization using rCUDA and directive-based programming models
Adrián Castelló
Antonio J. Peña
Rafael Mayo
Judit Planas
Enrique S. Quintana-Ortí
Pavan Balaji
The Journal of Supercomputing, 2018, 74 : 5628 - 5642
[46] Exploring the interoperability of remote GPGPU virtualization using rCUDA and directive-based programming models
Castello, Adrian
Pena, Antonio J.
Mayo, Rafael
Planas, Judit
Quintana-Orti, Enrique S.
Balaji, Pavan
JOURNAL OF SUPERCOMPUTING, 2018, 74 (11): : 5628 - 5642
[47] The Scalable Modeling System: directive-based code parallelization for distributed and shared memory computers
Govett, M
Hart, L
Henderson, T
Middlecoff, J
Schaffer, D
PARALLEL COMPUTING, 2003, 29 (08) : 995 - 1020
[48] Directive-Based, High-Level Programming and Optimizations for High-Performance Computing with FPGAs
Lambert, Jacob
Lee, Seyong
Kim, Jungwon
Vetter, Jeffrey S.
Malony, Allen D.
INTERNATIONAL CONFERENCE ON SUPERCOMPUTING (ICS 2018), 2018, : 160 - 171
[49] An Empirical Study of Parallelizing Test Execution Using CUDA Unified Memory and OpenMP GPU Offloading
Bagies, Taghreed
Jannesari, Ali
2021 IEEE INTERNATIONAL CONFERENCE ON SOFTWARE TESTING, VERIFICATION AND VALIDATION WORKSHOPS (ICSTW 2021), 2021, : 271 - 278
[50] A GPU offloading mechanism for LHCb
Badalov, Alexey
Perez, Daniel Hugo Campora
Zvyagin, Alexander
Neufeld, Niko
Cardona, Xavier Vilasis
20TH INTERNATIONAL CONFERENCE ON COMPUTING IN HIGH ENERGY AND NUCLEAR PHYSICS (CHEP2013), PARTS 1-6, 2014, 513

← 1 2 3 4 5 →