Unified Schemes for Directive-Based GPU Offloading

被引:0
|
作者
Miki, Yohei [1 ]
Hanawa, Toshihiro [1 ]
机构
[1] Univ Tokyo, Informat Technol Ctr, Chiba 2770882, Japan
来源
IEEE ACCESS | 2024年 / 12卷
基金
日本学术振兴会;
关键词
Graphics processing units; Codes; Kernel; Costs; Multicore processing; Switches; Supercomputers; Programming; Libraries; User interfaces; Directive; GPU; OpenACC; OpenMP target; preprocessor macro; vendor lock-in;
D O I
10.1109/ACCESS.2024.3509380
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
GPU is the dominant accelerator device due to its high performance and energy efficiency. Directive-based GPU offloading using OpenACC or OpenMP target is a convenient way to port existing codes originally developed for multicore CPUs. Although OpenACC and OpenMP target provide similar features, both methods have pros and cons. OpenACC has better functions and an abundance of documents, but it is virtually for NVIDIA GPUs. OpenMP target supports NVIDIA/AMD/Intel GPUs but has fewer functions than OpenACC. Here, we have developed a header-only library, Solomon (Simple Off-LOading Macros Orchestrating multiple Notations), to unify the interface for GPU offloading with the support of both OpenACC and OpenMP target. Solomon provides three types of notations to reduce users' implementation and learning costs: intuitive notation for beginners and OpenACC/OpenMP-like notations for experienced developers. This manuscript denotes Solomon's implementation and usage and demonstrates the GPU-offloading in N-body simulation and the three-dimensional diffusion equation. The library and sample codes are provided as open-source software and publicly and freely available at https://github.com/ymikirepo/solomon.
引用
收藏
页码:181644 / 181665
页数:22
相关论文
共 50 条
  • [41] Evaluation of Directive-Based Programming Models for Stencil Computation on Current GPGPU Architectures
    Shan, Baodi
    Araya-Polo, Mauricio
    Chapman, Barbara
    ADVANCING OPENMP FOR FUTURE ACCELERATORS, IWOMP 2024, 2024, 15195 : 126 - 140
  • [42] Directive-based Auto-tuning for the Finite Difference Method on the Xeon Phi
    Katagiri, Takahiro
    Ohshima, Satoshi
    Matsumoto, Masaharu
    2015 IEEE 29TH INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM WORKSHOPS, 2015, : 1221 - 1230
  • [43] Effectiveness of strategic environmental assessment in Canada under directive-based and informal practice
    Noble, Bram
    Gibson, Robert
    White, Lisa
    Blakley, Jill
    Croal, Peter
    Nwanekezie, Kelechi
    Doelle, Meinhard
    IMPACT ASSESSMENT AND PROJECT APPRAISAL, 2019, 37 (3-4) : 344 - 355
  • [44] OpenACC to FPGA: A Framework for Directive-based High-Performance Reconfigurable Computing
    Lee, Seyong
    Kim, Jungwon
    Vetter, Jeffrey S.
    2016 IEEE 30TH INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM (IPDPS 2016), 2016, : 544 - 554
  • [45] Exploring the interoperability of remote GPGPU virtualization using rCUDA and directive-based programming models
    Adrián Castelló
    Antonio J. Peña
    Rafael Mayo
    Judit Planas
    Enrique S. Quintana-Ortí
    Pavan Balaji
    The Journal of Supercomputing, 2018, 74 : 5628 - 5642
  • [46] Exploring the interoperability of remote GPGPU virtualization using rCUDA and directive-based programming models
    Castello, Adrian
    Pena, Antonio J.
    Mayo, Rafael
    Planas, Judit
    Quintana-Orti, Enrique S.
    Balaji, Pavan
    JOURNAL OF SUPERCOMPUTING, 2018, 74 (11): : 5628 - 5642
  • [47] The Scalable Modeling System: directive-based code parallelization for distributed and shared memory computers
    Govett, M
    Hart, L
    Henderson, T
    Middlecoff, J
    Schaffer, D
    PARALLEL COMPUTING, 2003, 29 (08) : 995 - 1020
  • [48] Directive-Based, High-Level Programming and Optimizations for High-Performance Computing with FPGAs
    Lambert, Jacob
    Lee, Seyong
    Kim, Jungwon
    Vetter, Jeffrey S.
    Malony, Allen D.
    INTERNATIONAL CONFERENCE ON SUPERCOMPUTING (ICS 2018), 2018, : 160 - 171
  • [49] An Empirical Study of Parallelizing Test Execution Using CUDA Unified Memory and OpenMP GPU Offloading
    Bagies, Taghreed
    Jannesari, Ali
    2021 IEEE INTERNATIONAL CONFERENCE ON SOFTWARE TESTING, VERIFICATION AND VALIDATION WORKSHOPS (ICSTW 2021), 2021, : 271 - 278
  • [50] A GPU offloading mechanism for LHCb
    Badalov, Alexey
    Perez, Daniel Hugo Campora
    Zvyagin, Alexander
    Neufeld, Niko
    Cardona, Xavier Vilasis
    20TH INTERNATIONAL CONFERENCE ON COMPUTING IN HIGH ENERGY AND NUCLEAR PHYSICS (CHEP2013), PARTS 1-6, 2014, 513