Hands on with OpenMP4.5 and Unified Memory: Developing Applications for IBM's Hybrid CPU plus GPU Systems (Part II)

被引:4
|
作者
Grinberg, Leopold [1 ]
Bertolli, Carlo [1 ]
Haque, Riyaz [2 ]
机构
[1] IBM Res, Yorktown Hts, NY 10598 USA
[2] LLNL, Livermore, CA USA
关键词
OpenPOWER; HPC; Offloading; Directive based programming; Nested parallelism;
D O I
10.1007/978-3-319-65578-9_2
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Integration of multiple types of compute elements and memories in a single system requires proper support at a system-software level including operating system (OS), compilers, drivers, etc. The OS helps in scheduling work on different compute elements and manages memory operations in multiple memory pools including page migration. Compilers and programming languages provide tools for taking advantage of advanced architectural features. In this paper we encourage code developers to work with experimental versions of compilers and OpenMP standard extensions designed for hybrid OpenPOWER nodes. Specifically, we focus on nested parallelism and Unified Memory as key elements for efficient system-wide programming of CPU and GPU resources of OpenPOWER. We give implementation details using code samples and we discuss limitations of the presented approaches.
引用
收藏
页码:17 / 29
页数:13
相关论文
共 2 条