Speculative Execution of Parallel Programs with Precise Exception Semantics on GPUs

被引:2
|
作者
Hayashi, Akihiro [1 ]
Grossman, Max [1 ]
Zhao, Jisheng [1 ]
Shirako, Jun [1 ]
Sarkar, Vivek [1 ]
机构
[1] Rice Univ, Dept Comp Sci, Houston, TX 77005 USA
关键词
!text type='JAVA']JAVA[!/text;
D O I
10.1007/978-3-319-09967-5_20
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
General purpose computing on GPUs (GPGPU) can enable significant performance and energy improvements for certain classes of applications. However, current GPGPU programming models, such as CUDA and OpenCL, are only accessible by systems experts through low-level C/C++ APIs. In contrast, large numbers of programmers use high-level languages, such as Java, due to their productivity advantages of type safety, managed runtimes and precise exception semantics. Current approaches to enabling GPGPU computing in Java and other managed languages involve low-level interfaces to native code that compromise the semantic guarantees of managed languages, and are not readily accessible to mainstream programmers. In this paper, we propose compile-time and runtime technique for accelerating Java programs with automatic generation of OpenCL while preserving precise exception semantics. Our approach includes (1) automatic generation of OpenCL kernels and JNI glue code from a Java-based parallel-loop construct (forall), (2) speculative execution of OpenCL kernels on GPUs, and (3) automatic generation of optimized and parallel exception-checking code for execution on the CPU. A key insight in supporting our speculative execution is that the GPU's device memory is separate from the CPU's main memory, so that, in the case of a mis-speculation (exception), any side effects in a GPU kernel can be ignored by simply not communicating results back to the CPU. We demonstrate the efficiency of our approach using eight Java benchmarks on two GPU-equipped platforms. Experimental results show that our approach can significantly accelerate certain classes of Java programs on GPUs while maintaining precise exception semantics.
引用
收藏
页码:342 / 356
页数:15
相关论文
共 50 条
  • [41] PATH FOR AND-PARALLEL EXECUTION OF LOGIC PROGRAMS
    LEE, SH
    KIM, DH
    CHOE, KM
    [J]. INFORMATION PROCESSING LETTERS, 1994, 52 (04) : 191 - 199
  • [42] STOCHASTIC BOUNDS ON EXECUTION TIMES OF PARALLEL PROGRAMS
    YAZICIPEKERGIN, N
    VINCENT, JM
    [J]. IEEE TRANSACTIONS ON SOFTWARE ENGINEERING, 1991, 17 (10) : 1005 - 1012
  • [43] Visualization of and/or-parallel execution of logic programs
    Vaupel, R
    Pontelli, E
    Gupta, G
    [J]. LOGIC PROGRAMMING: PROCEEDINGS OF THE FOURTEENTH INTERNATIONAL CONFERENCE ON LOGIC PROGRAMMING, 1997, : 271 - 285
  • [44] Architecture for visualizing the execution of parallel logic programs
    Brayshaw, Mike
    [J]. IJCAI, Proceedings of the International Joint Conference on Artificial Intelligence, 1600, 2
  • [45] Remote execution of data-parallel programs
    Borowiec, J
    [J]. INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED PROCESSING TECHNIQUES AND APPLICATIONS, VOLS I-IV, PROCEEDINGS, 1998, : 1272 - 1279
  • [46] SYNTHESIZING OPERATIONAL DEVICES FOR PARALLEL EXECUTION OF PROGRAMS
    EKMANIS, EE
    KALNINSH, JJ
    [J]. AVTOMATIKA I VYCHISLITELNAYA TEKHNIKA, 1982, (06): : 33 - 39
  • [47] Adaptive execution techniques of parallel programs for multiprocessors
    Lee, Jaejin
    Park, Jung-Ho
    Kim, Honggyu
    Jung, Changhee
    Lim, Daeseob
    Han, SangYong
    [J]. JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 2010, 70 (05) : 467 - 480
  • [48] Parallel execution of logic programs by load sharing
    NEC Research Institute, Inc., 4 Independence Way, Princeton, NJ 08540, United States
    [J]. J Logic Program, 1 (XI-51):
  • [49] Identifying causes of execution failure for parallel programs
    Liu Y.
    Gao Y.
    Zhang G.
    [J]. Guofang Keji Daxue Xuebao/Journal of National University of Defense Technology, 2022, 44 (05): : 45 - 52
  • [50] AUTOMATIC RESTRUCTURING OF FORTRAN PROGRAMS FOR PARALLEL EXECUTION
    POLYCHRONOPOULOS, CD
    [J]. LECTURE NOTES IN COMPUTER SCIENCE, 1988, 295 : 107 - 130