An Extension of OpenACC Directives for Out-of-Core Stencil Computation with Temporal Blocking

Cited by: 0
Authors
Miki, Nobuhiro [1]
Ino, Fumihiko [1]
Hagihara, Kenichi [1]
Affiliations
[1] Osaka Univ, Grad Sch Informat Sci & Technol, 1-5 Yamadaoka, Suita, Osaka 5650871, Japan
Funding
Japan Society for the Promotion of Science; Japan Science and Technology Agency
Keywords
DOI
10.1109/WACCPD.2016.10
Chinese Library Classification number
TP31 [Computer software]
Discipline classification codes
081202; 0835
Abstract
In this paper, aiming at realizing directive-based temporal blocking for out-of-core stencil computation, we present an extension of OpenACC directives and a source-to-source translator capable of accelerating out-of-core stencil computation on a graphics processing unit (GPU). Out-of-core stencil computation here deals with large data that cannot be entirely stored in GPU memory. Given an OpenACC-like code, the proposed translator generates an OpenACC code such that it decomposes large data into smaller chunks, which are then processed in a pipelined manner to hide the data transfer overhead needed for exchanging chunks between the GPU memory and CPU memory. Furthermore, the generated code is optimized with a temporal blocking technique to minimize the amount of CPU-GPU data transfer. In experiments, we apply the proposed translator to three stencil computation codes. The out-of-core performance on a Tesla K40 GPU reaches 73.4 GFLOPS, which is only 13% lower than the in-core performance. Therefore, we think that our directive-based approach is useful for facilitating out-of-core stencil computation on a GPU.
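The chunk decomposition and temporal blocking described in the abstract can be illustrated with a minimal Python sketch. This is not the authors' translator or their directive syntax. The 1-D three-point averaging stencil, the function names (`step`, `in_core`, `out_of_core`), and the parameters `chunk` and `k` are all assumptions made for illustration. List slices stand in for CPU-GPU copies, and the pipelining that overlaps transfers with computation is omitted. The idea shown is that each chunk is loaded together with a halo of `k` cells per side and advanced `k` time steps before being written back, so it is transferred once per `k` steps rather than once per step.

```python
def step(buf):
    """One 3-point averaging step; the two end cells are held fixed."""
    out = buf[:]
    for i in range(1, len(buf) - 1):
        out[i] = (buf[i - 1] + buf[i] + buf[i + 1]) / 3.0
    return out


def in_core(grid, steps):
    """Reference: the whole grid is resident and advanced step by step."""
    for _ in range(steps):
        grid = step(grid)
    return grid


def out_of_core(grid, steps, chunk, k):
    """Chunked computation with temporal blocking (hypothetical sketch).

    Each chunk [s, e) is loaded with a halo of up to k cells per side,
    advanced k time steps locally, and only its interior is written back.
    Errors from an artificial buffer edge move inward one cell per step,
    so after k steps they are still confined to the halo.
    """
    n = len(grid)
    done = 0
    while done < steps:
        kk = min(k, steps - done)           # last block may be shorter
        nxt = grid[:]
        for s in range(0, n, chunk):
            e = min(s + chunk, n)
            lo, hi = max(0, s - kk), min(n, e + kk)
            buf = grid[lo:hi]               # stands in for a host-to-device copy
            for _ in range(kk):
                buf = step(buf)
            nxt[s:e] = buf[s - lo:e - lo]   # stands in for a device-to-host copy
        grid = nxt
        done += kk
    return grid


if __name__ == "__main__":
    data = [float((i * 7) % 13) for i in range(64)]
    ref = in_core(data, 10)
    blocked = out_of_core(data, 10, chunk=16, k=4)
    print(max(abs(a - b) for a, b in zip(ref, blocked)))
```

A larger `k` reduces the number of transfers per chunk but enlarges the halo, and the halo cells are recomputed redundantly by neighboring chunks.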
Pages: 36-45
Page count: 10