Optimized Three-Dimensional Stencil Computation on Fermi and Kepler GPUs

Cited by: 0
Authors
Vizitiu, Anamaria [1]
Itu, Lucian
Nita, Cosmin
Suciu, Constantin
Affiliations
[1] SC Siemens SRL, Siemens Corp Technol, Brasov, Romania
Keywords
stencil; GPU; double precision; Kepler; Fermi; optimization
DOI
Not available
Chinese Library Classification number
TP301 [Theory, methods]
Discipline classification code
081202
Abstract
Stencil-based algorithms are used intensively in scientific computing. Graphics Processing Unit (GPU) implementations of stencil computations run significantly faster than implementations on conventional CPU-only systems. In this paper we focus on double-precision stencil computations, which are needed to meet the high accuracy requirements inherent in scientific computing. Starting from two baseline implementations (using two-dimensional and three-dimensional thread block structures, respectively), we apply different optimization techniques, which lead to seven kernel versions. Both Fermi and Kepler GPUs are used to evaluate the impact of the optimization techniques on the two architectures. Overall, the GTX680 GPU card performs best with a kernel that uses a 2D thread block structure and optimized register and shared memory usage. We show that, whereas shared memory is not essential for Fermi GPUs, it is a highly efficient optimization technique for Kepler GPUs (mainly due to the different L1 cache usage). Furthermore, we evaluate the performance of Kepler GPU cards designed for desktop PCs and notebook PCs. The results indicate that the ratio of execution times is roughly equal to the inverse of the ratio of power consumption.
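For readers unfamiliar with the computation the abstract describes, the following is a minimal CPU-side sketch of a double-precision 3D stencil in Python with NumPy. It is not the authors' code. The abstract does not state the stencil shape or coefficients, so the 7-point shape and the coefficients `c0` and `c1` are assumptions, as are the function names `stencil7` and `stencil7_plane_sweep`. The second function walks the grid one z-plane at a time with a three-plane window, which is one common way kernels with a 2D thread block structure are organized; whether the paper's kernels do exactly this is also an assumption.

```python
import numpy as np


def stencil7(u, c0=-6.0, c1=1.0):
    """Apply a 7-point stencil to the interior of a 3D float64 grid.

    c0 weights the centre point and c1 the six face neighbours
    (assumed values: the abstract does not give coefficients).
    Boundary points are left at zero.
    """
    u = np.asarray(u, dtype=np.float64)
    out = np.zeros_like(u)
    if min(u.shape) < 3:
        return out  # no interior points
    out[1:-1, 1:-1, 1:-1] = (
        c0 * u[1:-1, 1:-1, 1:-1]
        + c1 * (u[2:, 1:-1, 1:-1] + u[:-2, 1:-1, 1:-1]
                + u[1:-1, 2:, 1:-1] + u[1:-1, :-2, 1:-1]
                + u[1:-1, 1:-1, 2:] + u[1:-1, 1:-1, :-2])
    )
    return out


def stencil7_plane_sweep(u, c0=-6.0, c1=1.0):
    """Same stencil, computed plane by plane along the first axis.

    Only three planes (below, current, above) are live at any time,
    loosely mirroring a 2D thread block that marches through z.
    """
    u = np.asarray(u, dtype=np.float64)
    out = np.zeros_like(u)
    if min(u.shape) < 3:
        return out
    below, cur = u[0], u[1]
    for k in range(1, u.shape[0] - 1):
        above = u[k + 1]
        out[k, 1:-1, 1:-1] = (
            c0 * cur[1:-1, 1:-1]
            + c1 * (above[1:-1, 1:-1] + below[1:-1, 1:-1]
                    + cur[2:, 1:-1] + cur[:-2, 1:-1]
                    + cur[1:-1, 2:] + cur[1:-1, :-2])
        )
        below, cur = cur, above  # slide the window one plane up
    return out
```

A reference like this is mainly useful for checking the output of an optimized GPU kernel. With the default coefficients and unit grid spacing it computes a discrete Laplacian.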
Pages: 6