Optimized Three-Dimensional Stencil Computation on Fermi and Kepler GPUs

被引:0
|
作者
Vizitiu, Anamaria [1 ]
Itu, Lucian
Nita, Cosmin
Suciu, Constantin
机构
[1] SC Siemens SRL, Siemens Corp Technol, Brasov, Romania
关键词
stencil; GPU; double precision; Kepler; Fermi; optimization;
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Stencil based algorithms are used intensively in scientific computations. Graphics Processing Units (GPU) based implementations of stencil computations speed-up the execution significantly compared to conventional CPU only systems. In this paper we focus on double precision stencil computations, which are required for meeting the high accuracy requirements, inherent for scientific computations. Starting from two baseline implementations (using two dimensional and three dimensional thread block structures respectively), we employ different optimization techniques which lead to seven kernel versions. Both Fermi and Kepler GPUs are used, to evaluate the impact of different optimization techniques for the two architectures. Overall, the GTX680 GPU card performs best for a kernel with 2D thread block structure and optimized register and shared memory usage. We show that, whereas shared memory is not essential for Fermi GPUs, it is a highly efficient optimization technique for Kepler GPUs (mainly due to the different L1 cache usage). Furthermore, we evaluate the performance of Kepler GPU cards designed for desktop PCs and notebook PCs. The results indicate that the ratio of execution time is roughly equal to the inverse of the ratio of power consumption.
引用
收藏
页数:6
相关论文
共 50 条
  • [21] Three-dimensional computation of flow around groyne
    Ouillon, S
    Dartus, D
    JOURNAL OF HYDRAULIC ENGINEERING, 1997, 123 (11) : 962 - 970
  • [22] A three-dimensional Lagrangian hurricane eyewall computation
    Rutherford, Blake
    Dangelmayr, Gerhard
    QUARTERLY JOURNAL OF THE ROYAL METEOROLOGICAL SOCIETY, 2010, 136 (653) : 1931 - 1944
  • [23] Three-dimensional display with optimized view distribution
    Lv, Guo-Jiao
    Zhao, Bai-Chuan
    Wu, Fei
    Wang, Qiong-Hua
    OPTICAL ENGINEERING, 2019, 58 (02)
  • [24] Optimized sequential design of three-dimensional tolerances
    Huang, MF
    Jiang, XQ
    Zhong, YR
    PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON MECHANICAL ENGINEERING AND MECHANICS 2005, VOLS 1 AND 2, 2005, : 227 - 232
  • [25] Polarization in a three-dimensional Fermi gas with Rabi coupling
    Penna, V
    Salasnich, L.
    JOURNAL OF PHYSICS B-ATOMIC MOLECULAR AND OPTICAL PHYSICS, 2019, 52 (03)
  • [26] Bose–Fermi mixtures in a three-dimensional optical lattice
    Q.-M. Liu
    X. Dai
    Z. Fang
    J.-N. Zhuang
    Y. Zhao
    Applied Physics B, 2010, 99 : 639 - 650
  • [27] Weyl Superfluidity in a Three-Dimensional Dipolar Fermi Gas
    Liu, Bo
    Li, Xiaopeng
    Yin, Lan
    Liu, W. Vincent
    PHYSICAL REVIEW LETTERS, 2015, 114 (04)
  • [28] A Parallel Optimization Method for Stencil Computation on the Domain that is Bigger than Memory Capacity of GPUs
    Jin, Guanghao
    Endo, Toshio
    Matsuoka, Satoshi
    2013 IEEE INTERNATIONAL CONFERENCE ON CLUSTER COMPUTING (CLUSTER), 2013,
  • [29] The algebra of the quantum nondegenerate three-dimensional Kepler-Coulomb potential
    Y. Tanoudis
    C. Daskaloyannis
    Physics of Atomic Nuclei, 2011, 74 : 1083 - 1089
  • [30] The Algebra of the Quantum Nondegenerate Three-Dimensional Kepler-Coulomb Potential
    Tanoudis, Y.
    Daskaloyannis, C.
    PHYSICS OF ATOMIC NUCLEI, 2011, 74 (07) : 1083 - 1089