Optimized Three-Dimensional Stencil Computation on Fermi and Kepler GPUs

被引:0
|
作者
Vizitiu, Anamaria [1 ]
Itu, Lucian
Nita, Cosmin
Suciu, Constantin
机构
[1] SC Siemens SRL, Siemens Corp Technol, Brasov, Romania
关键词
stencil; GPU; double precision; Kepler; Fermi; optimization;
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Stencil based algorithms are used intensively in scientific computations. Graphics Processing Units (GPU) based implementations of stencil computations speed-up the execution significantly compared to conventional CPU only systems. In this paper we focus on double precision stencil computations, which are required for meeting the high accuracy requirements, inherent for scientific computations. Starting from two baseline implementations (using two dimensional and three dimensional thread block structures respectively), we employ different optimization techniques which lead to seven kernel versions. Both Fermi and Kepler GPUs are used, to evaluate the impact of different optimization techniques for the two architectures. Overall, the GTX680 GPU card performs best for a kernel with 2D thread block structure and optimized register and shared memory usage. We show that, whereas shared memory is not essential for Fermi GPUs, it is a highly efficient optimization technique for Kepler GPUs (mainly due to the different L1 cache usage). Furthermore, we evaluate the performance of Kepler GPU cards designed for desktop PCs and notebook PCs. The results indicate that the ratio of execution time is roughly equal to the inverse of the ratio of power consumption.
引用
收藏
页数:6
相关论文
共 50 条
  • [31] GPGPU computation and visualization of three-dimensional cellular automata
    Stéphane Gobron
    Arzu Çöltekin
    Hervé Bonafos
    Daniel Thalmann
    The Visual Computer, 2011, 27 : 67 - 81
  • [32] Computation of incompressible flow over three-dimensional cavities
    Yao, H
    Cooper, RK
    Raghunathan, SR
    ADVANCES IN FLUID MECHANICS III, 2000, 26 : 125 - 133
  • [33] Computation of magnetic gradients due to three-dimensional bodies
    Changli Yao
    Zhining Guan
    Science in China Series D: Earth Sciences, 1997, 40 : 293 - 299
  • [34] Three-dimensional computation of a synthetic jet in quiescent air
    Cui, J.
    Agarwal, R.K.
    AIAA Journal, 2006, 44 (12): : 2857 - 2865
  • [35] Computation of three-dimensional hypersonic flows in chemical nonequilibrium
    Menne, S., 1600, Publ by AIAA, Washington, DC, United States (31):
  • [36] Distributed computation in a three-dimensional mesh with communication delays
    Glazek, W
    PROCEEDINGS OF THE SIXTH EUROMICRO WORKSHOP ON PARALLEL AND DISTRIBUTED PROCESSING - PDP '98, 1998, : 38 - 42
  • [37] Computation of three-dimensional tokamak and spherical torus equilibria
    Park, Jong-Kyu
    Boozer, Allen H.
    Glasser, Alan H.
    PHYSICS OF PLASMAS, 2007, 14 (05)
  • [38] Finite element computation of three-dimensional elastoacoustic vibrations
    Bermúdez, A
    Hervella-Nieto, L
    Rodríguez, R
    JOURNAL OF SOUND AND VIBRATION, 1999, 219 (02) : 279 - 306
  • [39] Methods for three-dimensional flows computation in hydraulic turbines
    Cherny, SG
    Sharov, SV
    Skorospelov, VA
    Turuk, PA
    RUSSIAN JOURNAL OF NUMERICAL ANALYSIS AND MATHEMATICAL MODELLING, 2003, 18 (02) : 87 - 104
  • [40] On computation of the three-dimensional flow past a stretching sheet
    Ariel, P. Donald
    APPLIED MATHEMATICS AND COMPUTATION, 2007, 188 (02) : 1244 - 1250