Optimized Three-Dimensional Stencil Computation on Fermi and Kepler GPUs

Cited by: 0
Authors
Vizitiu, Anamaria [1]
Itu, Lucian
Nita, Cosmin
Suciu, Constantin
Affiliations
[1] SC Siemens SRL, Siemens Corp Technol, Brasov, Romania
Keywords
stencil; GPU; double precision; Kepler; Fermi; optimization
DOI
Not available
Chinese Library Classification (CLC) number
TP301 [Theory, Methods]
Discipline classification code
081202
Abstract
Stencil-based algorithms are used intensively in scientific computations. Graphics Processing Unit (GPU) based implementations of stencil computations speed up execution significantly compared to conventional CPU-only systems. In this paper we focus on double precision stencil computations, which are needed to meet the high accuracy requirements inherent in scientific computations. Starting from two baseline implementations (using two-dimensional and three-dimensional thread block structures, respectively), we employ different optimization techniques, which lead to seven kernel versions. Both Fermi and Kepler GPUs are used to evaluate the impact of the different optimization techniques on the two architectures. Overall, the GTX680 GPU card performs best for a kernel with a 2D thread block structure and optimized register and shared memory usage. We show that, whereas shared memory is not essential for Fermi GPUs, it is a highly efficient optimization technique for Kepler GPUs (mainly due to the different L1 cache usage). Furthermore, we evaluate the performance of Kepler GPU cards designed for desktop PCs and notebook PCs. The results indicate that the ratio of execution times is roughly equal to the inverse of the ratio of power consumption.
Pages: 6