Stencil computations on heterogeneous platforms for the Jacobi method: GPUs versus Cell BE

被引:7
|
作者
Cecilia, Jose M. [1 ]
Abellan, Jose L. [2 ]
Fernandez, Juan [3 ]
Acacio, Manuel E. [2 ]
Garcia, Jose M. [2 ]
Ujaldon, Manuel [4 ]
机构
[1] Catholic Univ Murcia, Dept Comp Sci, Murcia, Spain
[2] Univ Murcia, Dept Comp Engn, Murcia, Spain
[3] Univ Politecn Cataluna, Intel Barcelona Res Ctr, Intel Labs, Barcelona, Spain
[4] Univ Malaga, Comp Architecture Dept, E-29071 Malaga, Spain
来源
JOURNAL OF SUPERCOMPUTING | 2012年 / 62卷 / 02期
关键词
Hardware accelerators; GPGPU; CELL; Stencil computations; GRAPHICS;
D O I
10.1007/s11227-012-0749-y
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
We are witnessing the consolidation of the heterogeneous computing in parallel computing with architectures such as Cell Broadband Engine (Cell BE) or Graphics Processing Units (GPUs) which are present in a myriad of developments for high performance computing. These platforms provide a Software Development Kit (SDK) to maximize performance at the expense of dealing with complex and low-level architectural details which makes the software development a daunting task. This paper explores stencil computations in several heterogeneous programming models like Cell SDK, CellSs, ALF and CUDA to optimize the Jacobi method for solving Laplace's differential equation. We describe the programming techniques to extract the maximum performance on the Cell BE and the GPU, and compare their computing paradigms. Experimental results are shown on two Nvidia Teslas and one IBM BladeCenter QS20 blade which incorporates two 3.2 GHz Cell BEs v 5.1. The speed-up factor for our set of GPU optimizations reaches 3-4x, and the execution times defeat those of the Cell BE by an order of magnitude, also showing great scalability when moving towards newer GPU generations and/or more demanding problem sizes.
引用
收藏
页码:787 / 803
页数:17
相关论文
共 50 条
  • [1] Stencil computations on heterogeneous platforms for the Jacobi method: GPUs versus Cell BE
    José M. Cecilia
    José L. Abellán
    Juan Fernández
    Manuel E. Acacio
    José M. García
    Manuel Ujaldón
    [J]. The Journal of Supercomputing, 2012, 62 : 787 - 803
  • [2] A Hybrid Circular Queue Method for Iterative Stencil Computations on GPUs
    Yang, Yang
    Cui, Hui-Min
    Feng, Xiao-Bing
    Xue, Jing-Ling
    [J]. JOURNAL OF COMPUTER SCIENCE AND TECHNOLOGY, 2012, 27 (01) : 57 - 74
  • [3] A Hybrid Circular Queue Method for Iterative Stencil Computations on GPUs
    Yang Yang
    Hui-Min Cui
    Xiao-Bing Feng
    Jing-Ling Xue
    [J]. Journal of Computer Science and Technology, 2012, 27 : 57 - 74
  • [4] A Hybrid Circular Queue Method for Iterative Stencil Computations on GPUs
    杨杨
    崔慧敏
    冯晓兵
    薛京灵
    [J]. Journal of Computer Science & Technology, 2012, (01) : 57 - 74
  • [5] A Hybrid Circular Queue Method for Iterative Stencil Computations on GPUs
    杨杨
    崔慧敏
    冯晓兵
    薛京灵
    [J]. Journal of Computer Science & Technology., 2012, 27 (01) - 74
  • [6] Register Caching for Stencil Computations on GPUs
    Falch, Thomas L.
    Elster, Anne C.
    [J]. 16TH INTERNATIONAL SYMPOSIUM ON SYMBOLIC AND NUMERIC ALGORITHMS FOR SCIENTIFIC COMPUTING (SYNASC 2014), 2014, : 479 - 486
  • [7] CUDA 2D Stencil Computations for the Jacobi Method
    Maria Cecilia, Jose
    Manuel Garcia, Jose
    Ujaldon, Manuel
    [J]. APPLIED PARALLEL AND SCIENTIFIC COMPUTING, PT I, 2012, 7133 : 173 - 183
  • [8] CUDA 2D stencil computations for the Jacobi method
    Computer Engineering and Technology Department, University of Murcia, Spain
    不详
    [J]. Lect. Notes Comput. Sci., PART 1 (173-183):
  • [9] Double precision stencil computations on Kepler GPUs
    Vizitiu, Anamaria
    Itu, Lucian
    Lazar, Laszlo
    Suciu, Constantin
    [J]. 2014 18TH INTERNATIONAL CONFERENCE SYSTEM THEORY, CONTROL AND COMPUTING (ICSTCC), 2014, : 123 - 127
  • [10] Autotuning Stencil-Based Computations on GPUs
    Mametjanov, Azamat
    Lowell, Daniel
    Ma, Ching-Chen
    Norris, Boyana
    [J]. 2012 IEEE INTERNATIONAL CONFERENCE ON CLUSTER COMPUTING (CLUSTER), 2012, : 266 - 274