Scientific Computing Kernels on the Cell Processor

被引:0
|
作者
Samuel Williams
John Shalf
Leonid Oliker
Shoaib Kamil
Parry Husbands
Katherine Yelick
机构
[1] CRD/NERSC,Lawrence Berkeley National Laboratory
关键词
Cell processor; GEMM; SpMV; sparse matrix; FFT; Stencil; three level memory;
D O I
暂无
中图分类号
学科分类号
摘要
In this work, we examine the potential of using the recently-released STI Cell processor as a building block for future high-end scientific computing systems. Our work contains several novel contributions. First, we introduce a performance model for Cell and apply it to several key numerical kernels: dense matrix multiply, sparse matrix vector multiply, stencil computations, and 1D/2D FFTs. Next, we validate our model by comparing results against published hardware data, as well as our own Cell blade implementations. Additionally, we compare Cell performance to benchmarks run on leading superscalar (AMD Opteron), VLIW (Intel Itanium2), and vector (Cray X1E) architectures. Our work also explores several different kernel implementations and demonstrates a simple and effective programming model for Cell’s unique architecture. Finally, we propose modest microarchitectural modifications that could significantly increase the efficiency of double-precision calculations. Overall results demonstrate the tremendous potential of the Cell architecture for scientific computations in terms of both raw performance and power efficiency.
引用
收藏
页码:263 / 298
页数:35
相关论文
共 50 条
  • [1] Scientific computing kernels on the cell processor
    Williams, Samuel
    Shalf, John
    Oliker, Leonid
    Kamil, Shoaib
    Husbands, Parry
    Yelick, Katherine
    INTERNATIONAL JOURNAL OF PARALLEL PROGRAMMING, 2007, 35 (03) : 263 - 298
  • [2] Scientific computing on the Itanium® processor
    Greer, Bruce
    Harrison, John
    Henry, Greg
    Li, Wei
    Tang, Peter
    Scientific Programming, 2002, 10 (04) : 329 - 337
  • [3] Scientific computing applications on a stream processor
    Zhang, Ying
    Yang, Xuejun
    Wang, Guibin
    Rogerst, Ian
    Li, Gen
    Deng, Yu
    Yan, Xiaobo
    ISPASS 2008: IEEE INTERNATIONAL SYMPOSIUM ON PERFORMANCE ANALYSIS OF SYSTEMS AND SOFTWARE, 2008, : 105 - +
  • [4] Scientific computing applications on the Imagine stream processor
    Du, Jing
    Yang, Xuejun
    Wang, Guibin
    Ao, Fujiang
    ADVANCES IN COMPUTER SYSTEMS ARCHITECTURE, PROCEEDINGS, 2006, 4186 : 38 - 51
  • [5] INTEGRATING AN ARRAY PROCESSOR INTO A SCIENTIFIC COMPUTING SYSTEM
    MARON, N
    BRENGLE, TA
    COMPUTER, 1981, 14 (09) : 41 - &
  • [6] An acceleration processor for data intensive scientific computing
    Kim, CG
    Kim, HS
    Kang, SH
    Kim, SD
    Han, GH
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2004, E87D (07): : 1766 - 1773
  • [7] A Customized Processor for Energy Efficient Scientific Computing
    Sethia, Ankit
    Dasika, Ganesh
    Mudge, Trevor
    Mahlke, Scott
    IEEE TRANSACTIONS ON COMPUTERS, 2012, 61 (12) : 1711 - 1723
  • [8] The use of configurable computing for computational kernels in scientific simulations
    Jones, M
    Nakad, Z
    Plassmann, P
    Yi, YH
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2006, 22 (1-2): : 67 - 79
  • [9] AN INTEGRATED FLOATING POINT VECTOR PROCESSOR FOR DSP AND SCIENTIFIC COMPUTING
    SPADERNA, D
    GREEN, P
    TAM, K
    DATTA, T
    KUMAR, M
    PROCEEDINGS - IEEE INTERNATIONAL CONFERENCE ON COMPUTER DESIGN : VLSI IN COMPUTERS & PROCESSORS, 1989, : 8 - 13
  • [10] Accelerating computing with the cell broadband engine processor
    Crawford, Catherine
    Henning, Paul
    Kistler, Michael
    Wright, Cornell
    2008 IEEE INTERNATIONAL SYMPOSIUM ON PARALLEL & DISTRIBUTED PROCESSING, VOLS 1-8, 2008, : 353 - 353