Reconsidering algorithms for iterative solvers in the multicore era

被引:1
|
作者
Wallin, Dan [1 ]
Lof, Henrik [1 ]
Hagersten, Erik [1 ]
Holmgren, Sverker [1 ]
机构
[1] Uppsala Univ, Dept Informat Technol, POB 337, SE-75105 Uppsala, Sweden
关键词
multicore processors; data locality; parallel multigrid;
D O I
10.1504/IJCSE.2009.029163
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Efficient solution of computational problems require a match between the algorithm and the underlying architecture. New multicore processors feature low intra-chip communication cost and smaller per-thread caches compared to single-core implementations, indicating that data locality issues are more important than communication overheads. We investigate the impact of these changes on parallel multigrid methods. We present a temporally blocked, naturally ordered, smoother implementation that improves the data locality as much as ten times. compared with the standard red-black algorithm. We present results of the performance of our new algorithm on an SMP system, an UltraSPARC T1 (Niagara) SMT/CMP, and a simulated CMP processor.
引用
收藏
页码:270 / 282
页数:13
相关论文
共 50 条
  • [31] Performance Prediction for Parallel Iterative Solvers
    V. Blanco
    P. González
    J. C. Cabaleiro
    D. B. Heras
    T. F. Pena
    J. J. Pombo
    F. F. Rivera
    The Journal of Supercomputing, 2004, 28 : 177 - 191
  • [32] Performance prediction for parallel iterative solvers
    Blanco, V
    González, P
    Cabaleiro, JC
    Heras, DB
    Pena, TF
    Pombo, JJ
    Rivera, FF
    COMPUTATIONAL SCIENCE-ICCS 2002, PT II, PROCEEDINGS, 2002, 2330 : 923 - 932
  • [33] A Blocking Strategy on Multicore Architectures for Dynamically Adaptive PDE Solvers
    Eckhardt, Wolfgang
    Weinzierl, Tobias
    PARALLEL PROCESSING AND APPLIED MATHEMATICS, PT I, 2010, 6067 : 567 - 575
  • [34] Evaluating Point-Based POMDP Solvers on Multicore Machines
    Shani, Guy
    IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART B-CYBERNETICS, 2010, 40 (04): : 1062 - 1074
  • [35] Performance and energy benchmarking of spectral solvers on hybrid multicore machines
    Banerjee, Tania
    Rabb, Jacob
    Ranka, Sanjay
    SUSTAINABLE COMPUTING-INFORMATICS & SYSTEMS, 2016, 12 : 10 - 20
  • [36] Efficiency increment in 3D multi-zone boundary element algorithms by use of iterative solvers
    Araüjo, Francisco C.
    Belmonte, Geraldo J.
    Freitas, Marcilio S. R.
    Journal of the Chinese Institute of Engineers, Transactions of the Chinese Institute of Engineers,Series A/Chung-kuo Kung Ch'eng Hsuch K'an, 2000, 23 (03): : 269 - 274
  • [37] Efficiency increment in 3D multi-zone boundary element algorithms by use of iterative solvers
    Araújo, FC
    Belmonte, GJ
    Freitas, MSR
    JOURNAL OF THE CHINESE INSTITUTE OF ENGINEERS, 2000, 23 (03) : 269 - 274
  • [38] Amdahl's law in the multicore era
    Hill, Mark D.
    Marty, Michael R.
    COMPUTER, 2008, 41 (07) : 33 - +
  • [39] Parallel iterative solvers with localized ILU preconditioning
    Nakajima, K
    Nakamura, H
    Tanahashi, T
    PARALLEL COMPUTATIONAL FLUID DYNAMICS: RECENT DEVELOPMENTS AND ADVANCES USING PARALLEL COMPUTERS, 1998, : 359 - 366
  • [40] Fast iterative solvers for discrete Stokes equations
    Peters, J
    Reichelt, V
    Reusken, A
    SIAM JOURNAL ON SCIENTIFIC COMPUTING, 2005, 27 (02): : 646 - 666