Testing and Implementing Some New Algorithms Using the FFTW Library on Massively Parallel Supercomputers

Cited by: 0
Authors
Guarrasi, Massimiliano [1]
Li, Ning [2]
Frigio, Sandro [3]
Emerson, Andrew [1]
Erbacci, Giovanni [1]
Affiliations
[1] CINECA Supercomp Ctr, Bologna, Italy
[2] NAG, Oxford, England
[3] Univ Camerino, Camerino, Italy
Funding
European Union Seventh Framework Programme (FP7)
Keywords
FFT; Numerical Algorithms; HPC; Parallel Computing; Numerical Libraries; Blue Gene/Q clusters
DOI
10.3233/978-1-61499-381-0-375
Chinese Library Classification
TP3 [Computing technology, computer technology]
Discipline classification code
0812
Abstract
The aim of this paper is to provide a strategy for overcoming the limits of codes that employ the FFTW library, by implementing a more powerful parallel domain decomposition algorithm and by refining the auto-tuning mechanism already implemented in the library. In the first part of the paper we identify some of the major performance bottlenecks in the current FFTW implementation, in particular in the auto-tuning mechanism that FFTW provides. To do this, we tested, for the first time on a Blue Gene/Q system, the 2D parallel domain decomposition algorithm provided by the 2DECOMP&FFT library. We found that on massively parallel supercomputers such as Blue Gene/Q clusters the performance of this new algorithm is significantly higher. To demonstrate the benefits of the algorithm in a real application, we included the library in a CFD code, BlowupNS, where we found a marked improvement in parallel scalability.
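
The abstract does not spell out why a 2D domain decomposition scales further, so the following is only an illustrative sketch, not the authors' code. It assumes a cubic n x n x n grid, a 1D "slab" layout (one axis split across ranks, as in FFTW's MPI interface) and a 2D "pencil" layout (two axes split across a process grid, as in 2DECOMP&FFT). All function names here are hypothetical helpers; no FFTW or 2DECOMP&FFT calls are made.

```python
def split_evenly(n, parts):
    """Block sizes when n grid points are divided among `parts` ranks."""
    base, extra = divmod(n, parts)
    return [base + (1 if r < extra else 0) for r in range(parts)]


def slab_local_shapes(n, nprocs):
    """1D (slab) layout: only the first axis is split across all ranks."""
    return [(nx, n, n) for nx in split_evenly(n, nprocs)]


def pencil_local_shapes(n, prow, pcol):
    """2D (pencil) layout: first two axes split over a prow x pcol grid."""
    return [(nx, ny, n)
            for nx in split_evenly(n, prow)
            for ny in split_evenly(n, pcol)]


def idle_ranks(shapes):
    """Count ranks whose local block is empty (some dimension is zero)."""
    return sum(1 for shape in shapes if 0 in shape)


if __name__ == "__main__":
    n = 256  # hypothetical grid size per dimension
    slab = slab_local_shapes(n, 1024)
    pencil = pencil_local_shapes(n, 32, 32)
    print("slab idle ranks:  ", idle_ranks(slab))
    print("pencil idle ranks:", idle_ranks(pencil))
```

With these assumed sizes, a slab layout cannot give work to more than n ranks, whereas a pencil layout can use up to n x n ranks. This is one plausible reading of why the 2D decomposition is attractive on machines with very large core counts; the paper itself should be consulted for the measured results.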
Pages: 375-386
Number of pages: 12