Testing and Implementing Some New Algorithms Using the FFTW Library on Massively Parallel Supercomputers

被引:0
|
作者
Guarrasi, Massimiliano [1 ]
Li, Ning [2 ]
Frigio, Sandro [3 ]
Emerson, Andrew [1 ]
Erbacci, Giovanni [1 ]
机构
[1] CINECA Supercomp Ctr, Bologna, Italy
[2] NAG, Oxford, England
[3] Univ Camerino, Camerino, Italy
基金
欧盟第七框架计划;
关键词
FFT; Numerical Algorithms; HPC; Parallel Computing; Numerical Libraries; Blue Gene Q clusters;
D O I
10.3233/978-1-61499-381-0-375
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
The aim of this paper is to provide a strategy for overcoming the limits of codes employing the FFTW library by implementing a more powerful parallel domain decomposition algorithm and by refining the auto-tuning mechanism that is already implemented in this library. In the first part of this paper we identify some of the major performance bottlenecks present in the current FFTW implementation, in particular the auto-tuning mechanism provided in FFTW. To do this we have tested for the first time on a Blue Gene/Q system a 2D Parallel Domain Decomposition algorithm provided by the 2DECOMP&FFT library. We found that on massively parallel supercomputers such as Blue Gene/Q clusters the performance of this new algorithm is significantly higher. To demonstrate the benefits of the algorithm in a real application we included the library in a CFD code, BlowupNS, where we found a marked improvement in parallel scalability.
引用
收藏
页码:375 / 386
页数:12
相关论文
共 50 条
  • [31] A new massively parallel and distributed virtual machine model using mobile agents
    Youssfi, Mohamed
    Bouattane, Omar
    Bakkoury, Jamila
    Bensalah, Mohammed Ouadi
    [J]. 2014 INTERNATIONAL CONFERENCE ON MULTIMEDIA COMPUTING AND SYSTEMS (ICMCS), 2014, : 407 - 414
  • [32] A multicentre collaborative study on HIV drug resistance testing using 454 massively parallel pyrosequeneing
    Simen, B. B.
    Braverman, M. S.
    Abbate, I.
    Aerssens, J.
    Bidet, Y.
    Bouchez, O.
    Gabriel, C.
    Izopet, J.
    Kessler, H. H.
    Radonic, A.
    Metzner, K.
    Paredes, R.
    Recordon-Pinson, P.
    Sakwa, J.
    Schmitz-Agheguian, G. G.
    Dauemer, M. P.
    [J]. ANTIVIRAL THERAPY, 2010, 15 : A37 - A37
  • [33] Observations in using parallel and sequential evolutionary algorithms for automatic software testing
    Alba, Enrique
    Chicano, Francisco
    [J]. COMPUTERS & OPERATIONS RESEARCH, 2008, 35 (10) : 3161 - 3183
  • [34] PREIMPLANTATION GENETIC SCREENING BY MASSIVELY PARALLEL SEQUENCING ON A MISEQ USING TRANSPOSASE-BASED LIBRARY PREPARATION.
    Kaper, F.
    Royce, T.
    Klotzle, B.
    Lee, X.
    Wu, L.
    Zhang, Z.
    [J]. FERTILITY AND STERILITY, 2013, 100 (03) : S197 - S198
  • [35] SOME ALGORITHMS FOR PRIME TESTING USING GENERALIZED LEHMER FUNCTIONS
    WILLIAMS, HC
    JUDD, JS
    [J]. MATHEMATICS OF COMPUTATION, 1976, 30 (136) : 867 - 886
  • [36] SOME ALGORITHMS FOR PRIME TESTING USING GENERALIZED LEHMER FUNCTIONS
    WILLIAMS, HC
    JUDD, JS
    [J]. NOTICES OF THE AMERICAN MATHEMATICAL SOCIETY, 1976, 23 (01): : A46 - A46
  • [37] A New Programming Methodology for Broadcast Video Encoding Using a Massively Parallel Processor Achitecture
    Bruns, Michael W.
    Hunt, Martin A.
    Tong, Lin
    Bindloss, Keith
    [J]. SMPTE MOTION IMAGING JOURNAL, 2011, 120 (03): : 45 - 54
  • [39] Genetic testing for sporadic hearing loss using targeted massively parallel sequencing identifies 10 novel mutations
    Gu, X.
    Guo, L.
    Ji, H.
    Sun, S.
    Chai, R.
    Wang, L.
    Li, H.
    [J]. CLINICAL GENETICS, 2015, 87 (06) : 588 - 593
  • [40] Toward a new paradigm of DNA writing using a massively parallel sequencing platform and degenerate oligonucleotide
    Byungjin Hwang
    Duhee Bang
    [J]. Scientific Reports, 6