Evaluating recursive filters on distributed memory parallel computers

Cited by: 1
Author
Stpiczynski, Przemyslaw [1]
Affiliation
[1] Marie Curie Sklodowska Univ, Dept Comp Sci, PL-20031 Lublin, Poland
Source
Keywords
recursive filters; linear recurrences; BLAS; BSP model; MPI
DOI
10.1002/cnm.867
Chinese Library Classification (CLC) number
T [Industrial Technology]
Discipline classification code
08
Abstract
The aim of this paper is to show that the recently developed high-performance divide-and-conquer algorithm for solving linear recurrence systems with constant coefficients, together with the new BLAS-based algorithm for narrow-banded triangular Toeplitz matrix-vector multiplication, makes it possible to evaluate linear recursive filters efficiently on distributed memory parallel computers. We apply the BSP model of parallel computing to predict the behaviour of the algorithm and to find the optimal values of the method's parameters. The results of experiments performed on a cluster of twelve dual-processor Itanium 2 computers and on a Cray X1 are also presented and discussed. The algorithm can utilize up to 30% of the peak performance of 24 Itanium processors, while a simple scalar algorithm can utilize only about 4% of the peak performance of a single processor. Copyright (C) 2006 John Wiley & Sons, Ltd.
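To make the divide-and-conquer idea in the abstract concrete, the following is a minimal sketch of a generic blocked scheme for a linear recurrence with constant coefficients, x[k] = f[k] + a[0]*x[k-1] + ... + a[m-1]*x[k-m]. It is not the paper's BLAS-based algorithm and does not model BSP costs or MPI communication. The function names `scalar_recurrence` and `blocked_recurrence` and the block size `q` are hypothetical, chosen only for illustration. The input `f` stands for the already computed non-recursive part of the filter.

```python
def scalar_recurrence(a, f, init=None):
    """Solve x[k] = f[k] + sum_j a[j] * x[k-1-j] sequentially.

    init holds the m values preceding x[0], oldest first (zeros by default).
    """
    m = len(a)
    hist = list(init) if init is not None else [0.0] * m
    x = []
    for fk in f:
        xk = fk + sum(a[j] * hist[-1 - j] for j in range(m))
        x.append(xk)
        hist.append(xk)
    return x


def blocked_recurrence(a, f, q):
    """Blocked (divide-and-conquer style) solve; assumes m = len(a) >= 1."""
    m = len(a)
    blocks = [f[i:i + q] for i in range(0, len(f), q)]

    # Step 1: local solves with zero history. The blocks are independent,
    # so in a parallel setting each one could be handled by its own process.
    local = [scalar_recurrence(a, b) for b in blocks]

    # Zero-input responses to unit histories. They depend only on a and q,
    # so they are computed once and shared by all blocks.
    zeros = [0.0] * q
    basis = []
    for j in range(m):
        unit = [0.0] * m
        unit[j] = 1.0
        basis.append(scalar_recurrence(a, zeros, init=unit))

    # Step 2: sequential pass that carries the last m true values forward.
    # By linearity, true block = local solve + combination of basis responses.
    out = []
    tail = [0.0] * m
    for z in local:
        x = [z[k] + sum(tail[j] * basis[j][k] for j in range(m))
             for k in range(len(z))]
        out.extend(x)
        tail = (tail + x)[-m:]
    return out
```

In this sketch only the short `tail` of length m crosses block boundaries, which is why schemes of this kind suit distributed memory machines: the bulk of the work in step 1 needs no communication.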
Pages: 1087-1095
Page count: 9