Evaluating recursive filters on distributed memory parallel computers

被引：1

作者：

Stpiczynski, Przemyslaw ^{[1
]}

机构：

[1] Marie Curie Sklodowska Univ, Dept Comp Sci, PL-20031 Lublin, Poland

来源：

COMMUNICATIONS IN NUMERICAL METHODS IN ENGINEERING | 2006年 / 22卷 / 11期

关键词：

recursive filters; linear recurrences; BLAS; BSP model; mpi;

D O I：

10.1002/cnm.867

中图分类号：

T [工业技术];

学科分类号：

08 ;

摘要：

The aim of this paper is to show that the recently developed high performance divide and conquer algorithm for solving linear recurrence systems with constant coefficients together with the new BLAS-based algorithm for narrow-banded triangular Toeplitz matrix-vector multiplication, allow to evaluate linear recursive filters efficiently on distributed memory parallel computers. We apply the BSP model of parallel computing to predict the behaviour of the algorithm and to find the optimal values of the method's parameters. The results of experiments performed on a cluster of twelve dual-processor Itanium 2 computers and Cray X1 are also presented and discussed. The algorithm allows to utilize up to 30% of the peak performance of 24 Itanium processors, while a simple scalar algorithm can only utilize about 4% of the peak performance of a single processor. Copyright (C) 2006 John Wiley & Sons, Ltd.

引用

页码：1087 / 1095

页数：9

共 50 条

[1] Measuring the performance of parallel computers with distributed memory
Iushchenko R.A.
[J]. Cybernetics and Systems Analysis, 2009, 45 (6) : 941 - 951
[2] MEASURING THE PERFORMANCE OF PARALLEL COMPUTERS WITH DISTRIBUTED MEMORY
Iushehenko, R. A.
[J]. CYBERNETICS AND SYSTEMS ANALYSIS, 2009, 45 (06) : 933 - 943
[3] Parallel MSC/NASTRAN on distributed memory computers
Mayer, S
Poschmann, P
Shamsian, S
Komzsik, L
Boston, I
Surridge, M
[J]. HIGH-PERFORMANCE COMPUTING AND NETWORKING, 1995, 919 : 850 - 855
[4] Scalable parallel matrix multiplication on distributed memory parallel computers
[J]. Li, Keqin, 2000, IEEE, United States
[5] Scalable parallel matrix multiplication on distributed memory parallel computers
Li, KQ
[J]. JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 2001, 61 (12) : 1709 - 1731
[6] Parallel operation of CartaBlanca on shared and distributed memory computers
Padial-Collins, NT
VanderHeyden, WB
Zhang, DZ
Dendy, ED
Livescu, D
[J]. CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2004, 16 (01): : 61 - 77
[7] OPTIMIZATION OF DATA EXCHANGE IN PARALLEL COMPUTERS WITH DISTRIBUTED MEMORY
Adutskevich, E. V.
Likhoded, N. A.
[J]. CYBERNETICS AND SYSTEMS ANALYSIS, 2006, 42 (02) : 298 - 310
[8] LOCALITY ESTIMATION OF PARALLEL ALGORITHM FOR DISTRIBUTED MEMORY COMPUTERS
Likhoded, Nikolai A.
Tolstsikau, Aliaksei A.
[J]. DOKLADY NATSIONALNOI AKADEMII NAUK BELARUSI, 2020, 64 (06): : 647 - 656
[9] Parallel algebraic multigrid methods on distributed memory computers
Haase, G
Kuhn, M
Reitzinger, S
[J]. SIAM JOURNAL ON SCIENTIFIC COMPUTING, 2002, 24 (02): : 410 - 427
[10] An Implementation of Parallel MLFMA on a Cluster of Computers with Distributed Memory
Guo, Hailin
Xue, Xiaoyan
Wang, Xingang
Tong, Weiqin
Ni, Weili
[J]. PROCEEDINGS OF THE 9TH INTERNATIONAL CONFERENCE FOR YOUNG COMPUTER SCIENTISTS, VOLS 1-5, 2008, : 1379 - +

← 1 2 3 4 5 →