Parallel pivots LU algorithm on the Cray T3E

被引:0
|
作者
Asenjo, R [1 ]
Zapata, EL [1 ]
机构
[1] Univ Malaga, Comp Architecture Dept, E-29071 Malaga, Spain
来源
PARALLEL COMPUTATION | 1999年 / 1557卷
关键词
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Solving large nonsymmetric sparse linear systems on distributed memory multiprocessors is an active research area. We present a loop-level parallelized generic LU algorithm which comprises analyse-factorize and solve stages. To further exploit matrix sparsity and parallelism, the analyse step looks for a set of compatible pivots. Sparse techniques are applied until the reduced submatrix reaches a threshold density. At this point, a switch to dense routines takes place in both analyse-factorize and solve stages. The SPMD code follows a sparse cyclic distribution to map the system matrix onto a P x Q processor mesh. Experimental results show a good behavior of our sequential algorithm compared with a standard generic solver: the MA48 routine. Additionally, a parallel version an the Gray T3E exhibits high performance in terms of speed-up and efficiency.
引用
收藏
页码:38 / 47
页数:10
相关论文
共 50 条
  • [1] Performance of parallel Gaussian 94 on the Cray T3E
    Sosa, CP
    Ochterski, J
    Carpenter, J
    Frisch, MJ
    ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 1997, 213 : 31 - COMP
  • [2] Parallelization and implementation of the NOABL program on CRAY T3E parallel machine
    Mastrangelo, V
    Mehilli, I
    INTERNATIONAL JOURNAL OF MODERN PHYSICS C, 2000, 11 (03): : 573 - 587
  • [3] Performance analysis on CRAY T3E
    Gerndt, M
    Mohr, B
    Pantano, M
    Wolf, F
    PROCEEDINGS OF THE SEVENTH EUROMICRO WORKSHOP ON PARALLEL AND DISTRIBUTED PROCESSING, PDP'99, 1999, : 241 - 248
  • [4] Parallelising the unified model for the Cray T3E
    Burton, P
    Dickinson, A
    MAKING ITS MARK, 1997, : 68 - 82
  • [5] PScheD - Political Scheduling on the CRAY T3E
    Lagerstrom, RN
    Gipp, SK
    JOB SCHEDULING STRATEGIES FOR PARALLEL PROCESSING, 1997, 1291 : 117 - 138
  • [6] Parallel rendering of 3D AMR data on the SGI/Cray T3E
    Ma, KL
    FRONTIERS '99 - THE SEVENTH SYMPOSIUM ON THE FRONTIERS OF MASSIVELY PARALLEL COMPUTATION, PROCEEDINGS, 1999, : 138 - 145
  • [7] Fine-grained multithreading on the Cray T3E
    Grävinghoff, A
    Keller, J
    HIGH PERFORMANCE COMPUTING IN SCIENCE AND ENGINEERING '99, 2000, : 447 - 456
  • [8] Cray T3E performances of a parallel code for a stochastic dynamic assets and liabilities management model
    Zanghirati, G
    Cocco, F
    Taddei, F
    Paruolo, G
    EURO-PAR'99: PARALLEL PROCESSING, 1999, 1685 : 1176 - 1186
  • [9] System utilization benchmark on the Cray T3E and IBM SP
    Wong, A
    Oliker, L
    Kramer, W
    Kaltz, T
    Bailey, D
    JOB SCHEDULING STRATEGIES FOR PARALLEL PROCESSING, PROCEEDINGS, 2000, 1911 : 56 - 67
  • [10] A recursive PVM implementation of an image segmentation algorithm with performance results comparing the HIVE and the Cray T3E
    Tilton, JC
    FRONTIERS '99 - THE SEVENTH SYMPOSIUM ON THE FRONTIERS OF MASSIVELY PARALLEL COMPUTATION, PROCEEDINGS, 1999, : 146 - 153