Key concepts for parallel out-of-core LU factorization

Cited by: 15
Authors
Dongarra, JJ [1]
Hammarling, S
Walker, DW
Affiliations
[1] Univ Tennessee, Dept Comp Sci, Knoxville, TN 37996 USA
[2] NAG Ltd, Oxford OX2 8DR, England
[3] Oak Ridge Natl Lab, Math Sci Sect, Oak Ridge, TN 37831 USA
Keywords
out-of-core LU factorization; dense matrices; parallel computing; performance
DOI
10.1016/S0898-1221(98)00029-7
Chinese Library Classification (CLC) number
O29 [Applied Mathematics]
Discipline classification code
070104
Abstract
This paper considers key ideas in the design of out-of-core dense LU factorization routines. A left-looking variant of the LU factorization algorithm is shown to require less I/O to disk than the right-looking variant, and is used to develop a parallel, out-of-core implementation. This implementation makes use of a small library of parallel I/O routines, together with ScaLAPACK and PBLAS routines. Results for runs on an Intel Paragon are presented and interpreted using a simple performance model.
Pages: 13-31
Page count: 19
Related papers
50 in total
  • [41] GAMER with out-of-core computation
    Schive, Hsi-Yu
    Tsai, Yu-Chih
    Chiueh, Tzihong
    [J]. COMPUTATIONAL STAR FORMATION, 2011, (270): 401 - 405
  • [42] LU FACTORIZATION ON PARALLEL COMPUTERS
    NETA, B
    TAI, HM
    [J]. COMPUTERS & MATHEMATICS WITH APPLICATIONS, 1985, 11 (06) : 573 - 579
  • [43] A Parallel Out-of-Core Algorithm for the Time-Domain Adaptive Integral Method
    Kaur, Guneet
    Yilmaz, Ali E.
    [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTATIONAL ELECTROMAGNETICS (ICCEM), 2015: 89 - 91
  • [44] Load Balanced Parallel GPU Out-of-Core for Continuous LOD Model Visualization
    Peng, Chao
    Mi, Peng
    Cao, Yong
    [J]. 2012 SC COMPANION: HIGH PERFORMANCE COMPUTING, NETWORKING, STORAGE AND ANALYSIS (SCC), 2012: 215 - 223
  • [45] An object-oriented method for out-of-core parallel computations on cluster of workstations
    Tang, JQ
    Fang, BX
    Hu, MZ
    Zhang, HL
    [J]. PARALLEL AND DISTRIBUTED COMPUTING, APPLICATIONS AND TECHNOLOGIES, PDCAT'2003, PROCEEDINGS, 2003: 507 - 510
  • [46] Parallel out-of-core divide-and-conquer techniques with application to classification trees
    Sreenivas, MK
    AlSabti, K
    Ranka, S
    [J]. IPPS/SPDP 1999: 13TH INTERNATIONAL PARALLEL PROCESSING SYMPOSIUM & 10TH SYMPOSIUM ON PARALLEL AND DISTRIBUTED PROCESSING, PROCEEDINGS, 1999: 555 - 562
  • [47] An out-of-core volume rendering architecture
    Amorim, Paulo H. J.
    de Moraes, Thiago F.
    da Silva, Jorge V. L.
    Pedrini, Helio
    [J]. COMPUTATIONAL VISION AND MEDICAL IMAGE PROCESSING IV, 2014: 173 - 179
  • [48] An Out-of-Core Sparse Cholesky Solver
    Reid, John K.
    Scott, Jennifer A.
    [J]. ACM TRANSACTIONS ON MATHEMATICAL SOFTWARE, 2009, 36 (02)
  • [49] Out-of-core segmentation by deformable models
    Giraldi, G
    Schaefer, L
    Farias, R
    Silva, R
    [J]. FUZZY LOGIC AND APPLICATIONS, 2006, 2955 : 216 - 223
  • [50] Amy files for out-of-core computations
    Zhang, Y
    Apon, A
    Pulay, P
    [J]. PDPTA'03: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED PROCESSING TECHNIQUES AND APPLICATIONS, VOLS 1-4, 2003: 191 - 197