OUT-OF-CORE IMPLEMENTATIONS OF CHOLESKY FACTORIZATION: LOOP-BASED VERSUS RECURSIVE ALGORITHMS

被引:5
|
作者
Bereux, Natacha [1 ]
机构
[1] Ecole Polytech, CNRS, CMAP, F-91128 Palaiseau, France
关键词
Cholesky factorization; out-of-core algorithms;
D O I
10.1137/06067256X
中图分类号
O29 [应用数学];
学科分类号
070104 ;
摘要
We compare, in the same framework, out-of-core implementations of the Cholesky factorization algorithm. The candidate implementations are the classical blocked left-looking variant and a more recent recursive formulation. Both have been implemented for real positive definite matrices: the former in the parallel out-of-core linear algebra package (POOCLAPACK) library and the latter in the scalable out-of-core linear algebra computations (SOLAR) library. We perform a theoretical analysis of the amount of input/output (I/O) operations required by each variant. We consider alternatives for the left-looking algorithm: the one-tile and two-tiles approaches. We show that when main memory is restricted, the one-tile approach yields less I/O volume. We then show that the left-looking implementation requires less I/O volume than the recursive variant. We have implemented all for complex matrices, and we report on numerical experiments.
引用
收藏
页码:1302 / 1319
页数:18
相关论文
共 50 条
  • [21] Locality optimization algorithms for compilation of out-of-core codes
    Kandemir, M
    Choudhary, A
    Ramanujam, J
    Kandaswamy, M
    JOURNAL OF INFORMATION SCIENCE AND ENGINEERING, 1998, 14 (01) : 107 - 138
  • [22] Out-of-Core Attribute Algorithms for Binary Partition Hierarchies
    Lefevre, Josselin
    Cousty, Jean
    Perret, Benjamin
    Phelippeau, Harold
    DISCRETE GEOMETRY AND MATHEMATICAL MORPHOLOGY, DGMM 2024, 2024, 14605 : 298 - 311
  • [23] Applying out-of-core QR decomposition algorithms on FPGA-based systems
    Tai, Yi-Gang
    Lo, Chia-Tien Dan
    Psarris, Kleanthis
    2007 INTERNATIONAL CONFERENCE ON FIELD PROGRAMMABLE LOGIC AND APPLICATIONS, PROCEEDINGS, VOLS 1 AND 2, 2007, : 86 - 91
  • [24] Synthesizing efficient out-of-core programs for block recursive algorithms using block-cyclic data distributions
    Li, ZY
    Reif, JH
    Gupta, SKS
    IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 1999, 10 (03) : 297 - 315
  • [25] Data locality optimization for synthesis of efficient out-of-core algorithms
    Krishnan, S
    Krishnamoorthy, S
    Baumgartner, G
    Cociorva, D
    Lam, CC
    Sadayappan, P
    Ramanujam, J
    Bernholdt, DE
    Choppella, V
    HIGH PERFORMANCE COMPUTING - HIPC 2003, 2003, 2913 : 406 - 417
  • [26] Efficient out-of-core sorting algorithms for the Parallel Disks Model
    Kundeti, Vamsi
    Rajasekaran, Sanguthevar
    JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 2011, 71 (11) : 1427 - 1433
  • [27] IN-LOOP FEATURE TRACKING FOR STRUCTURE AND MOTION WITH OUT-OF-CORE OPTIMIZATION
    Herrero, Nicolas
    Landabaso, Jose-Luis
    Gallego, Guillermo
    Pujol-Alcolado, Jose-Carlos
    2010 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, 2010, : 2937 - 2940
  • [28] Efficient synthesis of out-of-core algorithms using a nonlinear optimization solver
    Krishnan, S
    Krishnamoorthy, S
    Baumgartner, G
    Lam, CC
    Ramanujam, J
    Sadayappan, P
    Choppella, V
    JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 2006, 66 (05) : 659 - 673
  • [29] Efficient out-of-core algorithms for linear relaxation using blocking covers
    Leiserson, CE
    Rao, S
    Toledo, S
    JOURNAL OF COMPUTER AND SYSTEM SCIENCES, 1997, 54 (02) : 332 - 344
  • [30] Recursion Brings Speedup to Out-of-Core TensorCore-based Linear Algebra Algorithms: A Case Study of Classic Gram-Schmidt QR Factorization
    Zhang, Shaoshuai
    Wu, Panruo
    50TH INTERNATIONAL CONFERENCE ON PARALLEL PROCESSING, 2021,