PARALLEL SPARSE QR FACTORIZATION ON SHARED-MEMORY ARCHITECTURES

Cited by: 15
Author
MATSTOMS, P
Affiliation
[1] Department of Mathematics, Linköping University
Keywords
ORTHOGONAL DECOMPOSITION; SPARSE MATRIX; PARALLEL PROGRAMMING; MULTIFRONTAL METHOD; SHARED MEMORY MULTIPROCESS SYSTEM;
DOI
10.1016/0167-8191(94)00092-O
Chinese Library Classification number
TP301 [Theory, Methods]
Discipline classification code
081202
Abstract
We discuss a parallel shared-memory implementation of multifrontal QR factorization. To achieve high performance for general large and sparse matrices, a combination of tree-level and node-level parallelism is used. Acceptable load balancing is obtained by the use of a pool-of-tasks approach. For the storage of frontal and update matrices, we use a buddy system based on Fibonacci blocks. It turns out to be more efficient than blocks of size 2^i, as proposed by other authors. The order in which memory space for update and frontal matrices is allocated is also shown to be of importance. An implementation of the proposed algorithm on the CRAY X-MP/416 (four processors) gives speedups of about three, with about 20% of extra real memory space required.
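The abstract does not give the allocator's details, so the following is only a rough sketch of why Fibonacci block sizes can waste less space than blocks of size 2^i. It assumes that each request is rounded up to the smallest available block size; the function names and the request sizes are hypothetical and are not taken from the paper. Splitting and coalescing of buddies, which a real buddy system needs, are left out.

```python
from bisect import bisect_left


def fibonacci_sizes(limit):
    """Block sizes 1, 2, 3, 5, 8, ... up to the first size >= limit."""
    sizes = [1, 2]
    while sizes[-1] < limit:
        sizes.append(sizes[-1] + sizes[-2])
    return sizes


def power_of_two_sizes(limit):
    """Block sizes 1, 2, 4, 8, ... up to the first size >= limit."""
    sizes = [1]
    while sizes[-1] < limit:
        sizes.append(sizes[-1] * 2)
    return sizes


def round_up(request, sizes):
    """Smallest block size in the sorted list that can hold the request."""
    return sizes[bisect_left(sizes, request)]


def wasted_fraction(requests, sizes):
    """Share of allocated space lost to rounding (internal fragmentation)."""
    allocated = sum(round_up(r, sizes) for r in requests)
    return (allocated - sum(requests)) / allocated


# Hypothetical frontal/update matrix sizes, in words (illustration only).
requests = [100, 130, 300, 700, 1100]
fib = fibonacci_sizes(max(requests))
pow2 = power_of_two_sizes(max(requests))

print("Fibonacci blocks:", round(wasted_fraction(requests, fib), 3))
print("2^i blocks:      ", round(wasted_fraction(requests, pow2), 3))
```

Consecutive Fibonacci sizes grow by a factor of roughly 1.62 rather than 2, so rounding up tends to overshoot by less. The comparison depends on the request sizes: a request of exactly 128 words, for instance, fits a 2^i block perfectly but needs a 144-word Fibonacci block.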
Pages: 473-486
Page count: 14