Implementation of Parallel Sparse Cholesky Factorization on GPU

被引:0
|
作者
Zou, Dan [1 ]
Dou, Yong [1 ]
机构
[1] Natl Univ Def Technol, Natl Lab Parallel & Distribut Proc, Changsha, Hunan, Peoples R China
关键词
sparse Cholesky factorization; GPU; PERFORMANCE; ALGORITHMS; SOLVER;
D O I
暂无
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Direct methods for solving large sparse symmetric positive-definite linear systems of equations are popular because of their generality and robustness. The main bottleneck is the sparse Cholesky factorization, which exhibits irregular memory access behavior and unbalanced workload. In the past 10 years, many sparse Cholesky factorization algorithms have emerged, exploiting new architectural features. However, programming techniques currently employed on these platforms are not sufficient to implement sparse Cholesky factorization on many-core graphics processing units (GPUs) due to mismatches between irregular problem structures and single-instruction multiple-thread GPU architectures. In the present paper, we propose a task-based software approach for the parallel sparse Cholesky factorization aimed at heterogeneous computing platforms with GPU accelerators. The tasks are generated by CPU. An efficient task-scheduling mechanism guarantees the correct ordering of task execution and ensures a load balanced execution on GPU. Comparisons are made with the existing solver using problems arising from a range of practical applications. The experiment results show that the proposed approach can substantially improve the performance of sparse Cholesky factorization on GPU with 2.7(x)-4(x) speedup.
引用
收藏
页码:2228 / 2232
页数:5
相关论文
共 50 条
  • [1] Sparse Cholesky factorization on GPU
    Zou, Dan
    Dou, Yong
    Guo, Song
    [J]. Jisuanji Xuebao/Chinese Journal of Computers, 2014, 37 (07): : 1445 - 1454
  • [2] Parallel sparse Cholesky factorization
    Monien, B
    Schulze, J
    [J]. SOLVING IRREGULARLY STRUCTURED PROBLEMS IN PARALLEL, 1997, 1253 : 255 - 272
  • [3] Parallel sparse Cholesky factorization
    Schulze, J
    [J]. MULTISCALE PHENOMENA AND THEIR SIMULATION, 1997, : 292 - 296
  • [4] HIGHLY PARALLEL SPARSE CHOLESKY FACTORIZATION
    GILBERT, JR
    SCHREIBER, R
    [J]. SIAM JOURNAL ON SCIENTIFIC AND STATISTICAL COMPUTING, 1992, 13 (05): : 1151 - 1172
  • [5] LIMITING COMMUNICATION IN PARALLEL SPARSE CHOLESKY FACTORIZATION
    HULBERT, L
    ZMIJEWSKI, E
    [J]. SIAM JOURNAL ON SCIENTIFIC AND STATISTICAL COMPUTING, 1991, 12 (05): : 1184 - 1197
  • [6] A MAPPING ALGORITHM FOR PARALLEL SPARSE CHOLESKY FACTORIZATION
    POTHEN, A
    SUN, CG
    [J]. SIAM JOURNAL ON SCIENTIFIC COMPUTING, 1993, 14 (05): : 1253 - 1257
  • [7] Parallel Sparse Cholesky Factorization on a Heterogeneous Platform
    Zou, Dan
    Dou, Yong
    Li, Rongchun
    [J]. IEICE TRANSACTIONS ON FUNDAMENTALS OF ELECTRONICS COMMUNICATIONS AND COMPUTER SCIENCES, 2013, E96A (04) : 833 - 834
  • [8] A PARALLEL ALGORITHM FOR SPARSE SYMBOLIC CHOLESKY FACTORIZATION ON A MULTIPROCESSOR
    ZMIJEWSKI, E
    GILBERT, JR
    [J]. PARALLEL COMPUTING, 1988, 7 (02) : 199 - 210
  • [9] COMMUNICATION RESULTS FOR PARALLEL SPARSE CHOLESKY FACTORIZATION ON A HYPERCUBE
    GEORGE, A
    LIU, JWH
    NG, E
    [J]. PARALLEL COMPUTING, 1989, 10 (03) : 287 - 298
  • [10] PARALLEL SPARSE CHOLESKY FACTORIZATION ON A SHARED MEMORY MULTIPROCESSOR
    ZHANG, G
    ELMAN, HC
    [J]. PARALLEL COMPUTING, 1992, 18 (09) : 1009 - 1022