A massively parallel algorithm for Bordered Almost Block Diagonal Systems on GPUs

被引:2
|
作者
Dessole, M. [1 ]
Marcuzzi, F. [1 ]
机构
[1] Univ Padua, Dept Math Tullio Levi Civita, Via Trieste 63, I-35121 Padua, Italy
关键词
GPU; Parallel algorithms; BABD system; Batched routines; Optimal control; GPGPU computing;
D O I
10.1007/s11075-020-00931-8
中图分类号
O29 [应用数学];
学科分类号
070104 ;
摘要
In this paper, we present PARASOF, an algorithm for the solution of linear systems with BABD matrices on massively parallel computing systems like graphic processing units or GPUs. This algorithm is compared with the state-of-the-art algorithms, in particular SOF, from which it is inspired and takes the same stability properties. We detail its design and implementation issues and give the main figures of its theoretical and experimental performances.
引用
收藏
页码:1243 / 1263
页数:21
相关论文
共 50 条
  • [31] ON SOLVING ALMOST BLOCK DIAGONAL (STAIRCASE) LINEAR-SYSTEMS
    REID, JK
    JENNINGS, A
    ACM TRANSACTIONS ON MATHEMATICAL SOFTWARE, 1984, 10 (02): : 196 - 201
  • [32] Massively Parallel Inverse Block-sorting Transforms for bzip2 Decompression on GPUs
    Weissenberger, Andre
    Schmidt, Bertil
    53RD INTERNATIONAL CONFERENCE ON PARALLEL PROCESSING, ICPP 2024, 2024, : 856 - 865
  • [33] SOLVEBLOK - A PACKAGE FOR SOLVING ALMOST BLOCK DIAGONAL LINEAR-SYSTEMS
    DEBOOR, C
    WEISS, R
    ACM TRANSACTIONS ON MATHEMATICAL SOFTWARE, 1980, 6 (01): : 80 - 87
  • [34] Preface to the workshop - Massively Parallel Computational Biology on GPUs
    Hamacher, Kay
    Goesele, Michael
    Lecture Notes in Informatics (LNI), Proceedings - Series of the Gesellschaft fur Informatik (GI), 2009, P-154 : 44 - 45
  • [35] Efficient parallelization of SPH algorithm on modern multi-core CPUs and massively parallel GPUs
    Jagtap, Pravin
    Nasre, Rupesh
    Sanapala, V. S.
    Patnaik, B. S., V
    INTERNATIONAL JOURNAL OF MODELING SIMULATION AND SCIENTIFIC COMPUTING, 2021, 12 (06)
  • [36] PARALLEL SOLUTION OF ALMOST BLOCK DIAGONAL SYSTEMS ON THE CRAY Y-MP USING LEVEL-3 BLAS
    GLADWELL, I
    PAPRZYCKI, M
    JOURNAL OF COMPUTATIONAL AND APPLIED MATHEMATICS, 1993, 45 (1-2) : 181 - 189
  • [37] Improve Collaborative Filtering Through Bordered Block Diagonal Form Matrices
    Zhang, Yongfeng
    Zhang, Min
    Liu, Yiqun
    Ma, Shaoping
    SIGIR'13: THE PROCEEDINGS OF THE 36TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH & DEVELOPMENT IN INFORMATION RETRIEVAL, 2013, : 313 - 322
  • [38] An interior point method for bordered block-diagonal linear programs
    Grigoriadis, MD
    Khachiyan, LG
    SIAM JOURNAL ON OPTIMIZATION, 1996, 6 (04) : 913 - 932
  • [39] A Parallel Algorithm for Block Tridiagonal Systems
    Zhang, Heng
    Zhang, Wu
    Sun, Xian-He
    PDCAT 2008: NINTH INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED COMPUTING, APPLICATIONS AND TECHNOLOGIES, PROCEEDINGS, 2008, : 62 - +
  • [40] ALGORITHM-741 - LEAST-SQUARES SOLUTION OF A LINEAR, BORDERED, BLOCK-DIAGONAL SYSTEM OF EQUATIONS
    RAY, RD
    ACM TRANSACTIONS ON MATHEMATICAL SOFTWARE, 1995, 21 (01): : 20 - 25