ParILUT - A Parallel Threshold ILU for GPUs

被引:7
|
作者
Anzt, Hartwig [1 ,2 ]
Ribizel, Tobias [1 ]
Flegar, Goran [3 ]
Chow, Edmond [4 ]
Dongarra, Jack [2 ,5 ,6 ]
机构
[1] Karlsruhe Inst Technol, Steinbuch Ctr Comp, Karlsruhe, Germany
[2] Univ Tennessee, Innovat Comp Lab ICL, Knoxville, TN 37996 USA
[3] Univ Jaume I Castellon, Dept Ingn & Ciencia Comp, Castellon De La Plana, Spain
[4] Georgia Inst Technol, Sch Computat Sci & Engn, Atlanta, GA 30332 USA
[5] Univ Manchester, Manchester, Lancs, England
[6] Oak Ridge Natl Lab ORNL, Oak Ridge, TN USA
关键词
ParILUT; parallel threshold ILU; incomplete factorization preconditioners; parallel selection; GPU;
D O I
10.1109/IPDPS.2019.00033
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper, we present the first algorithm for computing threshold ILU factorizations on GPU architectures. The proposed ParILUT-GPU algorithm is based on interleaving parallel fixed-point iterations that approximate the incomplete factors for an existing nonzero pattern with a strategy that dynamically adapts the nonzero pattern to the problem characteristics. This requires the efficient selection of thresholds that separate the values to be dropped from the incomplete factors, and we design a novel selection algorithm tailored towards GPUs. All components of the ParILUT-GPU algorithm make heavy use of the features available in the latest NVIDIA GPU generations, and outperform existing multithreaded CPU implementations.
引用
收藏
页码:231 / 241
页数:11
相关论文
共 50 条
  • [1] PARILUT-A NEW PARALLEL THRESHOLD ILU FACTORIZATION
    Anzt, Hartwig
    Chow, Edmond
    Dongarra, Jack
    SIAM JOURNAL ON SCIENTIFIC COMPUTING, 2018, 40 (04): : C503 - C519
  • [2] Hybrid Multi-Elimination ILU Preconditioners on GPUs
    Lukarski, Dimitar
    Anzt, Hartwig
    Tomov, Stanimire
    Dongarra, Jack
    PROCEEDINGS OF 2014 IEEE INTERNATIONAL PARALLEL & DISTRIBUTED PROCESSING SYMPOSIUM WORKSHOPS (IPDPSW), 2014, : 7 - 16
  • [3] Parallel ILU preconditioners in GPU computation
    Yan Chen
    Xuhong Tian
    Hui Liu
    Zhangxin Chen
    Bo Yang
    Wenyuan Liao
    Peng Zhang
    Ruijian He
    Min Yang
    Soft Computing, 2018, 22 : 8187 - 8205
  • [4] Parallel ILU preconditioners in GPU computation
    Chen, Yan
    Tian, Xuhong
    Liu, Hui
    Chen, Zhangxin
    Yang, Bo
    Liao, Wenyuan
    Zhang, Peng
    He, Ruijian
    Yang, Min
    SOFT COMPUTING, 2018, 22 (24) : 8187 - 8205
  • [5] Parallel Computing with GPUs
    Elster, Anne C.
    Requena, Stephane
    PARALLEL COMPUTING: FROM MULTICORES AND GPU'S TO PETASCALE, 2010, 19 : 533 - 535
  • [6] Parallel Programming with GPUs
    Kinayman, Noyan
    IEEE MICROWAVE MAGAZINE, 2013, 14 (04) : 102 - 115
  • [7] Parallel selection on GPUs
    Ribizel, Tobias
    Anzt, Hartwig
    PARALLEL COMPUTING, 2020, 91
  • [8] Parallel iterative solvers with localized ILU preconditioning
    Nakajima, K
    Nakamura, H
    Tanahashi, T
    HIGH-PERFORMANCE COMPUTING AND NETWORKING, 1997, 1225 : 342 - 350
  • [9] Comparison criteria for parallel orderings in ILU preconditioning
    Iwashita, T
    Nakanishi, Y
    Shimasaki, M
    SIAM JOURNAL ON SCIENTIFIC COMPUTING, 2005, 26 (04): : 1234 - 1260
  • [10] Parallel iterative solvers with localized ILU preconditioning
    Nakajima, K
    Nakamura, H
    Tanahashi, T
    PARALLEL COMPUTATIONAL FLUID DYNAMICS: RECENT DEVELOPMENTS AND ADVANCES USING PARALLEL COMPUTERS, 1998, : 359 - 366