ParILUT - A Parallel Threshold ILU for GPUs

Cited by: 7
Authors
Anzt, Hartwig [1, 2]
Ribizel, Tobias [1]
Flegar, Goran [3]
Chow, Edmond [4]
Dongarra, Jack [2, 5, 6]
Affiliations
[1] Karlsruhe Inst Technol, Steinbuch Ctr Comp, Karlsruhe, Germany
[2] Univ Tennessee, Innovat Comp Lab ICL, Knoxville, TN 37996 USA
[3] Univ Jaume I Castellon, Dept Ingn & Ciencia Comp, Castellon De La Plana, Spain
[4] Georgia Inst Technol, Sch Computat Sci & Engn, Atlanta, GA 30332 USA
[5] Univ Manchester, Manchester, Lancs, England
[6] Oak Ridge Natl Lab ORNL, Oak Ridge, TN USA
Keywords
ParILUT; parallel threshold ILU; incomplete factorization preconditioners; parallel selection; GPU
DOI
10.1109/IPDPS.2019.00033
Chinese Library Classification (CLC)
TP3 [Computing technology, computer technology]
Discipline classification code
0812
Abstract
In this paper, we present the first algorithm for computing threshold ILU factorizations on GPU architectures. The proposed ParILUT-GPU algorithm is based on interleaving parallel fixed-point iterations that approximate the incomplete factors for an existing nonzero pattern with a strategy that dynamically adapts the nonzero pattern to the problem characteristics. This requires the efficient selection of thresholds that separate the values to be dropped from the incomplete factors, and we design a novel selection algorithm tailored towards GPUs. All components of the ParILUT-GPU algorithm make heavy use of the features available in the latest NVIDIA GPU generations, and outperform existing multithreaded CPU implementations.
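The threshold-selection step mentioned in the abstract can be illustrated with a small sequential sketch. This is not the GPU selection algorithm from the paper: it uses a plain quickselect as a stand-in, and the names `select_kth_largest` and `drop_small_entries`, the dictionary storage format, and the choice to always keep diagonal entries are assumptions made only for illustration.

```python
import random


def select_kth_largest(values, k):
    """Return the k-th largest value (1-indexed) using quickselect."""
    if not 1 <= k <= len(values):
        raise ValueError("k must be between 1 and len(values)")
    items = list(values)
    while True:
        pivot = random.choice(items)
        larger = [v for v in items if v > pivot]
        equal_count = sum(1 for v in items if v == pivot)
        if k <= len(larger):
            # The answer lies among the elements above the pivot.
            items = larger
        elif k <= len(larger) + equal_count:
            return pivot
        else:
            # Skip everything >= pivot and continue in the smaller part.
            k -= len(larger) + equal_count
            items = [v for v in items if v < pivot]


def drop_small_entries(entries, keep):
    """Keep roughly the `keep` largest-magnitude entries of a sparse factor.

    `entries` maps (row, col) -> value (hypothetical storage format).
    Entries tied with the threshold are kept, and diagonal entries are
    always kept (an assumption of this sketch).
    """
    if keep >= len(entries):
        return dict(entries)
    threshold = select_kth_largest([abs(v) for v in entries.values()], keep)
    return {
        (i, j): v
        for (i, j), v in entries.items()
        if i == j or abs(v) >= threshold
    }
```

Selecting the threshold by rank rather than fixing it in advance lets the caller bound the number of nonzeros retained, which is the role the abstract assigns to threshold selection.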
Pages: 231-241
Number of pages: 11