ParILUT - A Parallel Threshold ILU for GPUs

Cited by: 7
Authors
Anzt, Hartwig [1, 2]
Ribizel, Tobias [1]
Flegar, Goran [3]
Chow, Edmond [4]
Dongarra, Jack [2, 5, 6]
Affiliations
[1] Karlsruhe Inst Technol, Steinbuch Ctr Comp, Karlsruhe, Germany
[2] Univ Tennessee, Innovat Comp Lab ICL, Knoxville, TN 37996 USA
[3] Univ Jaume I Castellon, Dept Ingn & Ciencia Comp, Castellon De La Plana, Spain
[4] Georgia Inst Technol, Sch Computat Sci & Engn, Atlanta, GA 30332 USA
[5] Univ Manchester, Manchester, Lancs, England
[6] Oak Ridge Natl Lab ORNL, Oak Ridge, TN USA
Keywords
ParILUT; parallel threshold ILU; incomplete factorization preconditioners; parallel selection; GPU
DOI
10.1109/IPDPS.2019.00033
Chinese Library Classification (CLC) number
TP3 [Computing technology, computer technology]
Discipline classification code
0812
Abstract
In this paper, we present the first algorithm for computing threshold ILU factorizations on GPU architectures. The proposed ParILUT-GPU algorithm is based on interleaving parallel fixed-point iterations that approximate the incomplete factors for an existing nonzero pattern with a strategy that dynamically adapts the nonzero pattern to the problem characteristics. This requires the efficient selection of thresholds that separate the values to be dropped from the incomplete factors, and we design a novel selection algorithm tailored towards GPUs. All components of the ParILUT-GPU algorithm make heavy use of the features available in the latest NVIDIA GPU generations, and outperform existing multithreaded CPU implementations.
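The threshold selection step mentioned in the abstract can be illustrated with a small sequential sketch. This is not the GPU selection algorithm of the paper: it swaps in a plain full sort, and the function names `select_threshold` and `drop_small`, the dictionary-based sparse storage, and the `keep` parameter are all assumptions made for illustration. It only shows the idea of picking a magnitude threshold that separates the entries to keep from the entries to drop.

```python
def select_threshold(values, keep):
    """Return the magnitude of the keep-th largest entry.

    Hypothetical helper: uses a full sort for clarity, whereas a
    parallel implementation would use a selection algorithm instead.
    """
    magnitudes = sorted((abs(v) for v in values), reverse=True)
    if not 0 < keep <= len(magnitudes):
        raise ValueError("keep must be between 1 and the number of values")
    return magnitudes[keep - 1]


def drop_small(entries, keep):
    """Drop entries whose magnitude falls below the selected threshold.

    `entries` maps (row, col) -> value (an assumed storage format).
    Ties at the threshold are all kept, so slightly more than `keep`
    entries may survive.
    """
    threshold = select_threshold(entries.values(), keep)
    return {pos: v for pos, v in entries.items() if abs(v) >= threshold}


# Usage: keep the two largest-magnitude entries of a tiny 2x2 factor.
factor = {(0, 0): 4.0, (0, 1): -0.1, (1, 0): 0.5, (1, 1): -3.0}
sparsified = drop_small(factor, keep=2)
```

A full sort costs O(n log n) per call; it is used here only because it makes the meaning of the threshold easy to see.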
Pages: 231-241
Number of pages: 11