GPU-Accelerated Parallel Sparse LU Factorization Method for Fast Circuit Analysis (vol 24, pg 1140, 2016)

被引:0
|
作者
He, Kai [1 ]
Tan, Sheldon [1 ]
机构
[1] Univ Calif Riverside, Dept Elect Engn, Riverside, CA 92521 USA
关键词
Circuit simulation and analysis; graphic processing unit (GPU) parallelization; sparse LU factorization;
D O I
10.1109/TVLSI.2015.2507135
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Lower upper (LU) factorization for sparse matrices is the most important computing step for circuit simulation problems. However, parallelizing LU factorization on the graphic processing units (GPUs) turns out to be a difficult problem due to intrinsic data dependence and irregular memory access, which diminish GPU computing power. In this paper, we propose a new sparse LU solver on GPUs for circuit simulation and more general scientific computing. The new method, which is called GPU accelerated LU factorization (GLU) solver (for GPU LU), is based on a hybrid right-looking LU factorization algorithm for sparse matrices. We show that more concurrency can be exploited in the right-looking method than the left-looking method, which is more popular for circuit analysis, on GPU platforms. At the same time, the GLU also preserves the benefit of column-based left-looking LU method, such as symbolic analysis and column-level concurrency. We show that the resulting new parallel GPU LU solver allows the parallelization of all three loops in the LU factorization on GPUs. While in contrast, the existing GPU-based left-looking LU factorization approach can only allow parallelization of two loops. Experimental results show that the proposed GLU solver can deliver 5.71x and 1.46x speedup over the single-threaded and the 16-threaded PARDISO solvers, respectively, 19.56x speedup over the KLU solver, 47.13x over the UMFPACK solver, and 1.47x speedup over a recently proposed GPU-based left-looking LU solver on the set of typical circuit matrices from the University of Florida (UFL) sparse matrix collection. Furthermore, we also compare the proposed GLU solver on a set of general matrices from the UFL, GLU achieves 6.38x and 1.12x speedup over the single-threaded and the 16-threaded PARDISO solvers, respectively, 39.39x speedup over the KLU solver, 24.04x over the UMFPACK solver, and 2.35x speedup over the same GPU-based left-looking LU solver. In addition, comparison on self-generated RLC mesh networks shows a similar trend, which further validates the advantage of the proposed method over the existing sparse LU solvers.
引用
收藏
页码:1212 / 1212
页数:1
相关论文
共 11 条
  • [1] GPU-Accelerated Sparse LU Factorization for Circuit Simulation with Performance Modeling
    Chen, Xiaoming
    Ren, Ling
    Wang, Yu
    Yang, Huazhong
    [J]. IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2015, 26 (03) : 786 - 795
  • [2] Dynamic GPU Parallel Sparse LU Factorization for Fast Circuit Simulation
    Lee, Wai-Kong
    Achar, Ramachandra
    Nakhla, Michel S.
    [J]. IEEE TRANSACTIONS ON VERY LARGE SCALE INTEGRATION (VLSI) SYSTEMS, 2018, 26 (11) : 2518 - 2529
  • [3] GPU-Accelerated Sparse LU Factorization for Power System Simulation
    Gnanavignesh, R.
    Shenoy, U. Jayachandra
    [J]. Proceedings of 2019 IEEE PES Innovative Smart Grid Technologies Europe, ISGT-Europe 2019, 2019,
  • [4] GPU-Accelerated Sparse LU Factorization for Power System Simulation
    Gnanavignesh, R.
    Shenoy, U. Jayachandra
    [J]. PROCEEDINGS OF 2019 IEEE PES INNOVATIVE SMART GRID TECHNOLOGIES EUROPE (ISGT-EUROPE), 2019,
  • [5] Sparse LU Factorization for Parallel Circuit Simulation on GPU
    Ren, Ling
    Chen, Xiaoming
    Wang, Yu
    Zhang, Chenxi
    Yang, Huazhong
    [J]. 2012 49TH ACM/EDAC/IEEE DESIGN AUTOMATION CONFERENCE (DAC), 2012, : 1125 - 1130
  • [6] An Alternate GPU-Accelerated Algorithm for Very Large Sparse LU Factorization
    Chen, Jile
    Zhu, Peimin
    [J]. MATHEMATICS, 2023, 11 (14)
  • [7] GPU-Accelerated Sparse LU Factorization for Concurrent Analysis of Large-Scale Power Systems
    Shawlin, Sk Subrina
    Mohammadi, Fazel
    Rezaei-Zare, Afshin
    [J]. 2022 IEEE INTERNATIONAL CONFERENCE ON ENVIRONMENT AND ELECTRICAL ENGINEERING AND 2022 IEEE INDUSTRIAL AND COMMERCIAL POWER SYSTEMS EUROPE (EEEIC / I&CPS EUROPE), 2022,
  • [8] GPU-Accelerated Adaptive PCBSO Mode-Based Hybrid RLA for Sparse LU Factorization in Circuit Simulation
    Lee, Wai-Kong
    Achar, Ramachandra
    [J]. IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS, 2021, 40 (11) : 2320 - 2330
  • [9] GLU3.0: Fast GPU-based Parallel Sparse LU Factorization for Circuit Simulation
    Peng, Shaoyi
    Tan, Sheldon X. -D.
    [J]. IEEE DESIGN & TEST, 2020, 37 (03) : 78 - 90
  • [10] Massively Parallel GPU-Accelerated String Method for Fast and Accurate Prediction of Molecular Diffusivity in Nanoporous Materials
    Zhou, Musen
    Wu, Jianzhong
    [J]. ACS APPLIED NANO MATERIALS, 2021, 4 (05) : 5394 - 5403