Accelerating Sparse LU Factorization with Density-Aware Adaptive Matrix Multiplication for Circuit Simulation

被引:3
|
作者
Wang, Tengcheng [1 ]
Li, Wenhao [1 ]
Pei, Haojie [1 ]
Sun, Yuying [1 ]
Jin, Zhou [1 ]
Liu, Weifeng [1 ]
机构
[1] China Univ Petr, Super Sci Software Lab, Beijing, Peoples R China
基金
中国国家自然科学基金; 国家重点研发计划;
关键词
sparse LU factorization; circuit simulation; matrix multiplication; supernodal LU factorization; machine learning; random forest;
D O I
10.1109/DAC56929.2023.10247767
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Sparse LU factorization is considered to be one of the most time-consuming components in circuit simulation, particularly when dealing with circuits of considerable size in the advanced process era. Sparse LU factorization can be expedited by utilizing the supernode structure, which partitions the matrix into dense sub-matrices, thereby improving computational performance by utilizing level-3 Basic Linear Algebra Subprograms (BLAS) General Matrix Multiplication (GEMM) operations. The sparse and irregular structure of circuit matrices often impedes the formation of supernodes or results in the formation of supernodes with many zero elements, which in turn poses challenges for exploiting GEMM operations. In this paper, by fully utilizing the density in sub-matrices and combining GEMM with the Dense-Sparse Matrix Multiplication (SpMM), we propose a density-aware adaptive matrix multiplication equipped with machine learning techniques to optimize performance of the most-time consuming matrix multiplication operator so as to accelerate the sparse LU factorization. Numerical experiment results show that among the 6 circuit matrices tested, the average performance of matrix multiplication in our algorithm can be improved by 5.35x (up to 9.35x) compared to the performance of using GEMM directly in Schur-complement updates. Compared with state-of-the-art solver SuperLU_DIST, our method shows a substantial performance improvement.
引用
收藏
页数:6
相关论文
共 50 条
  • [41] GPU-Accelerated Parallel Sparse LU Factorization Method for Fast Circuit Analysis
    [J]. 1600, Institute of Electrical and Electronics Engineers Inc., United States (24):
  • [42] Sparse approximate matrix-matrix multiplication for density matrix purification with error control
    Artemov, Anton G.
    Rubensson, Emanuel H.
    [J]. JOURNAL OF COMPUTATIONAL PHYSICS, 2021, 438
  • [43] Density-Aware Store-Carry-Forward Routing with Adaptive Forwarding Probability Control
    Kimura, Tomotaka
    Jonouchi, Tatsuro
    Matsuda, Takahiro
    Takine, Tctsuya
    [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON CONSUMER ELECTRONICS - TAIWAN (ICCE-TW), 2015, : 474 - 475
  • [44] PerspectiveNet: An Object Detection Method With Adaptive Perspective Box Network Based on Density-Aware
    Li, Cui-Jin
    Qu, Zhong
    Wang, Sheng-Ye
    [J]. IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2023, 24 (05) : 5419 - 5429
  • [45] FSpGEMM: A Framework for Accelerating Sparse General Matrix–Matrix Multiplication Using Gustavson’s Algorithm on FPGAs
    Tavakoli, Erfan Bank
    Riera, Michael
    Quraishi, Masudul Hassan
    Ren, Fengbo
    [J]. IEEE TRANSACTIONS ON VERY LARGE SCALE INTEGRATION (VLSI) SYSTEMS, 2024, 32 (04) : 633 - 644
  • [46] Accelerating CPU-Based Sparse General Matrix Multiplication With Binary Row Merging
    Du, Zhaoyang
    Guan, Yijin
    Guan, Tianchan
    Niu, Dimin
    Zheng, Hongzhong
    Xie, Yuan
    [J]. IEEE ACCESS, 2022, 10 : 79237 - 79248
  • [47] Performance-Aware Model for Sparse Matrix-Matrix Multiplication on the Sunway TaihuLight Supercomputer
    Chen, Yuedan
    Li, Kenli
    Yang, Wangdong
    Xiao, Guoqing
    Xie, Xianghui
    Li, Tao
    [J]. IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2019, 30 (04) : 923 - 938
  • [48] Communication-Optimal Parallel 2.5D Matrix Multiplication and LU Factorization Algorithms
    Solomonik, Edgar
    Demmel, James
    [J]. EURO-PAR 2011 PARALLEL PROCESSING, PT 2, 2011, 6853 : 90 - 109
  • [49] A FAST SPARSE-MATRIX SOLVER FOR CIRCUIT SIMULATION
    WANG, CJ
    ESS, MA
    [J]. SUPERCOMPUTER, 1991, 8 (03): : 22 - 31
  • [50] Parallel Sparse Matrix Solution for Circuit Simulation on FPGAs
    Nechma, Tarek
    Zwolinski, Mark
    [J]. IEEE TRANSACTIONS ON COMPUTERS, 2015, 64 (04) : 1090 - 1103