Parallelization and scalability analysis of inverse factorization using the chunks and tasks programming model

被引:2
|
作者
Artemov, Anton G. [1 ]
Rudberg, Elias [1 ]
Rubensson, Emanuel H. [1 ]
机构
[1] Uppsala Univ, Dept Informat Technol, Div Sci Comp, Box 337, SE-75105 Uppsala, Sweden
基金
瑞典研究理事会;
关键词
Parallel computing; Sparse matrix algorithms; Scalable algorithms; Large-scale computing; Algorithm analysis; MATRIX-MATRIX MULTIPLICATION;
D O I
10.1016/j.parco.2019.102548
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
We present three methods for distributed memory parallel inverse factorization of block-sparse Hermitian positive definite matrices. The three methods are a recursive variant of the AINV inverse Cholesky algorithm, iterative refinement, and localized inverse factorization. All three methods are implemented using the Chunks and Tasks programming model, building on the distributed sparse quad-tree matrix representation and parallel matrix-matrix multiplication in the publicly available Chunks and Tasks Matrix Library (CHTML). Although the algorithms are generally applicable, this work was mainly motivated by the need for efficient and scalable inverse factorization of the basis set overlap matrix in large scale electronic structure calculations. We perform various computational tests on overlap matrices for quasi linear glutamic acid-alanine molecules and three-dimensional water clusters discretized using the standard Gaussian basis set STO-3G with up to more than 10 million basis functions. We show that for such matrices the computational cost increases only linearly with system size for all the three methods. We show both theoretically and in numerical experiments that the methods based on iterative refinement and localized inverse factorization outperform previous parallel implementations in weak scaling tests where the system size is increased in direct proportion to the number of processes. We show also that, compared to the method based on pure iterative refinement, the localized inverse factorization requires much less communication. (C) 2019 Elsevier B.V. All rights reserved.
引用
收藏
页数:12
相关论文
共 50 条
  • [1] Chunks and Tasks: A programming model for parallelization of dynamic algorithms
    Rubensson, Emanuel H.
    Rudberg, Elias
    [J]. PARALLEL COMPUTING, 2014, 40 (07) : 328 - 343
  • [2] Parallelization Technique using Hybrid Programming Model
    Algarni, Abdullah
    Alofi, Abdulraheem
    Eassa, Fathy
    [J]. INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2021, 12 (02) : 682 - 690
  • [3] Locality-aware parallel block-sparse matrix-matrix multiplication using the Chunks and Tasks programming model
    Rubensson, Emanuel H.
    Rudberg, Elias
    [J]. PARALLEL COMPUTING, 2016, 57 : 87 - 106
  • [4] Analysis on scalability and energy efficiency of HEVC decoding using task-based programming model
    Georgakarakos, Georgios
    Holmbacka, Simon
    Lilius, Johan
    [J]. PROCEEDINGS OF THE 2016 CONFERENCE ON DESIGN AND ARCHITECTURES FOR SIGNAL & IMAGE PROCESSING, 2016, : 34 - 41
  • [5] Parallelization Using Task Parallel Library with Task-Based Programming Model
    Hei, Xinhong
    Zhang, Jinlong
    Wang, Bin
    Jin, Haiyan
    Giacaman, Nasser
    [J]. 2014 5TH IEEE INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING AND SERVICE SCIENCE (ICSESS), 2014, : 653 - 656
  • [6] PARAGON: an approach for parallelization of power system contingency analysis using Go programming language
    Khaitan, Siddhartha Kumar
    McCalley, James D.
    [J]. INTERNATIONAL TRANSACTIONS ON ELECTRICAL ENERGY SYSTEMS, 2015, 25 (11): : 2909 - 2920
  • [7] Uncertainty quantification of inverse analysis for geomaterials using probabilistic programming
    Zhao, Hongbo
    Li, Shaojun
    Zang, Xiaoyu
    Liu, Xinyi
    Zhang, Lin
    Ren, Jiaolong
    [J]. JOURNAL OF ROCK MECHANICS AND GEOTECHNICAL ENGINEERING, 2024, 16 (03) : 895 - 908
  • [8] Scalability Performance Analysis of Blockchain Using Hierarchical Model in Healthcare
    Sadath, Lipsa
    Mehrotra, Deepti
    Kumar, Anand
    [J]. Blockchain in Healthcare Today, 2024, 7 (01):
  • [9] Scalability Analysis of a Novel Integer Programming Model to Deal with Energy Consumption in Heterogeneous Wireless Sensor Networks
    Aguiar, Alexci
    Pinheiro, Placido Rogerio
    Coelho, Andre L. V.
    Nepomuceno, Napoleao
    Neto, Alvaxo
    Cunha, Ruddy P. P.
    [J]. MODELLING, COMPUTATION AND OPTIMIZATION IN INFORMATION SYSTEMS AND MANAGEMENT SCIENCES, PROCEEDINGS, 2008, 14 : 11 - 20
  • [10] Phylogenetic Analysis using MapReduce Programming Model
    Siddesh, G. M.
    Srinivasa, K. G.
    Mishra, Ishank
    Anurag, Abhinav
    Uppal, Eklavya
    [J]. 2015 IEEE 29TH INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM WORKSHOPS, 2015, : 350 - 356