VECTORIZATION OF A THREAD-PARALLEL JACOBI SINGULAR VALUE DECOMPOSITION METHOD

被引:2
|
作者
Novakovic, Vedran
机构
[1] Zagreb
来源
SIAM JOURNAL ON SCIENTIFIC COMPUTING | 2023年 / 45卷 / 03期
关键词
batched eigendecomposition of Hermitian matrices of order two; SIMD vectorization; singular value decomposition; parallel one-sided Jacobi-type SVD method; SVD ALGORITHM; ORTHOGONAL EIGENVECTORS; ACCURATE; QR;
D O I
10.1137/22M1478847
中图分类号
O29 [应用数学];
学科分类号
070104 ;
摘要
The eigenvalue decomposition (EVD) of (a batch of) Hermitian matrices of order two has a role in many numerical algorithms, of which the one-sided Jacobi method for the singular value decomposition (SVD) is the prime example. In this paper the batched EVD is vectorized with a vector-friendly data layout and the AVX-512 SIMD instructions of Intel CPUs, alongside other key components of a real and a complex OpenMP-parallel Jacobi-type SVD method, inspired by the sequential xGESVJ routines from LAPACK. These vectorized building blocks should be portable to other platforms that support similar vector operations. Unconditional numerical reproducibility is guaranteed for the batched EVD, sequential or threaded, and for the column transformations, which are, like the scaled dot-products, presently sequential but can be threaded if nested parallelism is desired. No avoidable overflow of the results can occur with the proposed EVD or the whole SVD. The measured accuracy of the proposed EVD often surpasses that of the xLAEV2 routines from LAPACK. While the batched EVD outperforms the matching sequence of xLAEV2 calls, speedup of the parallel SVD is modest but can be improved and is already beneficial with enough threads. Regardless of their number, the proposed SVD method gives identical results but of a somewhat lower accuracy than xGESVJ.
引用
收藏
页码:C73 / C100
页数:28
相关论文
共 50 条
  • [1] Jacobi method for quaternion matrix singular value decomposition
    Le Bihan, Nicolas
    Sangwine, Stephen J.
    APPLIED MATHEMATICS AND COMPUTATION, 2007, 187 (02) : 1265 - 1271
  • [3] Singular value decomposition for complex matrices using two-sided Jacobi method
    Chiyonobu, Miho
    Miyamae, Takahiro
    Takata, Masami
    Harayama, Jun
    Kimura, Kinji
    Nakamura, Yoshimasa
    JOURNAL OF SUPERCOMPUTING, 2024, 80 (08): : 11719 - 11740
  • [4] Singular value decomposition for complex matrices using two-sided Jacobi method
    Miho Chiyonobu
    Takahiro Miyamae
    Masami Takata
    Jun Harayama
    Kinji Kimura
    Yoshimasa Nakamura
    The Journal of Supercomputing, 2024, 80 : 11719 - 11740
  • [5] Efficiency of thread-parallel Java programs from scientific computing
    Blaar, H.
    Legeler, M.
    Rauber, T.
    Proceedings - International Parallel and Distributed Processing Symposium, IPDPS 2002, 2002,
  • [6] Singular Value Decomposition Using Jacobi Algorithm in pMRI and CS
    Qazi, Sohaib A.
    Saeed, Abeera
    Nasir, Saima
    Omer, Hammad
    APPLIED MAGNETIC RESONANCE, 2017, 48 (05) : 461 - 471
  • [7] Singular Value Decomposition Using Jacobi Algorithm in pMRI and CS
    Sohaib A. Qazi
    Abeera Saeed
    Saima Nasir
    Hammad Omer
    Applied Magnetic Resonance, 2017, 48 : 461 - 471
  • [8] STUDY OF JACOBI METHODS FOR EIGENVALUES AND SINGULAR VALUE DECOMPOSITION ON DAP
    MODI, JJ
    PARKINSON, D
    COMPUTER PHYSICS COMMUNICATIONS, 1982, 26 (3-4) : 317 - 320
  • [9] PARALLEL SINGULAR VALUE DECOMPOSITION WITH CYCLIC STORING
    KOCKLER, N
    SIMON, M
    PARALLEL COMPUTING, 1991, 17 (01) : 39 - 47
  • [10] Thread-Parallel Integrated Test Pattern Generator Utilizing Satisfiability Analysis
    Alexander Czutro
    Ilia Polian
    Matthew Lewis
    Piet Engelke
    Sudhakar M. Reddy
    Bernd Becker
    International Journal of Parallel Programming, 2010, 38 : 185 - 202