VECTORIZATION OF A THREAD-PARALLEL JACOBI SINGULAR VALUE DECOMPOSITION METHOD

被引:2
|
作者
Novakovic, Vedran
机构
[1] Zagreb
来源
SIAM JOURNAL ON SCIENTIFIC COMPUTING | 2023年 / 45卷 / 03期
关键词
batched eigendecomposition of Hermitian matrices of order two; SIMD vectorization; singular value decomposition; parallel one-sided Jacobi-type SVD method; SVD ALGORITHM; ORTHOGONAL EIGENVECTORS; ACCURATE; QR;
D O I
10.1137/22M1478847
中图分类号
O29 [应用数学];
学科分类号
070104 ;
摘要
The eigenvalue decomposition (EVD) of (a batch of) Hermitian matrices of order two has a role in many numerical algorithms, of which the one-sided Jacobi method for the singular value decomposition (SVD) is the prime example. In this paper the batched EVD is vectorized with a vector-friendly data layout and the AVX-512 SIMD instructions of Intel CPUs, alongside other key components of a real and a complex OpenMP-parallel Jacobi-type SVD method, inspired by the sequential xGESVJ routines from LAPACK. These vectorized building blocks should be portable to other platforms that support similar vector operations. Unconditional numerical reproducibility is guaranteed for the batched EVD, sequential or threaded, and for the column transformations, which are, like the scaled dot-products, presently sequential but can be threaded if nested parallelism is desired. No avoidable overflow of the results can occur with the proposed EVD or the whole SVD. The measured accuracy of the proposed EVD often surpasses that of the xLAEV2 routines from LAPACK. While the batched EVD outperforms the matching sequence of xLAEV2 calls, speedup of the parallel SVD is modest but can be improved and is already beneficial with enough threads. Regardless of their number, the proposed SVD method gives identical results but of a somewhat lower accuracy than xGESVJ.
引用
收藏
页码:C73 / C100
页数:28
相关论文
共 50 条
  • [22] An eigentracking method based on singular value decomposition
    Ren, Xianyi
    Zhou, Xiao
    Zhang, Guilin
    Zhang, Tianxu
    Huazhong Ligong Daxue Xuebao/Journal Huazhong (Central China) University of Science and Technology, 2001, 29 (12): : 50 - 51
  • [23] A METHOD FOR COMPUTING THE GENERALIZED SINGULAR VALUE DECOMPOSITION
    STEWART, GW
    LECTURE NOTES IN MATHEMATICS, 1983, 973 : 207 - 220
  • [24] Parallel tensor decomposition with distributed memory based on hierarchical singular value decomposition
    Fang, Zisen
    Qi, Fumin
    Dong, Yichuan
    Zhang, Yong
    Feng, Shengzhong
    CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2023, 35 (17):
  • [25] A Scalable FPGA Engine for Parallel Acceleration of Singular Value Decomposition
    Wang, Yu
    Lee, Jeong-Jun
    Ding, Yu
    Li, Peng
    PROCEEDINGS OF THE TWENTYFIRST INTERNATIONAL SYMPOSIUM ON QUALITY ELECTRONIC DESIGN (ISQED 2020), 2020, : 370 - 376
  • [26] A PARALLEL ALGORITHM FOR COMPUTING THE GENERALIZED SINGULAR-VALUE DECOMPOSITION
    BAI, ZJ
    JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 1994, 20 (03) : 280 - 288
  • [27] Singular value decomposition utilizing parallel algorithms on graphical processors
    Kotas, Charlotte
    Barhen, Jacob
    OCEANS 2011, 2011,
  • [28] A PARALLEL ALGORITHM FOR COMPUTING THE SINGULAR-VALUE DECOMPOSITION OF A MATRIX
    JESSUP, ER
    SORENSEN, DC
    SIAM JOURNAL ON MATRIX ANALYSIS AND APPLICATIONS, 1994, 15 (02) : 530 - 548
  • [29] A Jacobi-Davidson type method for the generalized singular value problem
    Hochstenbach, M. E.
    LINEAR ALGEBRA AND ITS APPLICATIONS, 2009, 431 (3-4) : 471 - 487
  • [30] A ONE-SIDED JACOBI ALGORITHM FOR COMPUTING THE SINGULAR VALUE DECOMPOSITION ON A VECTOR COMPUTER
    DERIJK, PPM
    SIAM JOURNAL ON SCIENTIFIC AND STATISTICAL COMPUTING, 1989, 10 (02): : 359 - 371