Direct QR factorizations for tall-and-skinny matrices in MapReduce architectures

Cited by: 0
Authors
Benson, Austin R. [1]
Gleich, David F. [2]
Demmel, James [3]
Affiliations
[1] Stanford Univ, Inst Computat & Math Engn, Stanford, CA 94305 USA
[2] Purdue Univ, Dept Comp Sci, W Lafayette, IN 47907 USA
[3] Univ Calif Berkeley, Div Comp Sci, Dept Math, Berkeley, CA 94720 USA
Keywords
matrix factorization; QR; SVD; TSQR; MapReduce; Hadoop
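DOI
Not available
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract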
The QR factorization and the SVD are two fundamental matrix decompositions with applications throughout scientific computing and data analysis. For matrices with many more rows than columns, so-called "tall-and-skinny matrices," there is a numerically stable, efficient, communication-avoiding algorithm for computing the QR factorization. It has been used in traditional high performance computing and grid computing environments. For MapReduce environments, existing methods to compute the QR decomposition use a numerically unstable approach that relies on indirectly computing the Q factor. In the best case, these methods require only two passes over the data. In this paper, we describe how to compute a stable tall-and-skinny QR factorization on a MapReduce architecture in only slightly more than two passes over the data. We can compute the SVD with only a small change and no difference in performance. We present a performance comparison between our new direct TSQR method, indirect TSQR methods that use the communication-avoiding TSQR algorithm, and a standard unstable implementation for MapReduce (Cholesky QR). We find that our new stable method is competitive with unstable methods for matrices with a modest number of columns. This holds both in a theoretical performance model and in an actual implementation.
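The contrast the abstract draws between an indirect Cholesky QR and a direct TSQR can be illustrated with a small in-memory sketch. This is not the paper's MapReduce implementation: the function names `cholesky_qr` and `direct_tsqr`, the `num_blocks` parameter, and the use of NumPy are assumptions made for illustration, and the row blocks here merely stand in for the data partitions a MapReduce job would process. The direct variant shown is one common way to assemble Q explicitly: factor each row block, factor the stacked R factors, then multiply the local Q factors by the matching slices of the second-stage Q.

```python
import numpy as np


def cholesky_qr(A):
    """Indirect approach: R from the Cholesky factor of A^T A, then Q = A R^{-1}.

    Forming A^T A squares the condition number, which is why this route
    can lose orthogonality in Q for ill-conditioned inputs.
    """
    G = A.T @ A                        # n x n Gram matrix
    R = np.linalg.cholesky(G).T        # upper-triangular factor, G = R^T R
    Q = np.linalg.solve(R.T, A.T).T    # solves R^T X = A^T, so X^T = A R^{-1}
    return Q, R


def direct_tsqr(A, num_blocks=4):
    """Direct TSQR sketch (hypothetical helper, in-memory only).

    Assumes every row block has at least as many rows as A has columns.
    """
    blocks = np.array_split(A, num_blocks, axis=0)

    # Step 1: independent reduced QR of each row block (the "map" side).
    local = [np.linalg.qr(block) for block in blocks]

    # Step 2: QR of the stacked n x n R factors (small: num_blocks * n rows).
    R_stack = np.vstack([R_i for _, R_i in local])
    Q2, R = np.linalg.qr(R_stack)

    # Step 3: each block of the final Q is the local Q times its n x n slice of Q2.
    Q2_slices = np.split(Q2, num_blocks, axis=0)
    Q = np.vstack([Q1_i @ Q2_i for (Q1_i, _), Q2_i in zip(local, Q2_slices)])
    return Q, R


# Usage: compare orthogonality of Q for both methods on a random tall matrix.
rng = np.random.default_rng(0)
A = rng.standard_normal((1000, 5))
for name, method in [("cholesky_qr", cholesky_qr), ("direct_tsqr", direct_tsqr)]:
    Q, R = method(A)
    orth_err = np.linalg.norm(Q.T @ Q - np.eye(A.shape[1]))
    print(name, "orthogonality error:", orth_err)
```

On a well-conditioned random matrix both routes should give a nearly orthogonal Q; the difference the abstract refers to shows up as the columns of A become closer to linearly dependent.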
Pages: 9
Related papers
28 in total
  • [1] An Approach of the QR Factorization for Tall-and-Skinny Matrices on Multicore Platforms
    Kuznetsov, Sergey V.
    APPLIED PARALLEL AND SCIENTIFIC COMPUTING (PARA 2012), 2013, 7782 : 235 - 249
  • [2] Cholesky and Gram-Schmidt Orthogonalization for Tall-and-Skinny QR Factorizations on Graphics Processors
    Tomas, Andres E.
    Quintana-Orti, Enrique S.
    EURO-PAR 2019: PARALLEL PROCESSING, 2019, 11725 : 469 - 480
  • [3] Acceleration of Parallel-Blocked QR Decomposition of Tall-and-Skinny Matrices on FPGAs
    Borbon, Jose M. Rodriguez
    Huang, Junjie
    Wong, Bryan M.
    Najjar, Walid
    ACM TRANSACTIONS ON ARCHITECTURE AND CODE OPTIMIZATION, 2021, 18 (03)
  • [4] Scalable Methods for Nonnegative Matrix Factorizations of Near-separable Tall-and-skinny Matrices
    Benson, Austin R.
    Lee, Jason D.
    Rajwa, Bartek
    Gleich, David F.
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 27 (NIPS 2014), 2014, 27
  • [5] Tall-and-skinny QR factorization with approximate Householder reflectors on graphics processors
    Andrés E. Tomás
    Enrique S. Quintana-Ortí
    The Journal of Supercomputing, 2020, 76 : 8771 - 8786
  • [6] Tall-and-skinny QR factorization with approximate Householder reflectors on graphics processors
    Tomas, Andres E.
    Quintana-Orti, Enrique S.
    JOURNAL OF SUPERCOMPUTING, 2020, 76 (11) : 8771 - 8786
  • [7] Tall-and-Skinny QR Factorization for Clusters of GPUs Using High-Performance Building Blocks
    Tomas, Andres E.
    Quintana-Orti, Enrique S.
    EURO-PAR 2023: PARALLEL PROCESSING WORKSHOPS, PT I, EURO-PAR 2023, 2024, 14351 : 306 - 317
  • [8] Reproducible Tall-Skinny QR
    Demmel, James
    Hong Diep Nguyen
    IEEE 22nd Symposium on Computer Arithmetic ARITH 22, 2015 : 152 - 159
  • [9] Performance Analysis of the Householder-Type Parallel Tall-Skinny QR Factorizations Toward Automatic Algorithm Selection
    Fukaya, Takeshi
    Imamura, Toshiyuki
    Yamamoto, Yusaku
    HIGH PERFORMANCE COMPUTING FOR COMPUTATIONAL SCIENCE - VECPAR 2014, 2015, 8969 : 269 - 283
  • [10] Exact QR factorizations of rectangular matrices
    Christopher Lourenco
    Erick Moreno-Centeno
    Optimization Letters, 2024, 18 : 681 - 695