Distributed-Memory Parallel Symmetric Nonnegative Matrix Factorization

被引:2
|
作者
Eswar, Srinivas [1 ]
Hayashi, Koby [1 ]
Ballard, Grey [2 ]
Kannan, Ramakrishnan [3 ]
Vuduc, Richard [1 ]
Park, Haesun [1 ]
机构
[1] Georgia Inst Technol, Dept Computat Sci & Engn, Atlanta, GA 30332 USA
[2] Wake Forest Univ, Dept Comp Sci, Winston Salem, NC 27101 USA
[3] Oak Ridge Natl Lab, Computat Data Analyt Grp, Oak Ridge, TN USA
关键词
High performance computing; Newton method; Parallel algorithms; Symmetric Matrices; COLLECTIVE COMMUNICATION; COORDINATE DESCENT; ALGORITHMS;
D O I
10.1109/sc41405.2020.00078
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
We develop the first distributed -memory parallel implementation of Symmetric Nonnegative Matrix Factorization (SymNMF), a key data analytics kernel 14 clustering and dimensionality reduction. Our implementation includes two different algorithms for SytnNMF, which give comparable results in terms of time and accuracy. The first algorithm is a parallelization of an existing sequential approach that uses solvers for nonsymmetric NNW The second algorithm is a novel approach based on the Gauss -Newton method. It exploits second -order information without incurring large computational and memory costs. We evaluate the scalability of our algorithms on the Summit system at Oak Ridge National Laboratory, scaling up to 128 nodes (4,096 cores) with 70% efficiency. Additionally, we demonstrate our software on an image segmentation task.
引用
收藏
页数:14
相关论文
共 50 条
  • [1] Parallel Nonnegative Matrix Factorization Algorithm on the Distributed Memory Platform
    Chao Dong
    Huijie Zhao
    Wei Wang
    [J]. International Journal of Parallel Programming, 2010, 38 : 117 - 137
  • [2] Parallel Nonnegative Matrix Factorization Algorithm on the Distributed Memory Platform
    Dong, Chao
    Zhao, Huijie
    Wang, Wei
    [J]. INTERNATIONAL JOURNAL OF PARALLEL PROGRAMMING, 2010, 38 (02) : 117 - 137
  • [3] Parallel sparse orthogonal factorization on distributed-memory multiprocessors
    Sun, CG
    [J]. SIAM JOURNAL ON SCIENTIFIC COMPUTING, 1996, 17 (03): : 666 - 685
  • [4] Distributed-memory lattice H-matrix factorization
    Yamazaki, Ichitaro
    Ida, Akihiro
    Yokota, Rio
    Dongarra, Jack
    [J]. INTERNATIONAL JOURNAL OF HIGH PERFORMANCE COMPUTING APPLICATIONS, 2019, 33 (05): : 1046 - 1063
  • [5] Accelerated parallel and distributed algorithm using limited internal memory for nonnegative matrix factorization
    Duy Khuong Nguyen
    Tu Bao Ho
    [J]. Journal of Global Optimization, 2017, 68 : 307 - 328
  • [6] Accelerated parallel and distributed algorithm using limited internal memory for nonnegative matrix factorization
    Duy Khuong Nguyen
    Tu Bao Ho
    [J]. JOURNAL OF GLOBAL OPTIMIZATION, 2017, 68 (02) : 307 - 328
  • [7] ADAPTIVE BLOCK TECHNIQUES FOR MATRIX FACTORIZATION ON DISTRIBUTED-MEMORY COMPUTERS
    STRAUSS, H
    RONSCH, W
    [J]. HYPERCUBE AND DISTRIBUTED COMPUTERS, 1989, : 355 - 356
  • [8] Distributed-memory hierarchical interpolative factorization
    Li, Yingzhou
    Ying, Lexing
    [J]. RESEARCH IN THE MATHEMATICAL SCIENCES, 2017, 4
  • [9] Distributed-memory hierarchical interpolative factorization
    Yingzhou Li
    Lexing Ying
    [J]. Research in the Mathematical Sciences, 4
  • [10] PARALLEL MATRIX TRANSPOSE ALGORITHMS ON DISTRIBUTED-MEMORY CONCURRENT COMPUTERS
    CHOI, JY
    DONGARRA, JJ
    WALKER, DW
    [J]. PARALLEL COMPUTING, 1995, 21 (09) : 1387 - 1405