On Optimizing Distributed Tucker Decomposition for Sparse Tensors

被引:12
|
作者
Chakaravarthy, Venkatesan T. [1 ]
Choi, Jee W. [1 ]
Joseph, Douglas J. [1 ]
Murali, Prakash [1 ,2 ]
Pandian, Shivmaran S. [1 ]
Sabharwal, Yogish [1 ]
Sreedhar, Dheeraj [1 ]
机构
[1] IBM Res, Armonk, NY 10504 USA
[2] Princeton Univ, Princeton, NJ 08544 USA
关键词
Tensor decompositions; tensor distribution schemes;
D O I
10.1145/3205289.3205315
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
The Tucker decomposition generalizes the notion of Singular Value Decomposition (SVD) to tensors, the higher dimensional analogues of matrices. We study the problem of constructing the Tucker decomposition of sparse tensors on distributed memory systems via the HOOI procedure, a popular iterative method. The scheme used for distributing the input tensor among the processors (MPI ranks) critically influences the HOOI execution time. Prior work has proposed different distribution schemes: an offline scheme based on sophisticated hypergraph partitioning method and simple, lightweight alternatives that can be used real-time. While the hypergraph based scheme typically results in faster HOOI execution time, being complex, the time taken for determining the distribution is an order of magnitude higher than the execution time of a single HOOI iteration. Our main contribution is a lightweight distribution scheme, which achieves the best of both worlds. We show that the scheme is near-optimal on certain fundamental metrics associated with the HOOI procedure and as a result, near-optimal on the computational load (FLOPs). Though the scheme may incur higher communication volume, the computation time is the dominant factor and as the result, the scheme achieves better performance on the overall HOOI execution time. Our experimental evaluation on large real-life tensors (having up to 4 billion elements) shows that the scheme outperforms the prior schemes on the HOOI execution time by a factor of up to 3x. On the other hand, its distribution time is comparable to the prior lightweight schemes and is typically lesser than the execution time of a single HOOI iteration.
引用
收藏
页码:374 / 384
页数:11
相关论文
共 50 条
  • [41] a-Tucker: fast input-adaptive and matricization-free Tucker decomposition of higher-order tensors on GPUs
    Lian Duan
    Chuanfu Xiao
    Min Li
    Mingshuo Ding
    Chao Yang
    [J]. CCF Transactions on High Performance Computing, 2023, 5 : 12 - 25
  • [42] Tucker Decomposition for Compressing Translation Operator Tensors in FMM-FFT Accelerated SIE Solvers
    Yucel, Abdulkadir C.
    Gomez, Luis J.
    Michielssen, Eric
    [J]. 2015 USNC-URSI RADIO SCIENCE MEETING (JOINT WITH AP-S SYMPOSIUM) PROCEEDINGS, 2015, : 141 - 141
  • [43] PARALLEL CANDECOMP/PARAFAC DECOMPOSITION OF SPARSE TENSORS USING DIMENSION TREES
    Kaya, Oguz
    Ucar, Bora
    [J]. SIAM JOURNAL ON SCIENTIFIC COMPUTING, 2018, 40 (01): : C99 - C130
  • [44] Dimensionality reduction of tensors based on manifold-regularized tucker decomposition and its iterative solution
    Huang, Haidong
    Ma, Zhengming
    Zhang, Guokai
    [J]. INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2022, 13 (02) : 509 - 522
  • [45] A low-rank and sparse enhanced Tucker decomposition approach for tensor completion
    Pan, Chenjian
    Ling, Chen
    He, Hongjin
    Qi, Liqun
    Xu, Yanwei
    [J]. APPLIED MATHEMATICS AND COMPUTATION, 2024, 465
  • [46] Dimensionality reduction of tensors based on manifold-regularized tucker decomposition and its iterative solution
    Haidong Huang
    Zhengming Ma
    Guokai Zhang
    [J]. International Journal of Machine Learning and Cybernetics, 2022, 13 : 509 - 522
  • [47] Sparse and Supervised Tucker Decomposition for Tensor-based Multichannel Noise Reduction
    Tong, Renjie
    Pu, Shiliang
    Chen, Yangkun
    Chen, Zhan
    [J]. PROCEEDINGS OF 2020 IEEE 15TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING (ICSP 2020), 2020, : 6 - 10
  • [48] Distributed Nonlocal Coupled Hierarchical Tucker Decomposition for Hyperspectral Image Fusion
    Zheng, Peng
    Sun, Jin
    Xu, Yang
    Zhang, Yi
    Wei, Zhihui
    Plaza, Javier
    Plaza, Antonio
    Wu, Zebin
    [J]. IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2023, 20
  • [49] Model-Driven Sparse CP Decomposition for Higher-Order Tensors
    Li, Jiajia
    Choi, Jee
    Perros, Ioakeim
    Sun, Jimeng
    Vuduc, Richard
    [J]. 2017 31ST IEEE INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM (IPDPS), 2017, : 1048 - 1057
  • [50] EEG multi-domain feature transfer based on sparse regularized Tucker decomposition
    Gao, Yunyuan
    Zhang, Congrui
    Huang, Jincheng
    Meng, Ming
    [J]. COGNITIVE NEURODYNAMICS, 2024, 18 (01) : 185 - 197