On Optimizing Distributed Tucker Decomposition for Sparse Tensors

Cited by: 12

Authors:

Chakaravarthy, Venkatesan T. [1]

Choi, Jee W. [1]

Joseph, Douglas J. [1]

Murali, Prakash [1,2]

Pandian, Shivmaran S. [1]

Sabharwal, Yogish [1]

Sreedhar, Dheeraj [1]

Affiliations:

[1] IBM Res, Armonk, NY 10504 USA

[2] Princeton Univ, Princeton, NJ 08544 USA

Source:

INTERNATIONAL CONFERENCE ON SUPERCOMPUTING (ICS 2018) | 2018

Keywords:

Tensor decompositions; tensor distribution schemes

DOI:

10.1145/3205289.3205315

Chinese Library Classification (CLC) number:

TP301 [Theory, Methods]

Discipline classification code:

081202

Abstract:

The Tucker decomposition generalizes the notion of Singular Value Decomposition (SVD) to tensors, the higher dimensional analogues of matrices. We study the problem of constructing the Tucker decomposition of sparse tensors on distributed memory systems via the HOOI procedure, a popular iterative method. The scheme used for distributing the input tensor among the processors (MPI ranks) critically influences the HOOI execution time. Prior work has proposed different distribution schemes: an offline scheme based on a sophisticated hypergraph partitioning method, and simple, lightweight alternatives that can be used in real time. While the hypergraph-based scheme typically results in faster HOOI execution, its complexity means that the time taken to determine the distribution is an order of magnitude higher than the execution time of a single HOOI iteration. Our main contribution is a lightweight distribution scheme that achieves the best of both worlds. We show that the scheme is near-optimal on certain fundamental metrics associated with the HOOI procedure and, as a result, near-optimal on the computational load (FLOPs). Though the scheme may incur higher communication volume, the computation time is the dominant factor and, as a result, the scheme achieves better performance on the overall HOOI execution time. Our experimental evaluation on large real-life tensors (having up to 4 billion elements) shows that the scheme outperforms the prior schemes on the HOOI execution time by a factor of up to 3x. At the same time, its distribution time is comparable to that of the prior lightweight schemes and is typically less than the execution time of a single HOOI iteration.
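
For readers unfamiliar with HOOI, the following is a minimal single-node, dense NumPy sketch of the iteration the abstract refers to. It is not the paper's distributed sparse implementation and does not model any of the distribution schemes discussed. The function names (`unfold`, `mode_product`, `hooi`, `reconstruct`), the HOSVD-style initialization, and the fixed iteration count are illustrative assumptions.

```python
import numpy as np


def unfold(tensor, mode):
    """Matricize `tensor` so that `mode` indexes the rows."""
    return np.moveaxis(tensor, mode, 0).reshape(tensor.shape[mode], -1)


def mode_product(tensor, matrix, mode):
    """Multiply `tensor` by `matrix` (shape J x I_mode) along `mode`."""
    moved = np.moveaxis(tensor, mode, -1)   # bring `mode` to the last axis
    return np.moveaxis(moved @ matrix.T, -1, mode)


def hooi(tensor, ranks, n_iter=10):
    """Dense HOOI sketch (assumed HOSVD initialization, fixed iterations)."""
    # Initialize each factor with the leading left singular vectors
    # of the corresponding unfolding.
    factors = []
    for mode, rank in enumerate(ranks):
        u, _, _ = np.linalg.svd(unfold(tensor, mode), full_matrices=False)
        factors.append(u[:, :rank])

    for _ in range(n_iter):
        for mode in range(tensor.ndim):
            # Project the tensor onto all factors except the current mode.
            projected = tensor
            for other in range(tensor.ndim):
                if other != mode:
                    projected = mode_product(projected, factors[other].T, other)
            # Update the current factor from the projected unfolding.
            u, _, _ = np.linalg.svd(unfold(projected, mode), full_matrices=False)
            factors[mode] = u[:, :ranks[mode]]

    # The core is the tensor projected onto every factor.
    core = tensor
    for mode in range(tensor.ndim):
        core = mode_product(core, factors[mode].T, mode)
    return core, factors


def reconstruct(core, factors):
    """Expand a Tucker core and factor matrices back to a full tensor."""
    out = core
    for mode, factor in enumerate(factors):
        out = mode_product(out, factor, mode)
    return out
```

In this sketch the per-mode projection (the chain of `mode_product` calls) and the SVD are the steps whose cost depends on how tensor elements are assigned to processors in a distributed setting; here everything runs in one process on a dense array.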


Pages: 374-384

Number of pages: 11

