Scalable Tucker Factorization for Sparse Tensors - Algorithms and Discoveries

被引:48
|
作者
Oh, Sejoon [1 ]
Park, Namyong [2 ]
Sael, Lee [1 ]
Kang, U. [1 ]
机构
[1] Seoul Natl Univ, Seoul, South Korea
[2] Carnegie Mellon Univ, Pittsburgh, PA 15213 USA
基金
新加坡国家研究基金会;
关键词
SCALE; DECOMPOSITIONS;
D O I
10.1109/ICDE.2018.00104
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Given sparse multi-dimensional data (e.g., (user, movie, time; rating) for movie recommendations), how can we discover latent concepts/relations and predict missing values? Tucker factorization has been widely used to solve such problems with multi-dimensional data, which are modeled as tensors. However, most Tucker factorization algorithms regard and estimate missing entries as zeros, which triggers a highly inaccurate decomposition. Moreover, few methods focusing on an accuracy exhibit limited scalability since they require huge memory and heavy computational costs while updating factor matrices. In this paper, we propose P-TUCKER, a scalable Tucker factorization method for sparse tensors. P-TUCKER performs alternating least squares with a row-wise update rule in a fully parallel way, which significantly reduces memory requirements for updating factor matrices. Furthermore, we offer two variants of P-TUCKER: a caching algorithm P-TUCKER-CACHE and an approximation algorithm P-TUCKER-APPROX, both of which accelerate the update process. Experimental results show that P-TUCKER exhibits 1.7-14.1x speed-up and 1.4-4.8x less error compared to the state-of-the-art. In addition, P-TUCKER scales near linearly with the number of observable entries in a tensor and number of threads. Thanks to P-TUCKER, we successfully discover hidden concepts and relations in a large-scale real-world tensor, while existing methods cannot reveal latent features due to their limited scalability or low accuracy.
引用
收藏
页码:1120 / 1131
页数:12
相关论文
共 50 条
  • [31] Sparse nonnegative matrix factorization with genetic algorithms for microarray analysis
    Stadlthanner, K.
    Lutter, D.
    Theis, F. J.
    Lang, E. W.
    Tome, A. M.
    Georgieva, P.
    Puntonet, C. G.
    [J]. 2007 IEEE INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, VOLS 1-6, 2007, : 294 - +
  • [32] New Sparse Multivariate Polynomial Factorization Algorithms over Integers
    Huang, Qiao-Long
    Gao, Xiao-Shan
    [J]. PROCEEDINGS OF THE INTERNATIONAL SYMPOSIUM ON SYMBOLIC & ALGEBRAIC COMPUTATION, ISSAC 2023, 2023, : 315 - 324
  • [33] Improved symbolic and numerical factorization algorithms for unsymmetric sparse matrices
    Gupta, A
    [J]. SIAM JOURNAL ON MATRIX ANALYSIS AND APPLICATIONS, 2002, 24 (02) : 529 - 552
  • [34] FAST EFFICIENT AND SCALABLE CORE CONSISTENCY DIAGNOSTIC FOR THE PARAFAC DECOMPOSITION FOR BIG SPARSE TENSORS
    Papalexakis, Evangelos E.
    Faloutsos, Christos
    [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP), 2015, : 5441 - 5445
  • [35] Compact Data Structure and Scalable Algorithms for the Sparse Grid Technique
    Murarasu, Alin
    Weidendorfer, Josef
    Buse, Gerrit
    Butnaru, Daniel
    Pflueger, Dirk
    [J]. ACM SIGPLAN NOTICES, 2011, 46 (08) : 25 - 34
  • [36] A family of fast Walsh Hadamard algorithms with identical sparse matrix factorization
    Marti-Puig, Pere
    [J]. IEEE SIGNAL PROCESSING LETTERS, 2006, 13 (11) : 672 - 675
  • [37] DFacTo: Distributed Factorization of Tensors
    Choi, Joon Hee
    Vishwanathan, S. V. N.
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 27 (NIPS 2014), 2014, 27
  • [38] On Optimizing Distributed Tucker Decomposition for Dense Tensors
    Chakaravarthy, Venkatesan T.
    Choi, Jee W.
    Joseph, Douglas J.
    Liu, Xing
    Murali, Prakash
    Sabharwal, Yogish
    Sreedhar, Dheeraj
    [J]. 2017 31ST IEEE INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM (IPDPS), 2017, : 1038 - 1047
  • [39] RECOMPRESSION OF HADAMARD PRODUCTS OF TENSORS IN TUCKER FORMAT
    Kressner, Daniel
    Perisa, Lana
    [J]. SIAM JOURNAL ON SCIENTIFIC COMPUTING, 2017, 39 (05): : A1879 - A1902
  • [40] Static and Streaming Tucker Decomposition for Dense Tensors
    Jang, Jun-Gi
    Kang, U.
    [J]. ACM TRANSACTIONS ON KNOWLEDGE DISCOVERY FROM DATA, 2023, 17 (05)