Scalable Tucker Factorization for Sparse Tensors - Algorithms and Discoveries

被引:48
|
作者
Oh, Sejoon [1 ]
Park, Namyong [2 ]
Sael, Lee [1 ]
Kang, U. [1 ]
机构
[1] Seoul Natl Univ, Seoul, South Korea
[2] Carnegie Mellon Univ, Pittsburgh, PA 15213 USA
基金
新加坡国家研究基金会;
关键词
SCALE; DECOMPOSITIONS;
D O I
10.1109/ICDE.2018.00104
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Given sparse multi-dimensional data (e.g., (user, movie, time; rating) for movie recommendations), how can we discover latent concepts/relations and predict missing values? Tucker factorization has been widely used to solve such problems with multi-dimensional data, which are modeled as tensors. However, most Tucker factorization algorithms regard and estimate missing entries as zeros, which triggers a highly inaccurate decomposition. Moreover, few methods focusing on an accuracy exhibit limited scalability since they require huge memory and heavy computational costs while updating factor matrices. In this paper, we propose P-TUCKER, a scalable Tucker factorization method for sparse tensors. P-TUCKER performs alternating least squares with a row-wise update rule in a fully parallel way, which significantly reduces memory requirements for updating factor matrices. Furthermore, we offer two variants of P-TUCKER: a caching algorithm P-TUCKER-CACHE and an approximation algorithm P-TUCKER-APPROX, both of which accelerate the update process. Experimental results show that P-TUCKER exhibits 1.7-14.1x speed-up and 1.4-4.8x less error compared to the state-of-the-art. In addition, P-TUCKER scales near linearly with the number of observable entries in a tensor and number of threads. Thanks to P-TUCKER, we successfully discover hidden concepts and relations in a large-scale real-world tensor, while existing methods cannot reveal latent features due to their limited scalability or low accuracy.
引用
收藏
页码:1120 / 1131
页数:12
相关论文
共 50 条
  • [1] SWIFT: Scalable Wasserstein Factorization for Sparse Nonnegative Tensors
    Afshar, Ardavan
    Yin, Kejing
    Yan, Sherry
    Qian, Cheng
    Ho, Joyce
    Park, Haesun
    Sun, Jimeng
    [J]. THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 6548 - 6556
  • [2] VEST: Very Sparse Tucker Factorization of Large-Scale Tensors
    Park, Moonjeong
    Jang, Jun-Gi
    Sael, Lee
    [J]. 2021 IEEE INTERNATIONAL CONFERENCE ON BIG DATA AND SMART COMPUTING (BIGCOMP 2021), 2021, : 172 - 179
  • [3] High Performance Parallel Algorithms for the Tucker Decomposition of Sparse Tensors
    Kaya, Oguz
    Ucar, Bora
    [J]. PROCEEDINGS 45TH INTERNATIONAL CONFERENCE ON PARALLEL PROCESSING - ICPP 2016, 2016, : 103 - 112
  • [4] Highly scalable parallel algorithms for sparse matrix factorization
    Gupta, A
    Karypis, G
    Kumar, V
    [J]. IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 1997, 8 (05) : 502 - 520
  • [5] On Optimizing Distributed Tucker Decomposition for Sparse Tensors
    Chakaravarthy, Venkatesan T.
    Choi, Jee W.
    Joseph, Douglas J.
    Murali, Prakash
    Pandian, Shivmaran S.
    Sabharwal, Yogish
    Sreedhar, Dheeraj
    [J]. INTERNATIONAL CONFERENCE ON SUPERCOMPUTING (ICS 2018), 2018, : 374 - 384
  • [6] Accelerating the Tucker Decomposition with Compressed Sparse Tensors
    Smith, Shaden
    Karypis, George
    [J]. EURO-PAR 2017: PARALLEL PROCESSING, 2017, 10417 : 653 - 668
  • [7] Separation of Composite Tensors with Sparse Tucker Representations
    Prater-Bennette, Ashley
    Carr, Kenneth Theodore
    [J]. BIG DATA: LEARNING, ANALYTICS, AND APPLICATIONS, 2019, 10989
  • [8] Efficient and Scalable Computations with Sparse Tensors
    Baskaran, Muthu
    Meister, Benoit
    Vasilache, Nicolas
    Lethin, Richard
    [J]. 2012 IEEE CONFERENCE ON HIGH PERFORMANCE EXTREME COMPUTING (HPEC), 2012,
  • [9] Algorithms for sparse nonnegative Tucker decompositions
    Morup, Morten
    Hansen, Lars Kai
    Arnfred, Sidse M.
    [J]. NEURAL COMPUTATION, 2008, 20 (08) : 2112 - 2131
  • [10] Sparse Hierarchical Tucker Factorization and its Application to Healthcare
    Perros, Ioakeim
    Chen, Robert
    Vuduc, Richard
    Sun, Jimeng
    [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON DATA MINING (ICDM), 2015, : 943 - 948