Shared-memory and shared-nothing stochastic gradient descent algorithms for matrix completion

被引:0
|
作者
Faraz Makari
Christina Teflioudi
Rainer Gemulla
Peter Haas
Yannis Sismanis
机构
[1] Max Planck Institute for Computer Science,
[2] IBM Almaden Research Center,undefined
[3] Google,undefined
来源
关键词
Parallel and distributed matrix completion; Low-rank matrix factorization; Stochastic gradient descent; Recommender systems;
D O I
暂无
中图分类号
学科分类号
摘要
We provide parallel algorithms for large-scale matrix completion on problems with millions of rows, millions of columns, and billions of revealed entries. We focus on in-memory algorithms that run either in a shared-memory environment on a powerful compute node or in a shared-nothing environment on a small cluster of commodity nodes; even very large problems can be handled effectively in these settings. Our ASGD, DSGD-MR, DSGD++, and CSGD algorithms are novel variants of the popular stochastic gradient descent (SGD) algorithm, with the latter three algorithms based on a new “stratified SGD” approach. All of the algorithms are cache-friendly and exploit thread-level parallelism, in-memory processing, and asynchronous communication. We investigate the performance of both new and existing algorithms via a theoretical complexity analysis and a set of large-scale experiments. The results show that CSGD is more scalable, and up to 60 % faster, than the best-performing alternative method in the shared-memory setting. DSGD++ is superior in terms of overall runtime, memory consumption, and scalability in the shared-nothing setting. For example, DSGD++ can solve a difficult matrix completion problem on a high-variance matrix with 10M rows, 1M columns, and 10B revealed entries in around 40 min on 16 compute nodes. In general, algorithms based on SGD appear to perform better than algorithms based on alternating minimizations, such as the PALS and DALS alternating least-squares algorithms.
引用
收藏
页码:493 / 523
页数:30
相关论文
共 50 条
  • [1] Shared-memory and shared-nothing stochastic gradient descent algorithms for matrix completion
    Makari, Faraz
    Teflioudi, Christina
    Gemulla, Rainer
    Haas, Peter
    Sismanis, Yannis
    [J]. KNOWLEDGE AND INFORMATION SYSTEMS, 2015, 42 (03) : 493 - 523
  • [2] Stochastic Gradient Descent for matrix completion: Hybrid parallelization on shared- and distributed-memory systems
    Buyukkaya, Kemal
    Karsavuran, M. Ozan
    Aykanat, Cevdet
    [J]. KNOWLEDGE-BASED SYSTEMS, 2024, 283
  • [3] The Convergence of Stochastic Gradient Descent in Asynchronous Shared Memory
    Alistarh, Dan
    De Sa, Christopher
    Konstantinov, Nikola
    [J]. PODC'18: PROCEEDINGS OF THE 2018 ACM SYMPOSIUM ON PRINCIPLES OF DISTRIBUTED COMPUTING, 2018, : 169 - 177
  • [4] On the Use of Shared Storage in Shared-Nothing Environments
    Krish, K. R.
    Khasymski, Aleksandr
    Wang, Guanying
    Butt, Ali R.
    Makkar, Gaurav
    [J]. 2013 IEEE INTERNATIONAL CONFERENCE ON BIG DATA, 2013,
  • [5] An Asynchronously Alternative Stochastic Gradient Descent Algorithm for Efficiently Parallel Latent Feature Analysis on Shared-Memory
    Qin, Wen
    Luo, Xin
    [J]. 2022 IEEE INTERNATIONAL CONFERENCE ON KNOWLEDGE GRAPH (ICKG), 2022, : 217 - 224
  • [6] Fast brain tumor detection using adaptive stochastic gradient descent on shared-memory parallel environment
    Qin, Chuandong
    Li, Baosheng
    Han, Baole
    [J]. ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2023, 120
  • [7] High Performance Parallel Stochastic Gradient Descent in Shared Memory
    Sallinen, Scott
    Satish, Nadathur
    Smelyanskiy, Mikhail
    Sury, Samantika S.
    Re, Christopher
    [J]. 2016 IEEE 30TH INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM (IPDPS 2016), 2016, : 873 - 882
  • [8] SYNCHRONIZATION ALGORITHMS FOR SHARED-MEMORY MULTIPROCESSORS
    GRAUNKE, G
    THAKKAR, S
    [J]. COMPUTER, 1990, 23 (06) : 60 - 69
  • [9] VIRTUAL SHARED-MEMORY - ALGORITHMS AND COMPLEXITY
    CHIN, A
    MCCOLL, WF
    [J]. INFORMATION AND COMPUTATION, 1994, 113 (02) : 199 - 219
  • [10] IMPLEMENTATION ANALYSIS OF FAST MATRIX MULTIPLICATION ALGORITHMS ON SHARED-MEMORY COMPUTERS
    FRANCOMANO, E
    MACALUSO, AT
    VAJTERSIC, M
    [J]. COMPUTERS AND ARTIFICIAL INTELLIGENCE, 1995, 14 (03): : 299 - 313