Communication-efficient algorithms for parallel latent Dirichlet allocation

被引:3
|
作者
Yan, Jian-Feng [1 ]
Zeng, Jia [1 ]
Gao, Yang [1 ]
Liu, Zhi-Qiang [2 ]
机构
[1] Suzhou Univ, Sch Comp Sci & Technol, Suzhou 215006, Peoples R China
[2] City Univ Hong Kong, Sch Creat Media, Hong Kong, Hong Kong, Peoples R China
关键词
Latent Dirichlet allocation; Parallel learning; Zipf's law; Belief propagation; Gibbs sampling;
D O I
10.1007/s00500-014-1376-8
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Latent Dirichlet allocation (LDA) is a popular topic modeling method which has found many multimedia applications, such as motion analysis and image categorization. Communication cost is one of the main bottlenecks for large-scale parallel learning of LDA. To reduce communication cost, we introduce Zipf's law and propose novel parallel LDA algorithms that communicate only partial important information at each learning iteration. The proposed algorithms are much more efficient than the current state-of-theart algorithms in both communication and computation costs. Extensive experiments on large-scale data sets demonstrate that our algorithms can greatly reduce communication and computation costs to achieve a better scalability.
引用
收藏
页码:3 / 11
页数:9
相关论文
共 50 条
  • [1] Communication-efficient algorithms for parallel latent Dirichlet allocation
    Jian-Feng Yan
    Jia Zeng
    Yang Gao
    Zhi-Qiang Liu
    Soft Computing, 2015, 19 : 3 - 11
  • [2] Parallel Latent Dirichlet Allocation on GPUs
    Moon, Gordon E.
    Nisa, Israt
    Sukumaran-Rajam, Aravind
    Bandyopadhyay, Bortik
    Parthasarathy, Srinivasan
    Sadayappan, P.
    COMPUTATIONAL SCIENCE - ICCS 2018, PT II, 2018, 10861 : 259 - 272
  • [3] Communication-efficient parallel sorting
    Goodrich, MT
    SIAM JOURNAL ON COMPUTING, 1999, 29 (02) : 416 - 432
  • [4] Communication-efficient parallel sorting
    Goodrich, Michael T.
    SIAM Journal on Computing, 29 (02): : 416 - 432
  • [5] COMMUNICATION-EFFICIENT PARALLEL ALGORITHMS FOR DISTRIBUTED RANDOM-ACCESS MACHINES
    LEISERSON, CE
    MAGGS, BM
    ALGORITHMICA, 1988, 3 (01) : 53 - 77
  • [6] Comparison of Estimation Algorithms for Latent Dirichlet Allocation
    Mardones-Segovia, Constanza
    Choi, Hye-Jeong
    Hong, Minju
    Wheeler, Jordan M.
    Cohen, Allan S.
    QUANTITATIVE PSYCHOLOGY, 2022, 393 : 27 - 37
  • [7] Communication-Efficient Algorithms for Statistical Optimization
    Zhang, Yuchen
    Duchi, John C.
    Wainwright, Martin J.
    2012 IEEE 51ST ANNUAL CONFERENCE ON DECISION AND CONTROL (CDC), 2012, : 6792 - 6792
  • [8] Scalable Parallel EM Algorithms for Latent Dirichlet Allocation in Multi-Core Systems
    Liu, Xiaosheng
    Zeng, Jia
    Yang, Xi
    Yan, Jianfeng
    Yang, Qiang
    PROCEEDINGS OF THE 24TH INTERNATIONAL CONFERENCE ON WORLD WIDE WEB (WWW 2015), 2015, : 669 - 679
  • [9] Communication-efficient parallel Gaussian elimination
    Tiskin, A
    PARALLEL COMPUTING TECHNOLOGIES, PROCEEDINGS, 2003, 2763 : 369 - 383
  • [10] Communication-Efficient Algorithms for Statistical Optimization
    Zhang, Yuchen
    Duchi, John C.
    Wainwright, Martin J.
    JOURNAL OF MACHINE LEARNING RESEARCH, 2013, 14 : 3321 - 3363