Accelerated parallel and distributed algorithm using limited internal memory for nonnegative matrix factorization

被引:0
|
作者
Duy Khuong Nguyen
Tu Bao Ho
机构
[1] Japan Advanced Institute of Science and Technology,University of Engineering and Technology
[2] Vietnam National University,John von Neumann Institute
[3] Vietnam National University,undefined
来源
关键词
Non-negative matrix factorization; Accelerated anti-lopsided algorithm; Cooridinate descent algorithm; Parallel and distributed algorithm;
D O I
暂无
中图分类号
学科分类号
摘要
Nonnegative matrix factorization (NMF) is a powerful technique for dimension reduction, extracting latent factors and learning part-based representation. For large datasets, NMF performance depends on some major issues such as fast algorithms, fully parallel distributed feasibility and limited internal memory. This research designs a fast fully parallel and distributed algorithm using limited internal memory to reach high NMF performance for large datasets. Specially, we propose a flexible accelerated algorithm for NMF with all its L1\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$L_1$$\end{document}L2\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$L_2$$\end{document} regularized variants based on full decomposition, which is a combination of exact line search, greedy coordinate descent, and accelerated search. The proposed algorithm takes advantages of these algorithms to converges linearly at an over-bounded rate (1-μL)(1-μrL)2r\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$(1-\frac{\mu }{L})(1 - \frac{\mu }{rL})^{2r}$$\end{document} in optimizing each factor matrix when fixing the other factor one in the sub-space of passive variables, where r is the number of latent components, and μ\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\mu $$\end{document} and L are bounded as 12≤μ≤L≤r\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\frac{1}{2} \le \mu \le L \le r$$\end{document}. In addition, the algorithm can exploit the data sparseness to run on large datasets with limited internal memory of machines, which is is advanced compared to fast block coordinate descent methods and accelerated methods. Our experimental results are highly competitive with seven state-of-the-art methods about three significant aspects of convergence, optimality and average of the iteration numbers.
引用
收藏
页码:307 / 328
页数:21
相关论文
共 50 条
  • [1] Accelerated parallel and distributed algorithm using limited internal memory for nonnegative matrix factorization
    Duy Khuong Nguyen
    Tu Bao Ho
    [J]. JOURNAL OF GLOBAL OPTIMIZATION, 2017, 68 (02) : 307 - 328
  • [2] Parallel Nonnegative Matrix Factorization Algorithm on the Distributed Memory Platform
    Chao Dong
    Huijie Zhao
    Wei Wang
    [J]. International Journal of Parallel Programming, 2010, 38 : 117 - 137
  • [3] Parallel Nonnegative Matrix Factorization Algorithm on the Distributed Memory Platform
    Dong, Chao
    Zhao, Huijie
    Wang, Wei
    [J]. INTERNATIONAL JOURNAL OF PARALLEL PROGRAMMING, 2010, 38 (02) : 117 - 137
  • [4] Distributed-Memory Parallel Symmetric Nonnegative Matrix Factorization
    Eswar, Srinivas
    Hayashi, Koby
    Ballard, Grey
    Kannan, Ramakrishnan
    Vuduc, Richard
    Park, Haesun
    [J]. PROCEEDINGS OF SC20: THE INTERNATIONAL CONFERENCE FOR HIGH PERFORMANCE COMPUTING, NETWORKING, STORAGE AND ANALYSIS (SC20), 2020,
  • [5] An Accelerated Symmetric Nonnegative Matrix Factorization Algorithm Using Extrapolation
    Wang, Peitao
    He, Zhaoshui
    Lu, Jun
    Tan, Beihai
    Bai, YuLei
    Tan, Ji
    Liu, Taiheng
    Lin, Zhijie
    [J]. SYMMETRY-BASEL, 2020, 12 (07):
  • [6] Distributed Nonnegative Matrix Factorization with HALS Algorithm on MapReduce
    Zdunek, Rafal
    Fonal, Krzysztof
    [J]. ALGORITHMS AND ARCHITECTURES FOR PARALLEL PROCESSING, ICA3PP 2017, 2017, 10393 : 211 - 222
  • [7] A High-Performance Parallel Algorithm for Nonnegative Matrix Factorization
    Kannan, Ramakrishnan
    Ballard, Grey
    Park, Haesun
    [J]. ACM SIGPLAN NOTICES, 2016, 51 (08) : 99 - 109
  • [8] Distributed Nonnegative Matrix Factorization with HALS Algorithm on Apache Spark
    Fonal, Krzysztof
    Zdunek, Rafal
    [J]. ARTIFICIAL INTELLIGENCE AND SOFT COMPUTING (ICAISC 2018), PT II, 2018, 10842 : 333 - 342
  • [9] A Topographical Nonnegative Matrix Factorization algorithm
    Rogovschi, Nicoleta
    Labiod, Lazhar
    Nadif, Mohamed
    [J]. 2013 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2013,
  • [10] Parallel Nonnegative Matrix Factorization with Manifold Regularization
    Liu, Fudong
    Shan, Zheng
    Chen, Yihang
    [J]. JOURNAL OF ELECTRICAL AND COMPUTER ENGINEERING, 2018, 2018