Matrix factorization of large scale data using multistage matrix factorization

Cited by: 0
Authors
Prasad Bhavana
Vineet Padmanabhan
Affiliation
[1] University of Hyderabad,School of Computer and Information Sciences
Source
Applied Intelligence | 2021 / Vol. 51
Keywords
Multistage matrix factorization; Two-stage matrix factorization; Hierarchical matrix factorization
DOI
Not available
Abstract
Matrix Factorization (MF) is a resource-intensive task that consumes significant memory and computational effort and does not scale well with the volume of data. When the input matrix and the latent feature matrices are larger than the available memory, whether on a Central Processing Unit (CPU) or a Graphics Processing Unit (GPU), it may not be possible to load all the required matrices into CPU/GPU memory. Such scenarios call for alternative techniques that not only allow parallelism but also address memory limitations; such techniques play a crucial role in industrial applications. In this paper we propose a divide-and-conquer technique based on a two-stage factorization process. In the first stage, we divide the data set into groups and factorize each group. In the second stage, we use a factorization-based learning model to combine the latent features derived in the first stage. Our motivation is to develop a method that achieves both parallelism and scalability and also handles factorization of incrementally growing data. Our contribution is a novel multi-stage matrix factorization (MsMF) approach. The experimental results demonstrate improvements in RMSE as well as computational efficiency.
Pages: 4016 - 4028
Page count: 12
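
The two-stage process outlined in the abstract can be illustrated with a small sketch. This is not the authors' MsMF algorithm, whose details are not given in this record. It is a minimal illustration under several assumptions: the data set is split by rows, each group is factorized with regularized alternating least squares (ALS), and the second stage combines the per-group item factors with a truncated SVD. The function names `als_factorize` and `two_stage_mf`, the parameters, and the synthetic data are all hypothetical.

```python
import numpy as np


def als_factorize(R, mask, k, reg=0.1, iters=20, seed=0):
    """Fit R ~ U @ V.T on observed entries (mask == True) with regularized ALS."""
    rng = np.random.default_rng(seed)
    m, n = R.shape
    U = rng.normal(scale=0.1, size=(m, k))
    V = rng.normal(scale=0.1, size=(n, k))
    ridge = reg * np.eye(k)
    for _ in range(iters):
        # Update each row factor from the items observed in that row.
        for i in range(m):
            Vi = V[mask[i]]
            U[i] = np.linalg.solve(Vi.T @ Vi + ridge, Vi.T @ R[i, mask[i]])
        # Update each item factor from the rows that observed that item.
        for j in range(n):
            Uj = U[mask[:, j]]
            V[j] = np.linalg.solve(Uj.T @ Uj + ridge, Uj.T @ R[mask[:, j], j])
    return U, V


def two_stage_mf(R, mask, k, n_groups):
    """Illustrative two-stage factorization (assumed design, not the paper's)."""
    row_groups = np.array_split(np.arange(R.shape[0]), n_groups)

    # Stage 1: factorize each row group independently.
    # The groups do not depend on each other, so this loop could run in parallel
    # and only one group needs to be in memory at a time.
    stage1 = [
        als_factorize(R[g], mask[g], k, seed=s) for s, g in enumerate(row_groups)
    ]

    # Stage 2: combine the per-group item factors by factorizing their
    # horizontal concatenation C (n x G*k) as C ~ V_shared @ S.T.
    C = np.hstack([V_g for _, V_g in stage1])
    W, s, Zt = np.linalg.svd(C, full_matrices=False)
    V_shared = W[:, :k] * s[:k]  # shared item factors, n x k
    S = Zt[:k].T                 # mixing matrix, (G*k) x k

    # Since V_g ~ V_shared @ S_g.T, we get R_g ~ (U_g @ S_g) @ V_shared.T.
    U = np.vstack(
        [U_g @ S[i * k:(i + 1) * k] for i, (U_g, _) in enumerate(stage1)]
    )
    return U, V_shared


# Synthetic rank-3 data with roughly 80% of entries observed (assumed setup).
rng = np.random.default_rng(42)
R_true = rng.normal(size=(60, 3)) @ rng.normal(size=(3, 40))
mask = rng.random(R_true.shape) < 0.8
U, V = two_stage_mf(R_true, mask, k=3, n_groups=3)
rmse = np.sqrt(np.mean((U @ V.T - R_true)[mask] ** 2))
print(f"RMSE on observed entries: {rmse:.4f}")
```

In this sketch the first stage touches only one group at a time, which is what would allow it to fit in limited memory or be distributed. The second stage works only on the small latent matrices, never on the full input matrix.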
Related papers
  • [1] Matrix factorization of large scale data using multistage matrix factorization
    Bhavana, Prasad
    Padmanabhan, Vineet
    [J]. APPLIED INTELLIGENCE, 2021, 51 (06) : 4016 - 4028
  • [2] BMF: Matrix Factorization of Large Scale Data Using Block Based Approach
    Bhavana, Prasad
    Padmanabhan, Vineet
    [J]. PRICAI 2019: TRENDS IN ARTIFICIAL INTELLIGENCE, PT II, 2019, 11671 : 431 - 436
  • [3] Efficient Large-Scale Similarity Search Using Matrix Factorization
    Iscen, Ahmet
    Rabbat, Michael
    Furon, Teddy
    [J]. 2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 2073 - 2081
  • [4] Data Fusion by Matrix Factorization
    Zitnik, Marinka
    Zupan, Blaz
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2015, 37 (01) : 41 - 53
  • [5] Matrix Factorization for Evolution Data
    Huang, Xiao-Yu
    Xiang, Xian-Hong
    Li, Wubin
    Chen, Kang
    Cai, Wen-Xue
    Li, Lei
    [J]. MATHEMATICAL PROBLEMS IN ENGINEERING, 2014, 2014
  • [6] Clustering Data using a Nonnegative Matrix Factorization (NMF)
    Abdulla, Hussam Dahwa
    Polovincak, Martin
    Snasel, Vaclav
    [J]. 2009 SECOND INTERNATIONAL CONFERENCE ON THE APPLICATIONS OF DIGITAL INFORMATION AND WEB TECHNOLOGIES (ICADIWT 2009), 2009, : 749 - 752
  • [7] Large-Scale Distributed Bayesian Matrix Factorization using Stochastic Gradient MCMC
    Ahn, Sungjin
    Korattikara, Anoop
    Liu, Nathan
    Rajan, Suju
    Welling, Max
    [J]. KDD'15: PROCEEDINGS OF THE 21ST ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2015, : 9 - 18
  • [8] Bayesian Matrix Factorization for Semibounded Data
    Dalhoumi, Oumayma
    Bouguila, Nizar
    Amayri, Manar
    Fan, Wentao
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, 34 (06) : 3111 - 3123
  • [9] MATRIX SPECTRAL FACTORIZATION WITH PERTURBED DATA
    Ephremidze, Lasha
    Spitkovsky, Ilya
    [J]. MEMOIRS ON DIFFERENTIAL EQUATIONS AND MATHEMATICAL PHYSICS, 2015, 66 : 65 - 82
  • [10] Tensor Factorization via Matrix Factorization
    Kuleshov, Volodymyr
    Chaganty, Arun Tejasvi
    Liang, Percy
    [J]. ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 38, 2015, 38 : 507 - 516