A Nonlinear Matrix Decomposition for Mining the Zeros of Sparse Data

被引:1
|
作者
Saul, Lawrence K. [1 ]
机构
[1] Univ Calif San Diego, Dept Comp Sci & Engn, La Jolla, CA 92093 USA
来源
关键词
matrix factorization; latent variable modeling; unsupervised learning; NONNEGATIVE MATRIX; NONCONVEX OPTIMIZATION; UNIFIED VIEW; ALGORITHMS; FACTORIZATION; EM; COMPLETION; MODELS; APPROXIMATION; ACCELERATION;
D O I
10.1137/21M1405769
中图分类号
O29 [应用数学];
学科分类号
070104 ;
摘要
We describe a simple iterative solution to a widely recurring problem in multivariate data analysis: given a sparse nonnegative matrix X, how to estimate a low-rank matrix 8 such that X approximate to f(8), where f is an elementwise nonlinearity? We develop a latent variable model for this problem and consider those sparsifying nonlinearities, popular in neural networks, that map all negative values to zero. The model seeks to explain the variability of sparse high-dimensional data in terms of a smaller number of degrees of freedom. We show that exact inference in this model is tractable and derive an expectation-maximization (EM) algorithm to estimate the low-rank matrix 8. Notably, we do not parameterize 8 as a product of smaller matrices to be alternately optimized; instead, we estimate 8 directly via the singular value decomposition of matrices that are repeatedly inferred (at each iteration of the EM algorithm) from the model's posterior distribution. We use the model to analyze large sparse matrices that arise from data sets of binary, grayscale, and color images. In all of these cases, we find that the model discovers much lower-rank decompositions than purely linear approaches.
引用
收藏
页码:431 / 463
页数:33
相关论文
共 50 条
  • [1] Matrix Decomposition Methods for the Improvement of Data Mining in Telecommunications
    Joao, Zolana
    Mzyece, Mjumo
    Kurien, Anish
    [J]. 2009 IEEE 70TH VEHICULAR TECHNOLOGY CONFERENCE FALL, VOLS 1-4, 2009, : 1651 - 1655
  • [2] SPARSE REPRESENTATION OF HYPERSPECTRAL DATA USING CUR MATRIX DECOMPOSITION
    Sigurdsson, Jakob
    Ulfarsson, Magnus O.
    Sveinsson, Johannes R.
    Benediktsson, Jon Atli
    [J]. 2013 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS), 2013, : 433 - 436
  • [3] An Improvement Approach for Reducing Dimensionality of Data with Matrix Decomposition in Data Mining
    Jamshidzadeh, Sasan
    Hosseinkhani, Javad
    [J]. INTERNATIONAL JOURNAL OF COMPUTER SCIENCE AND NETWORK SECURITY, 2016, 16 (12): : 11 - 14
  • [4] Distributed Bayesian Matrix Decomposition for Big Data Mining and Clustering
    Zhang, Chihao
    Yang, Yang
    Zhou, Wei
    Zhang, Shihua
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2022, 34 (08) : 3701 - 3713
  • [5] INSIDER: Interpretable sparse matrix decomposition for RNA expression data analysis
    Zhao, Kai
    Huang, Sen
    Lin, Cuichan
    Sham, Pak Chung
    So, Hon-Cheong
    Lin, Zhixiang
    [J]. PLOS GENETICS, 2024, 20 (03):
  • [6] Low-Rank and Sparse Matrix Decomposition for Genetic Interaction Data
    Wang, Yishu
    Yang, Dejie
    Deng, Minghua
    [J]. BIOMED RESEARCH INTERNATIONAL, 2015, 2015
  • [7] SPARSE MATRIX TECHNIQUES IN THEORY OF DECOMPOSITION
    BHAT, MV
    KESAVAN, HK
    [J]. IEEE TRANSACTIONS ON POWER APPARATUS AND SYSTEMS, 1975, 94 (06): : 1917 - 1917
  • [8] Robust Matrix Decomposition With Sparse Corruptions
    Hsu, Daniel
    Kakade, Sham M.
    Zhang, Tong
    [J]. IEEE TRANSACTIONS ON INFORMATION THEORY, 2011, 57 (11) : 7221 - 7234
  • [9] Data mining with sparse grids
    Garcke, J
    Griebel, M
    Thess, M
    [J]. COMPUTING, 2001, 67 (03) : 225 - 253
  • [10] Data Mining with Sparse Grids
    J. Garcke
    M. Griebel
    M. Thess
    [J]. Computing, 2001, 67 : 225 - 253