Doubly Nonparametric Sparse Nonnegative Matrix Factorization Based on Dependent Indian Buffet Processes

被引:20
|
作者
Xuan, Junyu [1 ,2 ]
Lu, Jie [1 ,2 ]
Zhang, Guangquan [1 ,2 ]
Xu, Richard Yi Da [1 ,2 ]
Luo, Xiangfeng [2 ]
机构
[1] Univ Technol Sydney, Fac Engn & Informat Technol, Ultimo, NSW 2007, Australia
[2] Shanghai Univ, Sch Comp Engn & Sci, Shanghai 200444, Peoples R China
基金
美国国家科学基金会; 澳大利亚研究理事会;
关键词
Co-clustering; nonnegative matrix factorization; probability graphical model; text mining; VECTOR;
D O I
10.1109/TNNLS.2017.2676817
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Sparse nonnegative matrix factorization (SNMF) aims to factorize a data matrix into two optimized nonnegative sparse factor matrices, which could benefit many tasks, such as document-word co-clustering. However, the traditional SNMF typically assumes the number of latent factors (i.e., dimensionality of the factor matrices) to be fixed. This assumption makes it inflexible in practice. In this paper, we propose a doubly sparse nonparametric NMF framework to mitigate this issue by using dependent Indian buffet processes (dIBP). We apply a correlation function for the generation of two stick weights associated with each column pair of factor matrices while still maintaining their respective marginal distribution specified by IBP. As a consequence, the generation of two factor matrices will be columnwise correlated. Under this framework, two classes of correlation function are proposed: 1) using bivariate Beta distribution and 2) using Copula function. Compared with the single IBP-based NMF, this paper jointly makes two factor matrices nonparametric and sparse, which could be applied to broader scenarios, such as co-clustering. This paper is seen to be much more flexible than Gaussian process-based and hierarchial Beta process-based dIBPs in terms of allowing the two corresponding binary matrix columns to have greater variations in their nonzero entries. Our experiments on synthetic data show the merits of this paper compared with the state-of-the-art models in respect of factorization efficiency, sparsity, and flexibility. Experiments on real-world data sets demonstrate the efficiency of this paper in document-word co-clustering tasks.
引用
收藏
页码:1835 / 1849
页数:15
相关论文
共 50 条
  • [31] A sparse nonnegative matrix factorization technique for graph matching problems
    Jiang, Bo
    Zhao, Haifeng
    Tang, Jin
    Luo, Bin
    PATTERN RECOGNITION, 2014, 47 (02) : 736 - 747
  • [32] Sparse and Unique Nonnegative Matrix Factorization Through Data Preprocessing
    Gillis, Nicolas
    JOURNAL OF MACHINE LEARNING RESEARCH, 2012, 13 : 3349 - 3386
  • [33] Sparse nonnegative matrix factorization with genetic algorithms for microarray analysis
    Stadlthanner, K.
    Lutter, D.
    Theis, F. J.
    Lang, E. W.
    Tome, A. M.
    Georgieva, P.
    Puntonet, C. G.
    2007 IEEE INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, VOLS 1-6, 2007, : 294 - +
  • [34] Improved hypergraph regularized Nonnegative Matrix Factorization with sparse representation
    Huang, Sheng
    Wang, Hongxing
    Ge, Yongxin
    Huangfu, Luwen
    Zhang, Xiaohong
    Yang, Dan
    PATTERN RECOGNITION LETTERS, 2018, 102 : 8 - 14
  • [35] Spectral Unmixing Using Sparse and Smooth Nonnegative Matrix Factorization
    Wu, Changyuan
    Shen, Chaomin
    2013 21ST INTERNATIONAL CONFERENCE ON GEOINFORMATICS (GEOINFORMATICS), 2013,
  • [36] Sparse nonnegative matrix factorization for hyperspectral optimal band selection
    Shi, B. (carashi@163.com), 1600, SinoMaps Press (42):
  • [37] GROUP SPARSE NONNEGATIVE MATRIX FACTORIZATION FOR HYPERSPECTRAL IMAGE DENOISING
    Xu, Yangyang
    Qian, Yuntao
    2016 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS), 2016, : 6958 - 6961
  • [38] Sparse Symmetric Nonnegative Matrix Factorization Applied to Face Recognition
    Dobrovolskyi, Hennadii
    Keberle, Nataliya
    Ternovyy, Yehor
    PROCEEDINGS OF THE 2017 9TH IEEE INTERNATIONAL CONFERENCE ON INTELLIGENT DATA ACQUISITION AND ADVANCED COMPUTING SYSTEMS: TECHNOLOGY AND APPLICATIONS (IDAACS), VOL 2, 2017, : 1042 - 1045
  • [39] Discriminative separable nonnegative matrix factorization by structured sparse regularization
    Wang, Shengzheng
    Peng, Jing
    Liu, Wei
    SIGNAL PROCESSING, 2016, 120 : 620 - 626
  • [40] Sparse nonnegative matrix factorization applied to microarray data sets
    Stadlthanner, K
    Theis, FJ
    Lang, EW
    Tomé, AM
    Puntonet, CG
    Vilda, PG
    Langmann, T
    Schmitz, G
    INDEPENDENT COMPONENT ANALYSIS AND BLIND SIGNAL SEPARATION, PROCEEDINGS, 2006, 3889 : 254 - 261