Nonnegative Matrix Factorization for Document Clustering: A Survey

被引:0
|
作者
Hosseini-Asl, Ehsan [1 ]
Zurada, Jacek M. [1 ]
机构
[1] Univ Louisville, Dept Elect & Comp Engn, Louisville, KY 40292 USA
关键词
Nonnegative Matrix Factorization; Document clustering; optimization algorithm; CORRENTROPY; ALGORITHMS; DIVERGENCE;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Nonnegative Matrix Factorization (NMF) is a popular dimension reduction technique of clustering by extracting latent features from high-dimensional data and is widely used for text mining. Several optimization algorithms have been developed for NMF with different cost functions. In this paper we apply several methods of NMF that have been developed for data analysis. These methods vary in using different cost function for matrix factorization and different optimization algorithms for minimizing the cost function. Reuters Document Corpus is used for evaluating the performance of each method. The methods are compared with respect to their accuracy, entropy, purity and computational complexity and residual mean square root error. The most efficient methods in terms of each performance measure are also recognized.
引用
收藏
页码:726 / 737
页数:12
相关论文
共 50 条
  • [1] Document clustering using nonnegative matrix factorization/
    Shahnaz, F
    Berry, MW
    Pauca, VP
    Plemmons, RJ
    [J]. INFORMATION PROCESSING & MANAGEMENT, 2006, 42 (02) : 373 - 386
  • [2] Document clustering based on nonnegative sparse matrix factorization
    Yang, CF
    Ye, M
    Zhao, J
    [J]. ADVANCES IN NATURAL COMPUTATION, PT 2, PROCEEDINGS, 2005, 3611 : 557 - 563
  • [3] Automated Graph Regularized Projective Nonnegative Matrix Factorization for Document Clustering
    Pei, Xiaobing
    Wu, Tao
    Chen, Chuanbo
    [J]. IEEE TRANSACTIONS ON CYBERNETICS, 2014, 44 (10) : 1821 - 1831
  • [4] Constrained Clustering With Nonnegative Matrix Factorization
    Zhang, Xianchao
    Zong, Linlin
    Liu, Xinyue
    Luo, Jiebo
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2016, 27 (07) : 1514 - 1526
  • [5] Automatic Multi-document Summarization Based on Clustering and Nonnegative Matrix Factorization
    Park, Sun
    Cha, ByungRea
    An, Dong Un
    [J]. IETE TECHNICAL REVIEW, 2010, 27 (02) : 167 - 178
  • [6] Fast Rank-2 Nonnegative Matrix Factorization for Hierarchical Document Clustering
    Kuang, Da
    Park, Haesun
    [J]. 19TH ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING (KDD'13), 2013, : 739 - 747
  • [7] A survey of deep nonnegative matrix factorization
    Chen, Wen-Sheng
    Zeng, Qianwen
    Pan, Binbin
    [J]. NEUROCOMPUTING, 2022, 491 : 305 - 320
  • [8] A nonnegative matrix factorization framework for semi-supervised document clustering with dual constraints
    Ma, Huifang
    Zhao, Weizhong
    Shi, Zhongzhi
    [J]. KNOWLEDGE AND INFORMATION SYSTEMS, 2013, 36 (03) : 629 - 651
  • [9] Distributional Clustering Using Nonnegative Matrix Factorization
    Zhu, Zhenfeng
    Ye, Yangdong
    [J]. PROCEEDINGS OF THE 10TH WORLD CONGRESS ON INTELLIGENT CONTROL AND AUTOMATION (WCICA 2012), 2012, : 4705 - 4711
  • [10] Incremental Clustering via Nonnegative Matrix Factorization
    Bucak, Serhat Selcuk
    Gunsel, Bilge
    [J]. 19TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOLS 1-6, 2008, : 640 - 643