Guided Semi-Supervised Non-Negative Matrix Factorization

Cited by: 2
Authors
Li, Pengyu [1]
Tseng, Christine [1]
Zheng, Yaxuan [1]
Chew, Joyce A. [1]
Huang, Longxiu [1]
Jarman, Benjamin [1]
Needell, Deanna [1]
Affiliations
[1] Univ Calif Los Angeles, Dept Math, Los Angeles, CA 90095 USA
Funding
National Science Foundation (USA)
Keywords
matrix decomposition; topic modeling; classification; semi-supervised learning; legal documents; California Innocence Project; ALGORITHMS
DOI
10.3390/a15050136
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Classification and topic modeling are popular techniques in machine learning that extract information from large-scale datasets. By incorporating a priori information such as labels or important features, methods have been developed to perform classification and topic modeling tasks; however, most methods that can perform both do not allow for guidance of the topics or features. In this paper, we propose a novel method, namely Guided Semi-Supervised Non-negative Matrix Factorization (GSSNMF), that performs both classification and topic modeling by incorporating supervision from both pre-assigned document class labels and user-designed seed words. We test the performance of this method on legal documents provided by the California Innocence Project and the 20 Newsgroups dataset. Our results show that the proposed method improves both classification accuracy and topic coherence in comparison to past methods such as Semi-Supervised Non-negative Matrix Factorization (SSNMF), Guided Non-negative Matrix Factorization (Guided NMF), and Topic Supervised NMF.
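The abstract does not state the GSSNMF objective, so the following is only an illustrative sketch of how label supervision and seed-word guidance could be combined in a single non-negative factorization. It assumes a hypothetical objective ||X - AS||_F^2 + lam * ||Y - BS||_F^2 + mu * ||Z - AC||_F^2, where X is a words-by-documents matrix, Y holds one-hot class labels per document, and Z marks seed words per guided topic. The function name, the symbols A, S, B, C, the weights lam and mu, and the multiplicative update rules are all assumptions made for illustration, not the authors' formulation.

```python
import numpy as np

def guided_ssnmf_sketch(X, Y, Z, rank, lam=1.0, mu=1.0, n_iter=200, seed=0, eps=1e-10):
    """Hypothetical joint factorization (NOT the paper's algorithm).

    Minimizes ||X - A S||_F^2 + lam * ||Y - B S||_F^2 + mu * ||Z - A C||_F^2
    over non-negative A, S, B, C using Lee-Seung style multiplicative updates.
    """
    rng = np.random.default_rng(seed)
    n_words, n_docs = X.shape
    A = rng.random((n_words, rank))      # word-topic dictionary
    S = rng.random((rank, n_docs))       # topic weights per document
    B = rng.random((Y.shape[0], rank))   # maps topics to class scores
    C = rng.random((rank, Z.shape[1]))   # maps topics to seed-word groups

    def loss():
        return (np.linalg.norm(X - A @ S) ** 2
                + lam * np.linalg.norm(Y - B @ S) ** 2
                + mu * np.linalg.norm(Z - A @ C) ** 2)

    history = [loss()]
    for _ in range(n_iter):
        # Each update rescales one factor elementwise, so non-negativity is preserved.
        A *= (X @ S.T + mu * Z @ C.T) / (A @ (S @ S.T) + mu * A @ (C @ C.T) + eps)
        S *= (A.T @ X + lam * B.T @ Y) / (A.T @ A @ S + lam * B.T @ B @ S + eps)
        B *= (Y @ S.T) / (B @ (S @ S.T) + eps)
        C *= (A.T @ Z) / (A.T @ A @ C + eps)
        history.append(loss())
    return A, S, B, C, history

# Toy data (random, for illustration only): 30 words, 12 documents, 3 classes.
rng = np.random.default_rng(1)
X = rng.random((30, 12))
labels = np.arange(12) % 3
Y = np.eye(3)[:, labels]                 # one-hot labels, shape (3, 12)
Z = np.zeros((30, 2))                    # two hypothetical seed-word groups
Z[[0, 1], 0] = 1
Z[[5, 6], 1] = 1

A, S, B, C, history = guided_ssnmf_sketch(X, Y, Z, rank=4)
predicted = np.argmax(B @ S, axis=0)     # class with the largest reconstructed score
```

In this sketch, the columns of A can be read as topics and B @ S as per-document class scores, which is one way a single factorization can serve both topic modeling and classification. Handling of unlabeled documents (for example, masking the label term) is omitted here.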
Pages: 18