Text mining using non-negative matrix factorizations

Cited by: 0
Authors
Pauca, VP [1]
Shahnaz, F [1]
Berry, MW [1]
Plemmons, RJ [1]
Affiliation
[1] Wake Forest Univ, Dept Comp Sci, Winston Salem, NC 27109 USA
Keywords
text mining; non-negative matrix factorization; clustering; dimension reduction; semantic feature identification
DOI
Not available
Chinese Library Classification
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
This study involves a methodology for the automatic identification of semantic features and document clusters in a heterogeneous text collection. The methodology is based upon encoding the data using low rank nonnegative matrix factorization algorithms to preserve natural data non-negativity and thus avoid subtractive basis vector and encoding interactions present in techniques such as principal component analysis. Some existing non-negative matrix factorization techniques are reviewed and some new ones are proposed. Numerical experiments are reported on the use of a hybrid NMF algorithm to produce a parts-based approximation of a sparse term-by-document matrix. The resulting basis vectors and matrix projection can be used to identify underlying semantic features (topics) and document clusters of the corresponding text collection.
Pages: 452-456
Page count: 5
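
The factorization described in the abstract can be illustrated with a minimal sketch. It approximates a small non-negative term-by-document matrix A by a product W H of two non-negative factors, reads topics off the columns of W, and reads document clusters off the rows of H. This is not the hybrid NMF algorithm the abstract refers to: it uses the standard multiplicative update rules of Lee and Seung as a stand-in. The function names, the toy vocabulary, and the counts are all hypothetical and chosen only for illustration.

```python
import random

def matmul(A, B):
    """Multiply two dense matrices stored as lists of rows."""
    cols = list(zip(*B))
    return [[sum(a * b for a, b in zip(row, col)) for col in cols] for row in A]

def transpose(A):
    return [list(col) for col in zip(*A)]

def frobenius_error(A, W, H):
    """Frobenius norm of A - W H."""
    WH = matmul(W, H)
    return sum((a - b) ** 2 for ra, rb in zip(A, WH) for a, b in zip(ra, rb)) ** 0.5

def nmf(A, k, iterations=200, seed=0, eps=1e-9):
    """Approximate non-negative A (m x n) by W (m x k) times H (k x n)."""
    rng = random.Random(seed)
    m, n = len(A), len(A[0])
    # Positive random start; multiplicative updates keep every entry non-negative.
    W = [[rng.random() + eps for _ in range(k)] for _ in range(m)]
    H = [[rng.random() + eps for _ in range(n)] for _ in range(k)]
    for _ in range(iterations):
        # H <- H * (W^T A) / (W^T W H), element-wise
        Wt = transpose(W)
        num = matmul(Wt, A)
        den = matmul(matmul(Wt, W), H)
        H = [[H[i][j] * num[i][j] / (den[i][j] + eps) for j in range(n)]
             for i in range(k)]
        # W <- W * (A H^T) / (W H H^T), element-wise
        Ht = transpose(H)
        num = matmul(A, Ht)
        den = matmul(W, matmul(H, Ht))
        W = [[W[i][j] * num[i][j] / (den[i][j] + eps) for j in range(k)]
             for i in range(m)]
    return W, H

# Toy term-by-document matrix: rows are terms, columns are documents.
# The counts are made up; documents 0-2 and 3-5 use disjoint vocabularies.
terms = ["matrix", "factor", "rank", "goal", "team", "match"]
A = [
    [3, 2, 4, 0, 0, 0],
    [2, 3, 1, 0, 0, 0],
    [1, 2, 2, 0, 0, 0],
    [0, 0, 0, 2, 3, 1],
    [0, 0, 0, 3, 1, 2],
    [0, 0, 0, 1, 2, 3],
]

W, H = nmf(A, k=2)

# Semantic features: the three highest-weighted terms of each basis vector (column of W).
top_terms = [
    {terms[i] for i in sorted(range(len(terms)), key=lambda i: -W[i][r])[:3]}
    for r in range(2)
]

# Document clusters: assign each document to the basis vector with the largest weight in H.
clusters = [max(range(2), key=lambda r: H[r][j]) for j in range(len(A[0]))]
```

Because both factors stay non-negative, each document is rebuilt only by adding basis vectors together, which is what makes the columns of W readable as topics. The choice of k = 2 here simply matches the two vocabularies built into the toy data.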