Text mining using non-negative matrix factorizations

被引:0
|
作者
Pauca, VP [1 ]
Shahnaz, F [1 ]
Berry, MW [1 ]
Plemmons, RJ [1 ]
机构
[1] Wake Forest Univ, Dept Comp Sci, Winston Salem, NC 27109 USA
关键词
text mining; non-negative matrix factorization; clustering; dimension reduction; semantic feature identification;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This study involves a methodology for the automatic identification of semantic features and document clusters in a heterogeneous text collection. The methodology is based upon encoding the data using low rank nonnegative matrix factorization algorithms to preserve natural data non-negativity and thus avoid subtractive basis vector and encoding interactions present in techniques such as principal component analysis. Some existing non-negative matrix factorization techniques are reviewed and some new ones are proposed. Numerical experiments are reported on the use of a hybrid NMF algorithm to produce a parts-based approximation of a sparse term-by-document matrix. The resulting basis vectors and matrix projection can be used to identify underlying semantic features (topics) and document clusters of the corresponding text collection.
引用
收藏
页码:452 / 456
页数:5
相关论文
共 50 条
  • [1] VOLUME REGULARIZED NON-NEGATIVE MATRIX FACTORIZATIONS
    Ang, Andersen M. S.
    Gillis, Nicolas
    2018 9TH WORKSHOP ON HYPERSPECTRAL IMAGE AND SIGNAL PROCESSING: EVOLUTION IN REMOTE SENSING (WHISPERS), 2018,
  • [2] Credit Risk Analysis Using Sparse Non-negative Matrix Factorizations
    Sun, Hao
    Chen, Zhiqian
    Chen, James
    2015 2ND INTERNATIONAL CONFERENCE ON INFORMATION SCIENCE AND CONTROL ENGINEERING ICISCE 2015, 2015, : 181 - 184
  • [3] Fast Non-Negative Matrix Factorizations for Face Recognition
    Chen, Wen-Sheng
    Li, Yugao
    Pan, Binbin
    Xu, Chen
    INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2018, 32 (04)
  • [4] Subtractive clustering for seeding non-negative matrix factorizations
    Casalino, Gabriella
    Del Buono, Nicoletta
    Mencar, Corrado
    INFORMATION SCIENCES, 2014, 257 : 369 - 387
  • [5] Non-Negative Matrix Factorizations for Multiplex Network Analysis
    Gligorijevic, Vladimir
    Panagakis, Yannis
    Zafeiriou, Stefanos
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2019, 41 (04) : 928 - 940
  • [6] Non-negative matrix factorization based text mining: Feature extraction and classification
    Barman, P. C.
    Iqbal, Nadeem
    Lee, Soo-Young
    NEURAL INFORMATION PROCESSING, PT 2, PROCEEDINGS, 2006, 4233 : 703 - 712
  • [7] Non-negative matrix factorizations of spontaneous electroencephalographic signals for classification
    Liu Mingyu
    Wang Jue
    Zheng Chongxun
    2005 27TH ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY, VOLS 1-7, 2005, : 2790 - 2793
  • [8] Improving non-negative matrix factorizations through structured initialization
    Wild, S
    Curry, J
    Dougherty, A
    PATTERN RECOGNITION, 2004, 37 (11) : 2217 - 2232
  • [9] Robust image hashing via non-negative matrix factorizations
    Monga, Vishal
    Mihcak, M. Kivanc
    2006 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-13, 2006, : 1473 - 1476
  • [10] A penalty function for computing orthogonal non-negative matrix factorizations
    Del Buono, Nicoletta
    2009 9TH INTERNATIONAL CONFERENCE ON INTELLIGENT SYSTEMS DESIGN AND APPLICATIONS, 2009, : 1001 - 1005