Text mining using non-negative matrix factorizations

被引:0
|
作者
Pauca, VP [1 ]
Shahnaz, F [1 ]
Berry, MW [1 ]
Plemmons, RJ [1 ]
机构
[1] Wake Forest Univ, Dept Comp Sci, Winston Salem, NC 27109 USA
来源
PROCEEDINGS OF THE FOURTH SIAM INTERNATIONAL CONFERENCE ON DATA MINING | 2004年
关键词
text mining; non-negative matrix factorization; clustering; dimension reduction; semantic feature identification;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This study involves a methodology for the automatic identification of semantic features and document clusters in a heterogeneous text collection. The methodology is based upon encoding the data using low rank nonnegative matrix factorization algorithms to preserve natural data non-negativity and thus avoid subtractive basis vector and encoding interactions present in techniques such as principal component analysis. Some existing non-negative matrix factorization techniques are reviewed and some new ones are proposed. Numerical experiments are reported on the use of a hybrid NMF algorithm to produce a parts-based approximation of a sparse term-by-document matrix. The resulting basis vectors and matrix projection can be used to identify underlying semantic features (topics) and document clusters of the corresponding text collection.
引用
收藏
页码:452 / 456
页数:5
相关论文
共 50 条
  • [31] Email surveillance using non-negative matrix factorization
    Berry M.W.
    Browne M.
    Computational & Mathematical Organization Theory, 2005, 11 (3): : 249 - 264
  • [32] Collaborative filtering using non-negative matrix factorisation
    Aghdam, Mehdi Hosseinzadeh
    Analoui, Morteza
    Kabiri, Peyman
    JOURNAL OF INFORMATION SCIENCE, 2017, 43 (04) : 567 - 579
  • [33] Pitch Estimation Using Non-negative Matrix Factorization
    Burt, Ryan
    Cinar, Goktug T.
    Principe, Jose C.
    PROCEEDINGS OF THE 2014 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2014, : 2058 - 2062
  • [34] Tumor Classification Using Non-negative Matrix Factorization
    Zhang, Ping
    Zheng, Chun-Hou
    Li, Bo
    Wen, Chang-Gang
    ADVANCED INTELLIGENT COMPUTING THEORIES AND APPLICATIONS, PROCEEDINGS: WITH ASPECTS OF CONTEMPORARY INTELLIGENT COMPUTING TECHNIQUES, 2008, 15 : 236 - +
  • [35] On the Effectiveness of Non-negative Matrix Factorization for Text Open-Set Recognition
    Impedovo, Angelo
    Rizzo, Giuseppe
    MACHINE LEARNING AND PRINCIPLES AND PRACTICE OF KNOWLEDGE DISCOVERY IN DATABASES, ECML PKDD 2023, PT III, 2025, 2135 : 541 - 552
  • [36] Density matrix reconstruction using non-negative matrix product states
    Han, Donghong
    Guo, Chu
    Wang, Xiaoting
    PHYSICAL REVIEW A, 2022, 106 (04)
  • [37] Mining ratio rules via principal sparse non-negative matrix factorization
    Hu, CY
    Zhang, BY
    Yan, SC
    Yang, Q
    Yan, J
    Chen, Z
    Ma, WY
    FOURTH IEEE INTERNATIONAL CONFERENCE ON DATA MINING, PROCEEDINGS, 2004, : 407 - 410
  • [38] Non-negative matrix factorization on kernels
    Zhang, Daoqiang
    Zhou, Zhi-Hua
    Chen, Songcan
    PRICAI 2006: TRENDS IN ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 2006, 4099 : 404 - 412
  • [39] Dropout non-negative matrix factorization
    Zhicheng He
    Jie Liu
    Caihua Liu
    Yuan Wang
    Airu Yin
    Yalou Huang
    Knowledge and Information Systems, 2019, 60 : 781 - 806
  • [40] Detecting Insider Attacks Using Non-negative Matrix Factorization
    Platos, Jan
    Snasel, Vaclav
    Kromer, Pavel
    Abraham, Ajith
    FIFTH INTERNATIONAL CONFERENCE ON INFORMATION ASSURANCE AND SECURITY, VOL 1, PROCEEDINGS, 2009, : 693 - +