SPARSE TOPIC MODEL FOR TEXT CLASSIFICATION

被引:0
|
作者
Liu, Tao [1 ]
机构
[1] Renmin Univ China, Sch Informat, Beijing 100872, Peoples R China
关键词
Text classification; Topic model; Sparse coding;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper addresses a new text classification method: Sparse Topic Model, which represents documents by the sparse coding of topics. Topics contain more semantic information than words, so it's more effective for feature representation of documents. Topics are extracted from documents by LDA in an unsupervised way. Based on these topics, sparse coding is applied to discover more high-level representation. We compare the Sparse Topic Model with the traditional methods, such as SVM, and the experimental result show that the proposed method achieves better performance, especially when the number of training examples is limited. The effect of topic number and word number per topic on the performance is also investigated. Due to the unsupervised characteristic of Sparse Topic Model, it's very useful for real application.
引用
收藏
页码:1916 / 1920
页数:5
相关论文
共 50 条
  • [1] Neural variational sparse topic model for sparse explainable text representation
    Xie, Qianqian
    Tiwari, Prayag
    Gupta, Deepak
    Huang, Jimin
    Peng, Min
    [J]. INFORMATION PROCESSING & MANAGEMENT, 2021, 58 (05)
  • [2] News Text Classification Model Based on Topic Model
    Li, Zhenzhong
    Shang, Wenqian
    Yan, Menghan
    [J]. 2016 IEEE/ACIS 15TH INTERNATIONAL CONFERENCE ON COMPUTER AND INFORMATION SCIENCE (ICIS), 2016, : 1197 - 1201
  • [3] A discriminative and sparse topic model for image classification and annotation
    Yang, Liu
    Jing, Liping
    Ng, Michael K.
    Yu, Jian
    [J]. IMAGE AND VISION COMPUTING, 2016, 51 : 22 - 35
  • [4] SHORT TEXT CLASSIFICATION BASED ON LDA TOPIC MODEL
    Chen, Qiuxing
    Yao, Lixiu
    Yang, Jie
    [J]. PROCEEDINGS OF 2016 INTERNATIONAL CONFERENCE ON AUDIO, LANGUAGE AND IMAGE PROCESSING (ICALIP), 2016, : 749 - 753
  • [5] Classification of Text Documents Based on a Probabilistic Topic Model
    S. N. Karpovich
    A. V. Smirnov
    N. N. Teslya
    [J]. Scientific and Technical Information Processing, 2019, 46 : 314 - 320
  • [6] Classification of Text Documents Based on a Probabilistic Topic Model
    Karpovich, S. N.
    Smirnov, A. V.
    Teslya, N. N.
    [J]. SCIENTIFIC AND TECHNICAL INFORMATION PROCESSING, 2019, 46 (05) : 314 - 320
  • [7] Scene Classification Based on the Fully Sparse Semantic Topic Model
    Zhu, Qiqi
    Zhong, Yanfei
    Zhang, Liangpei
    Li, Deren
    [J]. IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2017, 55 (10): : 5525 - 5538
  • [8] A Hybrid Approach for Sparse Data Classification Based on Topic Model
    Wang, Guangjing
    Zhang, Jie
    Yang, Xiaobin
    Li, Li
    [J]. WEB-AGE INFORMATION MANAGEMENT, 2016, 9998 : 17 - 28
  • [9] Text Classification of Network Pyramid Scheme based on Topic Model
    Mu, Pengyu
    He, Jingsha
    Zhu, Nafei
    [J]. NLPIR 2019: 2019 3RD INTERNATIONAL CONFERENCE ON NATURAL LANGUAGE PROCESSING AND INFORMATION RETRIEVAL, 2019, : 15 - 19
  • [10] Topic document model approach for naive Bayes text classification
    Kim, SB
    Rim, HC
    Kim, JD
    [J]. IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2005, E88D (05): : 1091 - 1094