SPARSE TOPIC MODEL FOR TEXT CLASSIFICATION

Citations: 0
Authors
Liu, Tao [1]
Affiliation
[1] Renmin Univ China, Sch Informat, Beijing 100872, Peoples R China
Keywords
Text classification; Topic model; Sparse coding
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
This paper presents a new text classification method, the Sparse Topic Model, which represents documents by sparse coding over topics. Topics carry more semantic information than individual words, so they are more effective features for representing documents. Topics are extracted from the documents by LDA in an unsupervised way, and sparse coding is then applied to these topics to discover a higher-level representation. We compare the Sparse Topic Model with traditional methods such as SVM, and the experimental results show that the proposed method achieves better performance, especially when the number of training examples is limited. The effect of the number of topics and the number of words per topic on performance is also investigated. Because the Sparse Topic Model learns its representation without supervision, it is well suited to real applications.
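The sparse-coding step named in the abstract can be illustrated with a minimal sketch. This is not the paper's implementation: the record does not give the objective or the solver, so the sketch assumes a standard L1-regularised least-squares objective solved with ISTA (iterative soft-thresholding), and it uses three hand-written toy topic vectors in place of topics learned by LDA. The names `soft_threshold`, `sparse_code`, `topics`, and `doc` are all hypothetical.

```python
# Hypothetical sketch: sparse-code one document over a fixed set of topic vectors.
# Assumption: the objective is 0.5 * ||doc - sum_k a[k] * topics[k]||^2 + lam * ||a||_1,
# solved with ISTA. The topics are toy values, not LDA output.


def soft_threshold(value, threshold):
    """Shrink value toward zero by threshold (proximal operator of the L1 norm)."""
    if value > threshold:
        return value - threshold
    if value < -threshold:
        return value + threshold
    return 0.0


def sparse_code(doc, topics, lam=0.12, step=1.0, iters=100):
    """Return one coefficient per topic; small coefficients are driven to exactly zero.

    step must not exceed 1 / (largest eigenvalue of the topic Gram matrix);
    for the toy topics below that bound is 2.0.
    """
    n_topics, n_words = len(topics), len(doc)
    codes = [0.0] * n_topics
    for _ in range(iters):
        # Residual of the current reconstruction, one entry per vocabulary word.
        residual = [
            sum(codes[k] * topics[k][w] for k in range(n_topics)) - doc[w]
            for w in range(n_words)
        ]
        # Gradient of the squared-error term with respect to each coefficient.
        grads = [
            sum(topics[k][w] * residual[w] for w in range(n_words))
            for k in range(n_topics)
        ]
        # Gradient step followed by soft-thresholding (the ISTA update).
        codes = [
            soft_threshold(codes[k] - step * grads[k], step * lam)
            for k in range(n_topics)
        ]
    return codes


# Toy vocabulary of six words; each topic puts its mass on two of them.
topics = [
    [0.5, 0.5, 0.0, 0.0, 0.0, 0.0],
    [0.0, 0.0, 0.5, 0.5, 0.0, 0.0],
    [0.0, 0.0, 0.0, 0.0, 0.5, 0.5],
]
# Normalised term frequencies of a document that is mostly about topic 0.
doc = [0.4, 0.4, 0.1, 0.1, 0.0, 0.0]

if __name__ == "__main__":
    print(sparse_code(doc, topics))
```

With these toy values the weak contribution of the second topic falls below the threshold and its coefficient becomes exactly zero, so the document is described by a single active topic. The resulting coefficient vector is what a downstream classifier would consume as features in a pipeline of this kind.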
Pages: 1916-1920
Page count: 5