Text Categorization Based on Topic Model

被引:0
|
作者
School of Computer Science and Technology, China University of Mining and Technology, Jiangsu Province, Xuzhou [1 ]
221116, China
不详 [2 ]
100081, China
机构
[1] China University of Mining and Technology,School of Computer Science and Technology
[2] Beijing Institute of Technology,School of Computer Science and Technology
来源
Int. J. Comput. Intell. Syst. | 2009年 / 4卷 / 398-409期
关键词
Computational linguistics - Text processing;
D O I
10.2991/ijcis.2009.2.4.8
中图分类号
学科分类号
摘要
In the text literature, many topic models were proposed to represent documents and words as topics or latent topics in order to process text effectively and accurately. In this paper, we propose LDACLM or Latent Dirichlet Allocation Category Language Model for text categorization and estimate parameters of models by variational inference. As a variant of Latent Dirichlet Allocation Model, LDACLM regards documents of category as Language Model and uses variational parameters to estimate maximum a posteriori of terms. In general, experiments show LDACLM model is effective and outperform Naïve Bayes with Laplace smoothing and Rocchio algorithm but little inferior to SVM for text categorization. © 2009, the authors.
引用
收藏
页码:398 / 409
页数:11
相关论文
共 50 条
  • [1] Text Categorization Based on Topic Model
    Zhou, Shibin
    Li, Kan
    Liu, Yushu
    [J]. INTERNATIONAL JOURNAL OF COMPUTATIONAL INTELLIGENCE SYSTEMS, 2009, 2 (04) : 398 - 409
  • [2] Text categorization based on topic model
    Zhou, Shibin
    Li, Kan
    Liu, Yushu
    [J]. ROUGH SETS AND KNOWLEDGE TECHNOLOGY, 2008, 5009 : 572 - 579
  • [3] A Robust Framework for Short Text Categorization based on Topic Model and Integrated Classifier
    Wang, Peng
    Zhang, Heng
    Wu, Yu-Fang
    Xu, Bo
    Hao, Hong-Wei
    [J]. PROCEEDINGS OF THE 2014 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2014, : 3534 - 3539
  • [4] SLDA-TC: A Novel Text Categorization Approach Based on Supervised Topic Model
    Tang, Huan-Ling
    Dou, Quan-Sheng
    Yu, Li-Ping
    Song, Ying-Jie
    Lu, Ming-Yu
    [J]. Tien Tzu Hsueh Pao/Acta Electronica Sinica, 2019, 47 (06): : 1300 - 1308
  • [5] Biomedical Text Categorization Based on Ensemble Pruning and Optimized Topic Modelling
    Onan, Aytug
    [J]. COMPUTATIONAL AND MATHEMATICAL METHODS IN MEDICINE, 2018, 2018
  • [6] Discriminative Topic Sparse Representation for Text Categorization
    Zheng, Wenbin
    Liu, Yanqiu
    Lu, Huijuan
    Tang, Hong
    [J]. 2017 10TH INTERNATIONAL SYMPOSIUM ON COMPUTATIONAL INTELLIGENCE AND DESIGN (ISCID), VOL. 1, 2017, : 454 - 457
  • [7] Image categorization by a classifier based on probabilistic topic model
    Yamaguchi, Takuma
    Maruyama, Minoru
    [J]. 19TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOLS 1-6, 2008, : 1263 - 1266
  • [8] News Text Classification Model Based on Topic Model
    Li, Zhenzhong
    Shang, Wenqian
    Yan, Menghan
    [J]. 2016 IEEE/ACIS 15TH INTERNATIONAL CONFERENCE ON COMPUTER AND INFORMATION SCIENCE (ICIS), 2016, : 1197 - 1201
  • [9] A text categorization model based on Hidden Markov models
    Yi, K
    Beheshti, J
    [J]. CANADIAN JOURNAL OF INFORMATION AND LIBRARY SCIENCE-REVUE CANADIENNE DES SCIENCES DE L INFORMATION ET DE BIBLIOTHECONOMIE, 2003, 27 (03): : 149 - 149
  • [10] Research of Text Categorization Model based on Random Forests
    Xue, Dashen
    Li, Fengxin
    [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND COMMUNICATION TECHNOLOGY CICT 2015, 2015, : 173 - 176