An Improved LDA Algorithm for Text Classification

被引:0
|
作者
Zhao, Dexin [1 ]
He, Jinqun [1 ]
Liu, Jin [2 ]
机构
[1] Tianjin Univ Technol, Tianjin Key Lab Intelligent Comp & Novel Software, Tianjin 300384, Peoples R China
[2] Tianjin Keyilong Decorat Engn Co Ltd, Tianjin 300202, Peoples R China
关键词
topic model; LDA; text classification;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Latent Dirichlet Allocation is a classic topic model which can extract latent topic from large data corpus. This model assumes that if a document is relevant to a topic, then all tokens in the document are relevant to that topic. In this paper, we present an algorithm called gLDA for topic text classification by adding topic-category distribution parameter to LDA, which can make the document generated from the most relevant category. Gibbs sampling is employed to conduct approximate inference, and experiment results in two datasets show the effectiveness of this method.
引用
收藏
页码:216 / +
页数:2
相关论文
共 50 条
  • [31] Text classification based on feature selection and LDA model
    Zheng, C. (csahu@126.com), 1600, Binary Information Press, P.O. Box 162, Bethel, CT 06801-0162, United States (09):
  • [32] A Method of Text Categorization Based on Genetic Algorithm and LDA
    Chen, Lei
    Li, Jun
    Zhang, Li
    PROCEEDINGS OF THE 36TH CHINESE CONTROL CONFERENCE (CCC 2017), 2017, : 10866 - 10870
  • [33] Research of Text Classification Based on Improved TF-IDF Algorithm
    Liu, Cai-zhi
    Sheng, Yan-xiu
    Wei, Zhi-qiang
    Yang, Yong-Quan
    2018 IEEE INTERNATIONAL CONFERENCE OF INTELLIGENT ROBOTICS AND CONTROL ENGINEERING (IRCE), 2018, : 218 - 222
  • [34] A Clustering-Based KNN Improved Algorithm CLKNN for Text Classification
    Zhou, Lijuan
    Wang, Linshuang
    Ge, Xuebin
    Shi, Qian
    2010 2ND INTERNATIONAL ASIA CONFERENCE ON INFORMATICS IN CONTROL, AUTOMATION AND ROBOTICS (CAR 2010), VOL 3, 2010, : 212 - 215
  • [35] An Improved Naive Bayes Text Classification Algorithm In Chinese Information Processing
    Yuan, Lingling
    THIRD INTERNATIONAL SYMPOSIUM ON COMPUTER SCIENCE AND COMPUTATIONAL TECHNOLOGY (ISCSCT 2010), 2010, : 267 - 269
  • [36] Feature selection algorithm for text classification based on improved mutual information
    丛帅
    张积宾
    徐志明
    王宇颖
    Journal of Harbin Institute of Technology(New series), 2011, (03) : 144 - 148
  • [37] Enhanced text classification through an improved discrete laying chicken algorithm
    Daneshfar, Fatemeh
    Aghajani, Mohammad Javad
    EXPERT SYSTEMS, 2024, 41 (08)
  • [38] Research on Web Text Classification Algorithm Based on Improved CNN and SVM
    Wang, Zhiquan
    Qu, Zhiyi
    2017 17TH IEEE INTERNATIONAL CONFERENCE ON COMMUNICATION TECHNOLOGY (ICCT 2017), 2017, : 1958 - 1961
  • [39] An improved web text classification algorithm based on SVM-KNN
    Cao, Jianfang
    Chen, Junjie
    ADVANCES IN MECHATRONICS AND CONTROL ENGINEERING, PTS 1-3, 2013, 278-280 : 1305 - 1308
  • [40] Regularized Least Squares LDA and Its Application in Text Classification
    Liu, ZunXiong
    Zeng, LiHui
    PROCEEDING OF THE 10TH INTERNATIONAL CONFERENCE ON INTELLIGENT TECHNOLOGIES, 2009, : 206 - 210