An Improved LDA Algorithm for Text Classification

被引:0
|
作者
Zhao, Dexin [1 ]
He, Jinqun [1 ]
Liu, Jin [2 ]
机构
[1] Tianjin Univ Technol, Tianjin Key Lab Intelligent Comp & Novel Software, Tianjin 300384, Peoples R China
[2] Tianjin Keyilong Decorat Engn Co Ltd, Tianjin 300202, Peoples R China
关键词
topic model; LDA; text classification;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Latent Dirichlet Allocation is a classic topic model which can extract latent topic from large data corpus. This model assumes that if a document is relevant to a topic, then all tokens in the document are relevant to that topic. In this paper, we present an algorithm called gLDA for topic text classification by adding topic-category distribution parameter to LDA, which can make the document generated from the most relevant category. Gibbs sampling is employed to conduct approximate inference, and experiment results in two datasets show the effectiveness of this method.
引用
收藏
页码:216 / +
页数:2
相关论文
共 50 条
  • [21] Improved Feature Weight Algorithm and Its Application to Text Classification
    Shang, Songtao
    Shi, Minyong
    Shang, Wenqian
    Hong, Zhiguo
    MATHEMATICAL PROBLEMS IN ENGINEERING, 2016, 2016
  • [22] Application of an Improved Convolutional Neural Network Algorithm in Text Classification
    Peng, Jing
    Huo, Shuquan
    JOURNAL OF WEB ENGINEERING, 2024, 23 (03): : 315 - 340
  • [23] Text Classification Research Based on Improved SoftMax Regression Algorithm
    She, Xiangyang
    Zhu, Yinglong
    2018 11TH INTERNATIONAL SYMPOSIUM ON COMPUTATIONAL INTELLIGENCE AND DESIGN (ISCID), VOL 2, 2018, : 273 - 276
  • [24] An Improved LDA-Based ELM Classification for Intrusion Detection Algorithm in IoT Application
    Zheng, Dehua
    Hong, Zhen
    Wang, Ning
    Chen, Ping
    SENSORS, 2020, 20 (06)
  • [25] Classification of Chinese herbal medicine based on improved LDA algorithm using machine olfaction
    Luo, Dehan
    Shao, Yawen
    MEASUREMENT TECHNOLOGY AND ITS APPLICATION, PTS 1 AND 2, 2013, 239-240 : 1532 - 1536
  • [26] Text classification based on Labeled-LDA model
    Li, Wen-Bo
    Sun, Le
    Zhang, Da-Kun
    2008, Science Press (31):
  • [27] Hierarchical Text Classification based on LDA and Domain Ontology
    An, Wei
    Liu, Qihua
    INFORMATION TECHNOLOGY APPLICATIONS IN INDUSTRY II, PTS 1-4, 2013, 411-414 : 1112 - +
  • [28] Performance of Using LDA for Chinese News Text Classification
    Wu, Xiaojun
    Fang, Liying
    Wang, Pu
    Yu, Nan
    2015 IEEE 28TH CANADIAN CONFERENCE ON ELECTRICAL AND COMPUTER ENGINEERING (CCECE), 2015, : 1260 - 1264
  • [29] SHORT TEXT CLASSIFICATION BASED ON LDA TOPIC MODEL
    Chen, Qiuxing
    Yao, Lixiu
    Yang, Jie
    PROCEEDINGS OF 2016 INTERNATIONAL CONFERENCE ON AUDIO, LANGUAGE AND IMAGE PROCESSING (ICALIP), 2016, : 749 - 753
  • [30] SVD-LDA: A Combined Model for Text Classification
    Nguyen Cao
    Truong Hai
    Kim, Kyung-Im
    Park, Hyuk-Ro
    JOURNAL OF INFORMATION PROCESSING SYSTEMS, 2009, 5 (01): : 5 - 10