An Improved LDA Algorithm for Text Classification

Cited by: 0
Authors
Zhao, Dexin [1]
He, Jinqun [1]
Liu, Jin [2]
Affiliations
[1] Tianjin Univ Technol, Tianjin Key Lab Intelligent Comp & Novel Software, Tianjin 300384, Peoples R China
[2] Tianjin Keyilong Decorat Engn Co Ltd, Tianjin 300202, Peoples R China
Keywords
topic model; LDA; text classification
DOI
Not available
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Latent Dirichlet Allocation (LDA) is a classic topic model that can extract latent topics from a large corpus. The model assumes that if a document is relevant to a topic, then all tokens in the document are relevant to that topic. In this paper, we present an algorithm called gLDA for topic-based text classification. It adds a topic-category distribution parameter to LDA, so that each document is generated from its most relevant category. Gibbs sampling is employed for approximate inference, and experimental results on two datasets show the effectiveness of the method.
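The abstract names Gibbs sampling as the inference method but gives no update rule. As a rough illustration only, the sketch below shows collapsed Gibbs sampling for plain LDA; it is not the authors' gLDA, and it does not model the topic-category distribution parameter. The function name `lda_gibbs`, the hyperparameter values, and the toy corpus are all assumptions made for this example.

```python
import random


def lda_gibbs(docs, n_topics, vocab_size, alpha=0.1, beta=0.01,
              n_iters=50, seed=0):
    """Collapsed Gibbs sampling for plain LDA (hypothetical helper, not gLDA).

    docs: list of documents, each a list of integer word ids in [0, vocab_size).
    Returns the (doc_topic, topic_word) count tables after n_iters sweeps.
    """
    rng = random.Random(seed)
    doc_topic = [[0] * n_topics for _ in docs]                # tokens in doc d with topic k
    topic_word = [[0] * vocab_size for _ in range(n_topics)]  # times word w has topic k
    topic_total = [0] * n_topics                              # tokens assigned to topic k
    assignments = []

    # Start from a random topic assignment for every token.
    for d, doc in enumerate(docs):
        z_doc = []
        for w in doc:
            k = rng.randrange(n_topics)
            z_doc.append(k)
            doc_topic[d][k] += 1
            topic_word[k][w] += 1
            topic_total[k] += 1
        assignments.append(z_doc)

    for _ in range(n_iters):
        for d, doc in enumerate(docs):
            for i, w in enumerate(doc):
                # Remove the current token from all counts.
                k = assignments[d][i]
                doc_topic[d][k] -= 1
                topic_word[k][w] -= 1
                topic_total[k] -= 1
                # Unnormalised conditional for each topic t:
                # (n_dt + alpha) * (n_tw + beta) / (n_t + V * beta)
                weights = [
                    (doc_topic[d][t] + alpha)
                    * (topic_word[t][w] + beta)
                    / (topic_total[t] + vocab_size * beta)
                    for t in range(n_topics)
                ]
                # Resample the topic and add the token back.
                k = rng.choices(range(n_topics), weights=weights)[0]
                assignments[d][i] = k
                doc_topic[d][k] += 1
                topic_word[k][w] += 1
                topic_total[k] += 1
    return doc_topic, topic_word


# Toy corpus (made up): word ids 0-1 and 2-3 tend to co-occur.
docs = [[0, 1, 0, 1], [2, 3, 2, 3], [0, 1, 2, 3]]
doc_topic, topic_word = lda_gibbs(docs, n_topics=2, vocab_size=4)
```

Normalising each row of `doc_topic` (after adding `alpha`) gives a per-document topic mixture, which could then be fed to any standard classifier as a feature vector. How gLDA ties topics to categories is not specified in the abstract, so that step is left out here.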
Pages: 216 / +
Page count: 2
Related papers
50 in total
  • [41] An Improved Weighted KNN Algorithm About Text Classification Based on Spark Framework
    Yang, Tianming
    Du, Shaobo
    2022 IEEE 10TH INTERNATIONAL CONFERENCE ON INFORMATION, COMMUNICATION AND NETWORKS (ICICN 2022), 2022, : 655 - 661
  • [42] Automatic text classification algorithm based on Gauss improved convolutional neural network
    Du, Jian-hai
    JOURNAL OF COMPUTATIONAL SCIENCE, 2017, 21 : 195 - 200
  • [43] Chinese text classification study base on the improved dnn-svm algorithm
    Jiang, M. Y.
    Wang, X. Y.
    Zhang, Z. F.
    Wang, Q. H.
    Jiang, J. Q.
    Pei, Z. L.
    BASIC & CLINICAL PHARMACOLOGY & TOXICOLOGY, 2018, 123 : 59 - 60
  • [44] An improved sample mean KNN algorithm based on LDA
    Xue, Hongye
    Wang, Peiwen
    2019 11TH INTERNATIONAL CONFERENCE ON INTELLIGENT HUMAN-MACHINE SYSTEMS AND CYBERNETICS (IHMSC 2019), VOL 1, 2019, : 266 - 270
  • [45] An Improved Dynamic Collaborative Filtering Algorithm Based on LDA
    Meng Di-Fei
    Liu Na
    Li Ming-Xia
    Su Hao-Long
    IEEE ACCESS, 2021, 9 : 122568 - 122577
  • [46] An Improved Dynamic Collaborative Filtering Algorithm Based on LDA
    DI-Fei, Meng
    Na, Liu
    Ming-Xia, Li
    Hao-Long, Su
    IEEE Access, 2021, 9 : 122568 - 122577
  • [47] SAW Classification Algorithm for Chinese Text Classification
    Guo, Xiaoli
    Sun, Huiyu
    Zhou, Tiehua
    Wang, Ling
    Qu, Zhaoyang
    Zang, Jiannan
    SUSTAINABILITY, 2015, 7 (03) : 2338 - 2352
  • [48] Short Text Classification Based on Hierarchical Heterogeneous Graph and LDA Fusion
    Xu, Xinlan
    Li, Bo
    Shen, Yuhao
    Luo, Bing
    Zhang, Chao
    Hao, Fei
    ELECTRONICS, 2023, 12 (12)
  • [49] Improved Particle Swarm Optimization approach for Classification by using LDA
    Nema, S.
    Thakur, S. S.
    PROCEEDINGS OF 2015 IEEE 9TH INTERNATIONAL CONFERENCE ON INTELLIGENT SYSTEMS AND CONTROL (ISCO), 2015,
  • [50] Filtering Spam Text Messages by Using Twitter-LDA Algorithm
    Gunawan, Dani
    Rahmat, Romi Fadillah
    Putra, Arsandi
    Pasha, Muhammad Fermi
    2018 IEEE INTERNATIONAL CONFERENCE ON COMMUNICATION, NETWORKS AND SATELLITE (COMNETSAT), 2018, : 1 - 6