A text classification model constructed by Latent Dirichlet Allocation and Deep Learning

被引:0
|
作者
Liu, Yu [1 ]
Jin, Zhengping [1 ]
机构
[1] Beijing Univ Posts & Telecommun, State Key Lab Networking & Switching Technol, Beijing 100876, Peoples R China
关键词
text classification; latent Dirichlet allocation; deep learning; Gibbs sampling;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
In this paper, we proposed a mixed model of text classification constructed by latent dirichlet allocation and deep learning. The model present that a text will be represent as a vector computing by latent dirichlet allocation algorithm, and this vector is probabilistic vector of corresponding topic words space. Then we input these topic vectors into a deep learning framework for computing nonlinear relationship of each vector. Finally, we constructed a text classification system. The proposed model achieves a higher accuracy when compared with other current popular algorithms, such as SVM, KNN and TFIDF.
引用
收藏
页码:2501 / 2504
页数:4
相关论文
共 50 条
  • [1] Initializing Deep Learning Based on Latent Dirichlet Allocation for Document Classification
    Jeon, Hyung-Bae
    Lee, Soo-Young
    [J]. NEURAL INFORMATION PROCESSING, ICONIP 2016, PT III, 2016, 9949 : 634 - 641
  • [2] A Latent Dirichlet Allocation and Fuzzy Clustering Based Machine Learning Model for Text Thesaurus
    Luo, J.
    Yu, D.
    Dai, Z.
    [J]. INTERNATIONAL JOURNAL OF COMPUTERS COMMUNICATIONS & CONTROL, 2020, 15 (02)
  • [3] Rail transit fault text classification based on the latent dirichlet allocation
    Li, R.
    Su, S.
    Wang, G.
    Qu, J.
    Cao, Y.
    [J]. 2021 IEEE INTELLIGENT TRANSPORTATION SYSTEMS CONFERENCE (ITSC), 2021, : 1359 - 1364
  • [4] Latent Dirichlet Allocation complement in the vector space model for Multi-Label Text Classification
    Carrera-Trejo, Victor
    Sidorov, Grigori
    Miranda-Jimenez, Sabino
    Moreno Ibarra, Marco
    Cadena Martinez, Rodrigo
    [J]. INTERNATIONAL JOURNAL OF COMBINATORIAL OPTIMIZATION PROBLEMS AND INFORMATICS, 2015, 6 (01): : 7 - 19
  • [5] A comparison of the performance of latent Dirichlet allocation and the Dirichlet multinomial mixture model on short text
    Mazarura, Jocelyn
    de Waal, Alta
    [J]. 2016 PATTERN RECOGNITION ASSOCIATION OF SOUTH AFRICA AND ROBOTICS AND MECHATRONICS INTERNATIONAL CONFERENCE (PRASA-ROBMECH), 2016,
  • [6] A New Latent generalized Dirichlet Allocation Model for Image Classification
    Ihou, Koffi Eddy
    Bouguila, Nizar
    [J]. PROCEEDINGS OF THE 2017 SEVENTH INTERNATIONAL CONFERENCE ON IMAGE PROCESSING THEORY, TOOLS AND APPLICATIONS (IPTA 2017), 2017,
  • [7] BiModal Latent Dirichlet Allocation for Text and Image
    Liao, Xiaofeng
    Jiang, Qingshan
    Zhang, Wei
    Zhang, Kai
    [J]. 2014 4TH IEEE INTERNATIONAL CONFERENCE ON INFORMATION SCIENCE AND TECHNOLOGY (ICIST), 2014, : 736 - 739
  • [8] Evaluation of text semantic features using latent dirichlet allocation model
    Zhou, Chunjie
    Li, Nao
    Zhang, Chi
    Yang, Xiaoyu
    [J]. International Journal of Performability Engineering, 2020, 16 (06) : 968 - 978
  • [9] Using Latent Dirichlet Allocation to Improve Text Classification Performance of Support Vector Machine
    Chen, Yaw-Huei
    Li, Shu-Fong
    [J]. 2016 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION (CEC), 2016, : 1280 - 1286
  • [10] Latent Dirichlet Allocation Models for Image Classification
    Rasiwasia, Nikhil
    Vasconcelos, Nuno
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2013, 35 (11) : 2665 - 2679