Neural labeled LDA: a topic model for semi-supervised document classification

被引:0
|
作者
Wei Wang
Bing Guo
Yan Shen
Han Yang
Yaosen Chen
Xinhua Suo
机构
[1] Sichuan University,College of Computer Science
[2] Sobey Technology,Media Intelligence Laboratory
[3] Peng Cheng Laboratory,School of Computer Science
[4] Chengdu University of Information Technology,undefined
来源
Soft Computing | 2021年 / 25卷
关键词
Neural topic model; Semi-supervised learning; Document classification;
D O I
暂无
中图分类号
学科分类号
摘要
Recently, some statistical topic modeling approaches based on LDA have been applied in the field of supervised document classification, where the model generation procedure incorporates prior knowledge to improve the classification performance. However, these customizations of topic modeling are limited by the cumbersome derivation of a specific inference algorithm for each modification. In this paper, we propose a new supervised topic modeling approach for document classification problems, Neural Labeled LDA (NL-LDA), which builds on the VAE framework, and designs a special generative network to incorporate prior information. The proposed model can support semi-supervised learning based on the manifold assumption and low-density assumption. Meanwhile, NL-LDA has a consistent and concise inference method while semi-supervised learning and predicting. Quantitative experimental results demonstrate our model has outstanding performance on supervised document classification relative to the compared approaches, including traditional statistical and neural topic models. Specially, the proposed model can support both single-label and multi-label document classification. The proposed NL-LDA performs significantly well on semi-supervised classification, especially under a small amount of labeled data. Further comparisons with related works also indicate our model is competitive with state-of-the-art topic modeling approaches on semi-supervised classification.
引用
收藏
页码:14561 / 14571
页数:10
相关论文
共 50 条
  • [1] Neural labeled LDA: a topic model for semi-supervised document classification
    Wang, Wei
    Guo, Bing
    Shen, Yan
    Yang, Han
    Chen, Yaosen
    Suo, Xinhua
    [J]. SOFT COMPUTING, 2021, 25 (23) : 14561 - 14571
  • [2] Twin labeled LDA: a supervised topic model for document classification
    Wei Wang
    Bing Guo
    Yan Shen
    Han Yang
    Yaosen Chen
    Xinhua Suo
    [J]. Applied Intelligence, 2020, 50 : 4602 - 4615
  • [3] Twin labeled LDA: a supervised topic model for document classification
    Wang, Wei
    Guo, Bing
    Shen, Yan
    Yang, Han
    Chen, Yaosen
    Suo, Xinhua
    [J]. APPLIED INTELLIGENCE, 2020, 50 (12) : 4602 - 4615
  • [4] Semi-supervised document classification with a mislabeling error model
    Krithara, Anastasia
    Amini, Massih R.
    Renders, Jean-Michel
    Goutte, Cyril
    [J]. ADVANCES IN INFORMATION RETRIEVAL, 2008, 4956 : 370 - +
  • [5] Exploiting the Value of Class Labels in Topic Models for Semi-Supervised Document Classification
    Soleimani, Hossein
    Miller, David J.
    [J]. 2016 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2016, : 4025 - 4031
  • [6] A Hybrid Semi-supervised Topic Model
    Zhang, Yanning
    Wei, Wei
    [J]. INTELLIGENT SCIENCE AND INTELLIGENT DATA ENGINEERING, ISCIDE 2011, 2012, 7202 : 309 - 317
  • [7] Semi-supervised Multi-Label Topic Models for Document Classification and Sentence Labeling
    Soleimani, Hossein
    Miller, David J.
    [J]. CIKM'16: PROCEEDINGS OF THE 2016 ACM CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, 2016, : 105 - 114
  • [8] Semi-supervised topic classification for low resource languages
    Liu, Daben
    McVeety, Sam
    Prasad, Rohit
    Natarajan, Prem
    [J]. 2008 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-12, 2008, : 5093 - 5096
  • [9] Semi-supervised Document Clustering Based on Latent Dirichlet Allocation (LDA)
    秦永彬
    李解
    黄瑞章
    李晶
    [J]. Journal of Donghua University(English Edition), 2016, 33 (05) : 685 - 688
  • [10] A jointly distributed semi-supervised topic model
    Zhang, Yanning
    Wei, Wei
    [J]. NEUROCOMPUTING, 2014, 134 : 38 - 45