Neural labeled LDA: a topic model for semi-supervised document classification

被引:8
|
作者
Wang, Wei [1 ,2 ,3 ]
Guo, Bing [1 ]
Shen, Yan [4 ]
Yang, Han [2 ]
Chen, Yaosen [1 ]
Suo, Xinhua [1 ]
机构
[1] Sichuan Univ, Coll Comp Sci, Chengdu, Peoples R China
[2] Sobey Technol, Media Intelligence Lab, Chengdu, Peoples R China
[3] Peng Cheng Lab, Shenzhen, Peoples R China
[4] Chengdu Univ Informat Technol, Sch Comp Sci, Chengdu, Peoples R China
基金
中国国家自然科学基金;
关键词
Neural topic model; Semi-supervised learning; Document classification;
D O I
10.1007/s00500-021-06310-2
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Recently, some statistical topic modeling approaches based on LDA have been applied in the field of supervised document classification, where the model generation procedure incorporates prior knowledge to improve the classification performance. However, these customizations of topic modeling are limited by the cumbersome derivation of a specific inference algorithm for each modification. In this paper, we propose a new supervised topic modeling approach for document classification problems, Neural Labeled LDA (NL-LDA), which builds on the VAE framework, and designs a special generative network to incorporate prior information. The proposed model can support semi-supervised learning based on the manifold assumption and low-density assumption. Meanwhile, NL-LDA has a consistent and concise inference method while semi-supervised learning and predicting. Quantitative experimental results demonstrate our model has outstanding performance on supervised document classification relative to the compared approaches, including traditional statistical and neural topic models. Specially, the proposed model can support both single-label and multi-label document classification. The proposed NL-LDA performs significantly well on semi-supervised classification, especially under a small amount of labeled data. Further comparisons with related works also indicate our model is competitive with state-of-the-art topic modeling approaches on semi-supervised classification.
引用
收藏
页码:14561 / 14571
页数:11
相关论文
共 50 条
  • [1] Neural labeled LDA: a topic model for semi-supervised document classification
    Wei Wang
    Bing Guo
    Yan Shen
    Han Yang
    Yaosen Chen
    Xinhua Suo
    [J]. Soft Computing, 2021, 25 : 14561 - 14571
  • [2] Twin labeled LDA: a supervised topic model for document classification
    Wei Wang
    Bing Guo
    Yan Shen
    Han Yang
    Yaosen Chen
    Xinhua Suo
    [J]. Applied Intelligence, 2020, 50 : 4602 - 4615
  • [3] Twin labeled LDA: a supervised topic model for document classification
    Wang, Wei
    Guo, Bing
    Shen, Yan
    Yang, Han
    Chen, Yaosen
    Suo, Xinhua
    [J]. APPLIED INTELLIGENCE, 2020, 50 (12) : 4602 - 4615
  • [4] Semi-supervised document classification with a mislabeling error model
    Krithara, Anastasia
    Amini, Massih R.
    Renders, Jean-Michel
    Goutte, Cyril
    [J]. ADVANCES IN INFORMATION RETRIEVAL, 2008, 4956 : 370 - +
  • [5] Exploiting the Value of Class Labels in Topic Models for Semi-Supervised Document Classification
    Soleimani, Hossein
    Miller, David J.
    [J]. 2016 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2016, : 4025 - 4031
  • [6] A Hybrid Semi-supervised Topic Model
    Zhang, Yanning
    Wei, Wei
    [J]. INTELLIGENT SCIENCE AND INTELLIGENT DATA ENGINEERING, ISCIDE 2011, 2012, 7202 : 309 - 317
  • [7] Semi-supervised Multi-Label Topic Models for Document Classification and Sentence Labeling
    Soleimani, Hossein
    Miller, David J.
    [J]. CIKM'16: PROCEEDINGS OF THE 2016 ACM CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, 2016, : 105 - 114
  • [8] Semi-supervised topic classification for low resource languages
    Liu, Daben
    McVeety, Sam
    Prasad, Rohit
    Natarajan, Prem
    [J]. 2008 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-12, 2008, : 5093 - 5096
  • [9] Semi-supervised Document Clustering Based on Latent Dirichlet Allocation (LDA)
    秦永彬
    李解
    黄瑞章
    李晶
    [J]. Journal of Donghua University(English Edition), 2016, 33 (05) : 685 - 688
  • [10] A jointly distributed semi-supervised topic model
    Zhang, Yanning
    Wei, Wei
    [J]. NEUROCOMPUTING, 2014, 134 : 38 - 45