Neural labeled LDA: a topic model for semi-supervised document classification

被引:0
|
作者
Wei Wang
Bing Guo
Yan Shen
Han Yang
Yaosen Chen
Xinhua Suo
机构
[1] Sichuan University,College of Computer Science
[2] Sobey Technology,Media Intelligence Laboratory
[3] Peng Cheng Laboratory,School of Computer Science
[4] Chengdu University of Information Technology,undefined
来源
Soft Computing | 2021年 / 25卷
关键词
Neural topic model; Semi-supervised learning; Document classification;
D O I
暂无
中图分类号
学科分类号
摘要
Recently, some statistical topic modeling approaches based on LDA have been applied in the field of supervised document classification, where the model generation procedure incorporates prior knowledge to improve the classification performance. However, these customizations of topic modeling are limited by the cumbersome derivation of a specific inference algorithm for each modification. In this paper, we propose a new supervised topic modeling approach for document classification problems, Neural Labeled LDA (NL-LDA), which builds on the VAE framework, and designs a special generative network to incorporate prior information. The proposed model can support semi-supervised learning based on the manifold assumption and low-density assumption. Meanwhile, NL-LDA has a consistent and concise inference method while semi-supervised learning and predicting. Quantitative experimental results demonstrate our model has outstanding performance on supervised document classification relative to the compared approaches, including traditional statistical and neural topic models. Specially, the proposed model can support both single-label and multi-label document classification. The proposed NL-LDA performs significantly well on semi-supervised classification, especially under a small amount of labeled data. Further comparisons with related works also indicate our model is competitive with state-of-the-art topic modeling approaches on semi-supervised classification.
引用
收藏
页码:14561 / 14571
页数:10
相关论文
共 50 条
  • [41] Semi-supervised classification trees
    Levatic, Jurica
    Ceci, Michelangelo
    Kocev, Dragi
    Dzeroski, Saso
    [J]. JOURNAL OF INTELLIGENT INFORMATION SYSTEMS, 2017, 49 (03) : 461 - 486
  • [42] Semi-supervised classification of human actions based on Neural Networks
    Iosifidis, Alexandros
    Tefas, Anastasios
    Pitas, Ioannis
    [J]. 2014 22ND INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2014, : 1336 - 1341
  • [43] Bayesian Graph Convolutional Neural Networks for Semi-Supervised Classification
    Zhang, Yingxue
    Pal, Soumyasundar
    Coates, Mark
    Ustebay, Deniz
    [J]. THIRTY-THIRD AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FIRST INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / NINTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2019, : 5829 - 5836
  • [44] Watersheds for Semi-Supervised Classification
    Challa, Aditya
    Danda, Sravan
    Sagar, B. S. Daya
    Najman, Laurent
    [J]. IEEE SIGNAL PROCESSING LETTERS, 2019, 26 (05) : 720 - 724
  • [45] Semi-supervised classification trees
    Jurica Levatić
    Michelangelo Ceci
    Dragi Kocev
    Sašo Džeroski
    [J]. Journal of Intelligent Information Systems, 2017, 49 : 461 - 486
  • [46] A Semi-supervised Classification Using Gated Linear Model
    Ren, Yanni
    Li, Weite
    Hu, Jinglu
    [J]. 2019 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2019,
  • [47] Multiview Semi-Supervised Learning Model for Image Classification
    Nie, Feiping
    Tian, Lai
    Wang, Rong
    Li, Xuelong
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2020, 32 (12) : 2389 - 2400
  • [48] Important citations identification with semi-supervised classification model
    Xin An
    Xin Sun
    Shuo Xu
    [J]. Scientometrics, 2022, 127 : 6533 - 6555
  • [49] Important citations identification with semi-supervised classification model
    An, Xin
    Sun, Xin
    Xu, Shuo
    [J]. SCIENTOMETRICS, 2022, 127 (11) : 6533 - 6555
  • [50] A Semi-supervised Hidden Markov Topic Model Based on Prior Knowledge
    Seifollahi, Sattar
    Piccardi, Massimo
    Borzeshi, Ehsan Zare
    [J]. DATA MINING, AUSDM 2017, 2018, 845 : 265 - 276