Nonparametric Topic Modeling with Neural Inference

被引:8
|
作者
Ning, Xuefei [1 ]
Zheng, Yin [2 ]
Jiang, Zhuxi [3 ]
Wang, Yu [1 ]
Yang, Huazhong [1 ]
Huang, Junzhou [4 ]
Zhao, Peilin [4 ]
机构
[1] Tsinghua Univ, Beijing, Peoples R China
[2] Tencent, Weixin Grp, Shenzhen, Peoples R China
[3] Momenta, Beijing, Peoples R China
[4] Tencent AI Lab, Shenzhen, Peoples R China
基金
中国国家自然科学基金;
关键词
D O I
10.1016/j.neucom.2019.12.128
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This work focuses on combining nonparametric topic models with Auto-Encoding Variational Bayes (AEVB). Specifically, we first propose iTM-VAE, where the topics are treated as trainable parameters and the document-specific topic proportions are obtained by a stick-breaking construction. The inference of iTM-VAE is modeled by neural networks such that it can be computed in a simple feed-forward manner. We also describe how to introduce a hyper-prior into iTM-VAE so as to model the uncertainty of the prior parameter. Actually, the hyper-prior technique is quite general and we show that it can be applied to other AEVB based models to alleviate the collapse-to-prior problem elegantly. Moreover, we also propose HiTM-VAE, where the document-specific topic distributions are generated in a hierarchical manner. HiTM-VAE is even more flexible and can generate topic representations with better variability and sparsity. Experimental results on 20News and Reuters RCV1-V2 datasets show that the proposed models outperform the state-of-the-art baselines significantly. The advantages of the hyper-prior technique and the hierarchical model construction are also confirmed by experiments. (c) 2020 Elsevier B.V. All rights reserved.
引用
收藏
页码:296 / 306
页数:11
相关论文
共 50 条
  • [41] Nonparametric Statistical Inference
    Shalabh
    JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES A-STATISTICS IN SOCIETY, 2011, 174 : 508 - 509
  • [42] Robust Nonparametric Inference
    Nordhausen, Klaus
    Oja, Hannu
    ANNUAL REVIEW OF STATISTICS AND ITS APPLICATION, VOL 5, 2018, 5 : 473 - 500
  • [43] ASPECTS OF NONPARAMETRIC INFERENCE
    PURI, ML
    INTERNATIONAL STATISTICAL REVIEW, 1972, 40 (03) : 299 - 327
  • [44] Review of Trends in Topic Modeling Techniques, Tools, Inference Algorithms and Applications
    Mulunda, Christine K.
    Wagacha, Peter W.
    Muchemi, Lawrence
    2018 5TH INTERNATIONAL CONFERENCE ON SOFT COMPUTING & MACHINE INTELLIGENCE (ISCMI), 2018, : 28 - 37
  • [45] Improving neural topic modeling via Sinkhorn divergence
    Liu, Luyang
    Huang, Heyan
    Gao, Yang
    Zhang, Yongfeng
    INFORMATION PROCESSING & MANAGEMENT, 2022, 59 (03)
  • [46] Neural Topic Modeling by Incorporating Document Relationship Graph
    Zhou, Deyu
    Hu, Xuemeng
    Wang, Rui
    PROCEEDINGS OF THE 2020 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP), 2020, : 3790 - 3796
  • [47] Integration of Neural Embeddings and Probabilistic Models in Topic Modeling
    Koochemeshkian, Pantea
    Bouguila, Nizar
    APPLIED ARTIFICIAL INTELLIGENCE, 2024, 38 (01)
  • [48] Neural Topic Modeling with Deep Mutual Information Estimation
    Xu, Kang
    Lu, Xiaoqiu
    Li, Yuan-fang
    Wu, Tongtong
    Qi, Guilin
    Ye, Ning
    Wang, Dong
    Zhou, Zheng
    BIG DATA RESEARCH, 2022, 30
  • [49] Bayesian Nonparametric Inference of the Neutron Star Equation of State via a Neural Network
    Han, Ming-Zhe
    Jiang, Jin-Liang
    Tang, Shao-Peng
    Fan, Yi-Zhong
    ASTROPHYSICAL JOURNAL, 2021, 919 (01):
  • [50] Nonparametric method of topic identification using granularity concept and graph-based modeling
    Ganguli, Isha
    Sil, Jaya
    Sengupta, Nandita
    NEURAL COMPUTING & APPLICATIONS, 2023, 35 (02): : 1055 - 1075