Nonparametric Topic Modeling with Neural Inference

被引:8
|
作者
Ning, Xuefei [1 ]
Zheng, Yin [2 ]
Jiang, Zhuxi [3 ]
Wang, Yu [1 ]
Yang, Huazhong [1 ]
Huang, Junzhou [4 ]
Zhao, Peilin [4 ]
机构
[1] Tsinghua Univ, Beijing, Peoples R China
[2] Tencent, Weixin Grp, Shenzhen, Peoples R China
[3] Momenta, Beijing, Peoples R China
[4] Tencent AI Lab, Shenzhen, Peoples R China
基金
中国国家自然科学基金;
关键词
D O I
10.1016/j.neucom.2019.12.128
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This work focuses on combining nonparametric topic models with Auto-Encoding Variational Bayes (AEVB). Specifically, we first propose iTM-VAE, where the topics are treated as trainable parameters and the document-specific topic proportions are obtained by a stick-breaking construction. The inference of iTM-VAE is modeled by neural networks such that it can be computed in a simple feed-forward manner. We also describe how to introduce a hyper-prior into iTM-VAE so as to model the uncertainty of the prior parameter. Actually, the hyper-prior technique is quite general and we show that it can be applied to other AEVB based models to alleviate the collapse-to-prior problem elegantly. Moreover, we also propose HiTM-VAE, where the document-specific topic distributions are generated in a hierarchical manner. HiTM-VAE is even more flexible and can generate topic representations with better variability and sparsity. Experimental results on 20News and Reuters RCV1-V2 datasets show that the proposed models outperform the state-of-the-art baselines significantly. The advantages of the hyper-prior technique and the hierarchical model construction are also confirmed by experiments. (c) 2020 Elsevier B.V. All rights reserved.
引用
收藏
页码:296 / 306
页数:11
相关论文
共 50 条
  • [1] Tree-Structured Topic Modeling with Nonparametric Neural Variational Inference
    Chen, Ziye
    Ding, Cheng
    Zhang, Zusheng
    Rao, Yanghui
    Xie, Haoran
    59TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS AND THE 11TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING, VOL 1 (ACL-IJCNLP 2021), 2021, : 2343 - 2353
  • [2] Neural Topic Modeling via Discrete Variational Inference
    Gupta, Amulya
    Zhang, Zhu
    ACM TRANSACTIONS ON INTELLIGENT SYSTEMS AND TECHNOLOGY, 2023, 14 (02)
  • [3] Nonparametric neural topic modeling for customer insight extraction about the tire industry
    Palencia-Olivar, Miguel
    Bonnevay, Stephane
    Aussem, Alexandre
    Canitia, Bruno
    2022 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2022,
  • [4] NONPARAMETRIC MODELING AND INFERENCE - AN ALTERNATIVE
    ARJOON, S
    SOCIAL AND ECONOMIC STUDIES, 1993, 42 (2-3) : 209 - 224
  • [5] Encouraging Sparsity in Neural Topic Modeling with Non-Mean-Field Inference
    Chen, Jiayao
    Wang, Rui
    He, Jueying
    Li, Mark Junjie
    MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES: RESEARCH TRACK, ECML PKDD 2023, PT IV, 2023, 14172 : 142 - 158
  • [6] Bayesian nonparametric inference of latent topic hierarchies for multimodal data
    Shimamawari, Takuji
    Eguchi, Koji
    Takasu, Atsuhiro
    PROCEEDINGS 3RD IAPR ASIAN CONFERENCE ON PATTERN RECOGNITION ACPR 2015, 2015, : 236 - 240
  • [7] Bayesian Nonparametric Modeling for Causal Inference
    Hill, Jennifer L.
    JOURNAL OF COMPUTATIONAL AND GRAPHICAL STATISTICS, 2011, 20 (01) : 217 - 240
  • [8] Nonparametric Spherical Topic Modeling with Word Embeddings
    Batmanghelich, Nematollah Kayhan
    Saeedi, Ardavan
    Narasimhan, Karthik R.
    Gershman, Samuel J.
    PROCEEDINGS OF THE 54TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2016), VOL 2, 2016, : 537 - 542
  • [9] The Nested Chinese Restaurant Process and Bayesian Nonparametric Inference of Topic Hierarchies
    Blei, David M.
    Griffiths, Thomas L.
    Jordan, Michael I.
    JOURNAL OF THE ACM, 2010, 57 (02)
  • [10] Robustness and inference in nonparametric partial frontier modeling
    Daouia, Abdelaati
    Gijbels, Irene
    JOURNAL OF ECONOMETRICS, 2011, 161 (02) : 147 - 165