Benchmarking Neural Topic Models: An Empirical Study

被引:0
|
作者
Thanh-Nam Doan [1 ]
Tuan-Anh Hoang [2 ]
机构
[1] Univ Tennessee, Chattanooga, TN USA
[2] VNU Univ Sci, 334 Nguyen Trai, Hanoi, Vietnam
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Neural topic modeling approach has been attracting much attention recently as it is able to leverage the advantages of both neural networks and probabilistic topic models. Previous works have proposed several models that are based on this framework and obtained impressive experimental results compared to traditional probabilistic models. However, the reported result is not consistent across the works, making them hard for gaining a rigorous assessment of these approaches. This work aims to address this issue by offering an extensive empirical evaluation of typical neural topic models in different aspects using large, diverse datasets as well as a thorough set of metrics. Precisely, we examine the performance of these models in three tasks, namely uncovering cohesive topics, modeling the input documents, and representing them for downstream classification. Our results show that while the neural topic models are better in the first and the third tasks, the traditional probabilistic models are still a strong baseline and are better in the second task in many cases. These findings give us more insights for choosing off-the-shelf topic modeling toolboxes in different contexts, as well as for designing more comprehensive evaluation for neural topic models.
引用
收藏
页码:4363 / 4368
页数:6
相关论文
共 50 条
  • [31] Do Neural Topic Models Really Need Dropout? Analysis of the Effect of Dropout in Topic Modeling
    Adhya, Suman
    Lahiri, Avishek
    Sanyal, Debarshi Kumar
    [J]. 17TH CONFERENCE OF THE EUROPEAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EACL 2023, 2023, : 2220 - 2229
  • [32] Document Informed Neural Autoregressive Topic Models with Distributional Prior
    Gupta, Pankaj
    Chaudhary, Yatin
    Buettner, Florian
    Schuetze, Hinrich
    [J]. THIRTY-THIRD AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FIRST INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / NINTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2019, : 6505 - 6512
  • [33] Sales Forecasting with Partial Recurrent Neural Networks: Empirical Insights and Benchmarking Results
    Mueller-Navarra, Moritz
    Lessmann, Stefan
    Voss, Stefan
    [J]. 2015 48TH HAWAII INTERNATIONAL CONFERENCE ON SYSTEM SCIENCES (HICSS), 2015, : 1108 - 1116
  • [34] An Empirical Study on Intelligent Rural Tourism Service by Neural Network Algorithm Models
    Chen, Jingzhi
    Xue, Hongbo
    Tsa, Sang-Bing
    [J]. COMPLEXITY, 2021, 2021
  • [35] A Comparative Study of Topic Models for Topic Clustering of Chinese Web News
    Wu, Yonghui
    Ding, Yuxin
    Wang, Xiaolong
    Xu, Jun
    [J]. PROCEEDINGS OF 2010 3RD IEEE INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND INFORMATION TECHNOLOGY (ICCSIT 2010), VOL 5, 2010, : 236 - 240
  • [36] Information technology maturity stages and enterprise benchmarking: an empirical study
    Leem, Choon Seong
    Kim, Byeong Wan
    Yu, Eun Jung
    Paek, Min Ho
    [J]. INDUSTRIAL MANAGEMENT & DATA SYSTEMS, 2008, 108 (09) : 1200 - 1218
  • [37] An empirical study of benchmarking evaluation using MCDM in service industries
    Singh, Bhupender
    Grover, Sandeep
    Singh, Vikram
    [J]. MANAGERIAL AUDITING JOURNAL, 2017, 32 (02) : 111 - 147
  • [38] Implementation of benchmarking concepts in Indian automobile industry - an empirical study
    Panwar, Avinash
    Nepal, Bimal
    Jain, Rakesh
    Yadav, Om Prakash
    [J]. BENCHMARKING-AN INTERNATIONAL JOURNAL, 2013, 20 (06) : 777 - 804
  • [39] Adaptation of Language Models for SMT Using Neural Networks with Topic Information
    Zhao, Yinggong
    Huang, Shujian
    Dai, Xin-Yu
    Chen, Jiajun
    [J]. ACM TRANSACTIONS ON ASIAN AND LOW-RESOURCE LANGUAGE INFORMATION PROCESSING, 2016, 15 (03)
  • [40] Diversity-Aware Coherence Loss for Improving Neural Topic Models
    Li, Raymond
    Gonzalez-Pizarro, Felipe
    Xing, Linzi
    Murray, Gabriel
    Carenini, Giuseppe
    [J]. 61ST CONFERENCE OF THE THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, ACL 2023, VOL 2, 2023, : 1710 - 1722