Integration of Neural Embeddings and Probabilistic Models in Topic Modeling

Authors
Koochemeshkian, Pantea [1]
Bouguila, Nizar [1]
Affiliations
[1] Concordia Inst Informat Syst Engn CIISE, Informat Syst Engn, Montreal, PQ, Canada
Keywords
DIRICHLET; EXTRACTION
DOI
10.1080/08839514.2024.2403904
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Topic modeling, a way to find topics in large volumes of text, has grown with the help of deep learning. This paper presents two novel approaches to topic modeling by integrating embeddings derived from Bert-Topic with the multi-grain clustering topic model (MGCTM). Recognizing the inherent hierarchical and multi-scale nature of topics in corpora, our methods utilize MGCTM to capture topic structures at multiple levels of granularity. We enhance the expressiveness of MGCTM by introducing the Generalized Dirichlet and Beta-Liouville distributions as priors, which provide greater flexibility in modeling topic proportions and capturing richer topic relationships. Comprehensive experiments on various datasets showcase the effectiveness of our proposed models in achieving superior topic coherence and granularity compared to state-of-the-art methods. Our findings underscore the potential of leveraging hybrid architectures, marrying neural embeddings with advanced probabilistic modeling, to push the boundaries of topic modeling.
Pages: 33