GraphBTM: Graph Enhanced Autoencoded Variational Inference for Biterm Topic Model

被引:0
|
作者
Zhu, Qile [1 ]
Feng, Zheng [1 ]
Li, Xiaolin [1 ]
机构
[1] Univ Florida, NSF Ctr Big Learning, Large Scale Intelligent Syst Lab, Gainesville, FL 32611 USA
基金
美国国家卫生研究院; 美国国家科学基金会;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Discovering the latent topics within texts has been a fundamental task for many applications. However, conventional topic models suffer different problems in different settings. The Latent Dirichlet Allocation (LDA) may not work well for short texts due to the data sparsity (i.e., the sparse word co-occurrence patterns in short documents). The Biterm Topic Model (BTM) learns topics by modeling the word-pairs named biterms in the whole corpus. This assumption is very strong when documents are long with rich topic information and do not exhibit the transitivity of biterms. In this paper, we propose a novel way called GraphBTM to represent biterms as graphs and design Graph Convolutional Networks (GCNs) with residual connections to extract transitive features from biterms. To overcome the data sparsity of LDA and the strong assumption of BTM, we sample a fixed number of documents to form a mini-corpus as a training instance. We also propose a dataset called All News extracted from (Thompson, 2017), in which documents are much longer than 20 Newsgroups. We present an amortized variational inference method for GraphBTM. Our method generates more coherent topics compared with previous approaches. Experiments show that the sampling strategy improves performance by a large margin.
引用
收藏
页码:4663 / 4672
页数:10
相关论文
共 50 条
  • [21] Cross-Lingual Taxonomy Alignment with Bilingual Biterm Topic Model
    Wu, Tianxing
    Qi, Guilin
    Wang, Haofen
    Xu, Kang
    Cui, Xuan
    THIRTIETH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2016, : 287 - 293
  • [22] Relational Biterm Topic Model: Short-Text Topic Modeling using Word Embeddings
    Li, Ximing
    Zhang, Ang
    Li, Changchun
    Guo, Lantian
    Wang, Wenting
    Ouyang, Jihong
    COMPUTER JOURNAL, 2019, 62 (03): : 359 - 372
  • [23] Relational Biterm Topic Model: Short-Text Topic Modeling using Word Embeddings
    Li, Ximing
    Zhang, Ang
    Li, Changchun
    Guo, Lantian
    Wang, Wenting
    Ouyang, Jihong
    Computer Journal, 2019, 62 (03): : 359 - 372
  • [24] A Novel Perspective to Mining Online Hotel Reviews Based on Biterm Topic Model
    Ma, Qianqian
    Du, Huiying
    Wang, Zhiyuan
    2021 IEEE 6TH INTERNATIONAL CONFERENCE ON BIG DATA ANALYTICS (ICBDA 2021), 2021, : 310 - 315
  • [25] Neural Topic Modeling via Discrete Variational Inference
    Gupta, Amulya
    Zhang, Zhu
    ACM TRANSACTIONS ON INTELLIGENT SYSTEMS AND TECHNOLOGY, 2023, 14 (02)
  • [26] Empirical study on variational inference methods for topic models
    Chi, Jinjin
    Ouyang, Jihong
    Li, Ximing
    Li, Changchun
    JOURNAL OF EXPERIMENTAL & THEORETICAL ARTIFICIAL INTELLIGENCE, 2018, 30 (01) : 129 - 142
  • [27] Topic Modeling on Health Journals with Regularized Variational Inference
    Giaquinto, Robert
    Banerjee, Arindam
    THIRTY-SECOND AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTIETH INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / EIGHTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2018, : 3021 - 3028
  • [28] Stochastic Variational Inference for Dynamic Correlated Topic Models
    Tomasi, Federico
    Ravichandran, Praveen
    Levy-Fix, Gal
    Lalmas, Mounia
    Dai, Zhenwen
    CONFERENCE ON UNCERTAINTY IN ARTIFICIAL INTELLIGENCE (UAI 2020), 2020, 124 : 859 - 868
  • [29] Dataless Short Text Classification Based on Biterm Topic Model and Word Embeddings
    Yang, Yi
    Wang, Hongan
    Zhu, Jiaqi
    Wu, Yunkun
    Jiang, Kailong
    Guo, Wenli
    Shi, Wandong
    PROCEEDINGS OF THE TWENTY-NINTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2020, : 3969 - 3975
  • [30] Variational Inference with Graph Regularization for Image Annotation
    Shao, Yuanlong
    Zhou, Yuan
    Cai, Deng
    ACM TRANSACTIONS ON INTELLIGENT SYSTEMS AND TECHNOLOGY, 2011, 2 (02)