Topic change point detection using a mixed Bayesian model

Cited by: 0
Authors
Xiaoling Lu
Yuxuan Guo
Jiayi Chen
Feifei Wang
Affiliations
[1] Renmin University of China, Center for Applied Statistics
[2] Renmin University of China, School of Statistics
[3] Alibaba Group, Intelligent Marketing Platform
Source
Data Mining and Knowledge Discovery, 2022, 36 (01)
Keywords
Change point detection; Dynamic topic models; Latent Dirichlet allocation; Markov chain Monte Carlo
DOI
Not available
Chinese Library Classification number
Subject classification code
Abstract
Dynamic text documents, including news articles, user reviews, and blogs, are now commonly encountered in many fields, and the topics underlying these text streams change over time. To track topic changes in the growing accumulation of text documents, automatic text analysis models are needed to find the key changes in topics. To this end, this study proposes a topic change point detection (Topic-CD) model. Unlike previous studies, we define the change point of topics from the perspective of the hyperparameters associated with the topic-word distributions, which allows the model to detect change points underlying the whole topic set. Under this definition, topic modeling and change point detection are combined in a unified framework and performed simultaneously using a Markov chain Monte Carlo algorithm. In addition, the Topic-CD model does not require the number of change points to be set in advance, which makes it more convenient for practical use. We investigate the performance of the Topic-CD model numerically using synthetic data and three real datasets. The results show that the Topic-CD model identifies the change points in topics well when compared with several state-of-the-art methods.
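The idea of locating a change in word usage with a Bayesian model can be illustrated with a much simpler setting than the one the abstract describes. The sketch below is not the Topic-CD model: it assumes a single word distribution per segment (no latent topics), exactly one change point, a symmetric Dirichlet prior with a hypothetical concentration `beta`, and exact enumeration over candidate locations in place of Markov chain Monte Carlo. All function names and the toy counts are illustrative assumptions.

```python
import math


def log_dirichlet_multinomial(counts, beta):
    """Log marginal likelihood of a word sequence with the given per-word
    counts, integrating out the word distribution under a symmetric
    Dirichlet(beta) prior."""
    vocab_size = len(counts)
    total = sum(counts)
    out = math.lgamma(vocab_size * beta) - math.lgamma(vocab_size * beta + total)
    for c in counts:
        out += math.lgamma(beta + c) - math.lgamma(beta)
    return out


def change_point_posterior(slices, beta=0.5):
    """Posterior over a single change location tau in 1..T-1 (uniform prior).

    Time slices before tau share one word distribution; slices from tau
    onward share another. Each slice is a list of word counts.
    """
    num_slices = len(slices)
    vocab_size = len(slices[0])
    log_post = []
    for tau in range(1, num_slices):
        left = [sum(s[w] for s in slices[:tau]) for w in range(vocab_size)]
        right = [sum(s[w] for s in slices[tau:]) for w in range(vocab_size)]
        log_post.append(
            log_dirichlet_multinomial(left, beta)
            + log_dirichlet_multinomial(right, beta)
        )
    # Normalize in a numerically stable way.
    peak = max(log_post)
    weights = [math.exp(lp - peak) for lp in log_post]
    norm = sum(weights)
    return [w / norm for w in weights]


# Toy data (made up for illustration): 6 time slices over a 4-word
# vocabulary, with word usage shifting from slice index 3 onward.
slices = [
    [30, 25, 3, 2],
    [28, 27, 2, 3],
    [31, 24, 4, 1],
    [3, 2, 29, 26],
    [2, 4, 27, 27],
    [1, 3, 30, 26],
]
posterior = change_point_posterior(slices)
best_tau = 1 + max(range(len(posterior)), key=posterior.__getitem__)
print("most probable change location:", best_tau)
```

Enumeration is only feasible here because there is one change point and no latent topic assignments; with an unknown number of change points and latent topics, as in the setting the abstract describes, sampling-based inference is the usual alternative.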
Pages: 146 - 173
Number of pages: 27
Related papers
50 in total
  • [1] Topic change point detection using a mixed Bayesian model
    Lu, Xiaoling
    Guo, Yuxuan
    Chen, Jiayi
    Wang, Feifei
    DATA MINING AND KNOWLEDGE DISCOVERY, 2022, 36 (01) : 146 - 173
  • [2] Bayesian Change Point Detection for Mixed Data with Missing Values
    Murph, Alexander C.
    Storlie, Curtis B.
    2022 IEEE 10TH INTERNATIONAL CONFERENCE ON HEALTHCARE INFORMATICS (ICHI 2022), 2022, : 499 - 501
  • [3] Bayesian Model Selection for Change Point Detection and Clustering
    Mazhar, Othmane
    Rojas, Cristian R.
    Fischione, Carlo
    Hesamzadeh, Mohammad Reza
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 80, 2018, 80
  • [4] A Bayesian Change Point Model for Epileptic Seizure Detection
    Yildiz, Cagatay
    Bingol, Haluk O.
    Irim-Celik, Gulcin
    Aktekin, Berrin
    Aykut-Bingol, Canan
    2017 25TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE (SIU), 2017,
  • [5] Bayesian Hierarchical Model for Change Point Detection in Multivariate Sequences
    Jin, Huaqing
    Yin, Guosheng
    Yuan, Binhang
    Jiang, Fei
    TECHNOMETRICS, 2022, 64 (02) : 177 - 186
  • [6] Bayesian Complex Network Community Detection Using Nonparametric Topic Model
    Zhu, Ruimin
    Jiang, Wenxin
    COMPLEX NETWORKS AND THEIR APPLICATIONS VII, VOL 1, 2019, 812 : 280 - 291
  • [7] Change-point detection in astronomical data by using a hierarchical model and a Bayesian sampling approach
    Dobigeon, Nicolas
    Tourneret, Jean-Yves
    Scargle, Jeffrey D.
    2005 IEEE/SP 13th Workshop on Statistical Signal Processing (SSP), Vols 1 and 2, 2005, : 335 - 340
  • [8] BAYESIAN ONLINE CHANGE POINT DETECTION IN FINANCE
    Habibi, Reza
    FINANCIAL INTERNET QUARTERLY, 2022, 17 (04) : 27 - 33
  • [9] Bayesian change point detection for functional data
    Li, Xiuqi
    Ghosal, Subhashis
    JOURNAL OF STATISTICAL PLANNING AND INFERENCE, 2021, 213 : 193 - 205
  • [10] ON THE OPTIMALITY OF BAYESIAN CHANGE-POINT DETECTION
    Han, Dong
    Tsung, Fugee
    Xian, Jinguo
    ANNALS OF STATISTICS, 2017, 45 (04) : 1375 - 1402