WINDOW-BASED TOPIC MODEL FOR HDP

被引:0
|
作者
Liu, Di [1 ]
Zeng, Ye [1 ]
Luo, Yu [1 ]
Pang, Hong [1 ]
Wu, Xiao-Hua [1 ]
机构
[1] Univ Elect Sci & Technol China, Sch Informat & Software Engn, Chengdu 610054, Peoples R China
关键词
Hierarchical Dirichlet process; Topic model; Window; Belief propagation;
D O I
10.1109/iccwamtip47768.2019.9067737
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Hierarchical Dirichlet process (HDP) is a non-parametric Bayesian model, and has been widely applied in the application of topic models. However, the model is based on the "bag of words" hypothesis, ignoring the order of words in the document, resulting in a lack of word context semantics. In this regard, this paper proposes a window-based hierarchical Dirichlet process model (WHDP). The model uses windows to divide documents into smaller fragments, and keeps the order between words while moving windows, so as to reduce the semantic confusion of the text. We applied our method in real dataset and compared with other existing methods, such as sampling belief propagation algorithm for HDP, LDA model, and sliding window based topic model. The results show that the proposed method performs the superiority in convergence rate, perplexity and generalization ability.
引用
收藏
页码:70 / 75
页数:6
相关论文
共 50 条
  • [1] Fluid model for window-based congestion control mechanism
    La, RJ
    WSC'01: PROCEEDINGS OF THE 2001 WINTER SIMULATION CONFERENCE, VOLS 1 AND 2, 2001, : 1282 - 1290
  • [2] WINDOW-BASED SURVEILLANCE STRATEGIES
    KRISHNA, CM
    GANZ, A
    WANG, X
    IEE PROCEEDINGS-COMPUTERS AND DIGITAL TECHNIQUES, 1995, 142 (03): : 233 - 236
  • [3] Topic mining from microblogs based on MB-HDP model
    Liu, Shao-Peng
    Yin, Jian
    Ouyang, Jia
    Huang, Yun
    Yang, Xiao-Ying
    Jisuanji Xuebao/Chinese Journal of Computers, 2015, 38 (07): : 1408 - 1419
  • [4] A window-based inverse Hough transform
    Kesidis, AL
    Papamarkos, N
    PATTERN RECOGNITION, 2000, 33 (06) : 1105 - 1117
  • [5] A window-based algorithm for skyline queries
    Yu, J
    Liu, X
    Liu, GH
    PDCAT 2005: Sixth International Conference on Parallel and Distributed Computing, Applications and Technologies, Proceedings, 2005, : 907 - 909
  • [6] Window-based, discontinuity preserving stereo
    Agrawal, M
    Davis, LS
    PROCEEDINGS OF THE 2004 IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOL 1, 2004, : 66 - 73
  • [7] Mining Unstructured Economic Indicators Based on PSP_HDP Topic Model
    Zhang Y.-T.
    Wan C.-X.
    Liu X.-P.
    Jiang T.-J.
    Liu D.-X.
    Liao G.-Q.
    Ruan Jian Xue Bao/Journal of Software, 2020, 31 (03): : 845 - 865
  • [8] Window-based method for information retrieval
    Jin, QL
    Zhao, J
    Xu, B
    NATURAL LANGUAGE PROCESSING - IJCNLP 2004, 2005, 3248 : 120 - 129
  • [9] Window-based capillary flow porometer
    不详
    CANADIAN CERAMICS QUARTERLY-JOURNAL OF THE CANADIAN CERAMIC SOCIETY, 1996, 65 (02): : 94 - 94
  • [10] Window-Based Constant Beamwidth Beamformer
    Long, Tao
    Cohen, Israel
    Berdugo, Baruch
    Yang, Yan
    Chen, Jingdong
    SENSORS, 2019, 19 (09)