User Based Aggregation for Biterm Topic Model

Citations: 0
Authors
Chen, Weizheng [1 ]
Wang, Jinpeng [1 ]
Zhang, Yan [1 ]
Yan, Hongfei [1 ]
Li, Xiaoming [1 ]
Affiliations
[1] Peking Univ, Sch Elect Engn & Comp Sci, Beijing, Peoples R China
Keywords
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
The Biterm Topic Model (BTM) is designed to model the generative process of word co-occurrence patterns in short texts such as tweets. However, two aspects of BTM may restrict its performance: 1) user individualities are ignored in order to obtain corpus-level word co-occurrence patterns; and 2) the strong assumption that two co-occurring words are assigned the same topic label cannot distinguish background words from topical words. In this paper, we propose the Twitter-BTM model to address these issues by considering user-level personalization in BTM. First, we use user-based biterm aggregation to learn user-specific topic distributions. Second, each user's preference between background words and topical words is estimated by incorporating a background topic. Experiments on a large-scale real-world Twitter dataset show that Twitter-BTM outperforms several state-of-the-art baselines.
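As a rough illustration of the user-based biterm aggregation mentioned in the abstract, the sketch below extracts every unordered word pair (biterm) from each short text and pools the counts per user. It is not the authors' implementation: the function names `extract_biterms` and `aggregate_by_user`, the `(user_id, tokens)` input shape, and the toy tweets are all assumptions made for this example, and no topic inference or background topic is modeled.

```python
from collections import Counter, defaultdict
from itertools import combinations


def extract_biterms(tokens):
    """Return every unordered word pair (biterm) in one short text."""
    # Sorting each pair makes ("topic", "model") and ("model", "topic") identical.
    return [tuple(sorted(pair)) for pair in combinations(tokens, 2)]


def aggregate_by_user(tweets):
    """Pool biterm counts per user.

    `tweets` is an iterable of (user_id, tokens) pairs; this input shape is
    an assumption of the sketch, not something specified by the paper.
    """
    per_user = defaultdict(Counter)
    for user_id, tokens in tweets:
        per_user[user_id].update(extract_biterms(tokens))
    return per_user


# Hypothetical toy data: two tweets from one user, one from another.
tweets = [
    ("alice", ["topic", "model", "tweets"]),
    ("alice", ["topic", "model"]),
    ("bob", ["short", "text"]),
]
user_biterms = aggregate_by_user(tweets)
```

Pooling biterms per user rather than over the whole corpus gives each user their own co-occurrence counts, which is the kind of input a user-specific topic distribution could be estimated from.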
Pages: 489-494
Number of pages: 6
Related papers
50 in total
  • [1] A Robust User Sentiment Biterm Topic Mixture Model Based on User Aggregation Strategy to Avoid Data Sparsity for Short Text
    Nimala K
    Jebakumar R
    [J]. Journal of Medical Systems, 2019, 43 (4)
  • [2] A Robust User Sentiment Biterm Topic Mixture Model Based on User Aggregation Strategy to Avoid Data Sparsity for Short Text
    Nimala, K.
    Jebakumar, R.
    [J]. JOURNAL OF MEDICAL SYSTEMS, 2019, 43 (04)
  • [3] Sparse Biterm Topic Model for Short Texts
    Zhu, Bingshan
    Cai, Yi
    Zhang, Huakui
    [J]. WEB AND BIG DATA, APWEB-WAIM 2021, PT I, 2021, 12858 : 227 - 241
  • [4] A Biterm-based Dirichlet Process Topic Model for Short Texts
    Pan, Yali
    Yin, Jian
    Liu, Shaopeng
    Li, Jing
    [J]. PROCEEDINGS OF THE 3RD INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND SERVICE SYSTEM (CSSS), 2014, 109 : 301 - 304
  • [5] A Biterm Topic Model for Sparse Mutation Data
    Sason, Itay
    Chen, Yuexi
    Leiserson, Mark D. M.
    Sharan, Roded
    [J]. CANCERS, 2023, 15 (05)
  • [6] Improving biterm topic model with word embeddings
    Jiajia Huang
    Min Peng
    Pengwei Li
    Zhiwei Hu
    Chao Xu
    [J]. World Wide Web, 2020, 23 : 3099 - 3124
  • [7] Improving biterm topic model with word embeddings
    Huang, Jiajia
    Peng, Min
    Li, Pengwei
    Hu, Zhiwei
    Xu, Chao
    [J]. WORLD WIDE WEB-INTERNET AND WEB INFORMATION SYSTEMS, 2020, 23 (06): : 3099 - 3124
  • [8] A Novel Perspective to Mining Online Hotel Reviews Based on Biterm Topic Model
    Ma, Qianqian
    Du, Huiying
    Wang, Zhiyuan
    [J]. 2021 IEEE 6TH INTERNATIONAL CONFERENCE ON BIG DATA ANALYTICS (ICBDA 2021), 2021, : 310 - 315
  • [9] Dataless Short Text Classification Based on Biterm Topic Model and Word Embeddings
    Yang, Yi
    Wang, Hongan
    Zhu, Jiaqi
    Wu, Yunkun
    Jiang, Kailong
    Guo, Wenli
    Shi, Wandong
    [J]. PROCEEDINGS OF THE TWENTY-NINTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2020, : 3969 - 3975
  • [10] FastBTM: Reducing the sampling time for biterm topic model
    He, Xingwei
    Xu, Hua
    Li, Jia
    He, Liu
    Yu, Linlin
    [J]. KNOWLEDGE-BASED SYSTEMS, 2017, 132 : 11 - 20