Keyword-based Topic Modeling and Keyword Selection

被引:0
|
作者
Wang, Xingyu [1 ]
Zhang, Lida [2 ]
Klabjan, Diego [1 ]
机构
[1] Northwestern Univ, Evanston, IL 60208 USA
[2] Texas A&M Univ, College Stn, TX USA
关键词
Machine Learning; Topic Modeling; Generative Model; Social Media;
D O I
10.1109/BigData52589.2021.9671416
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Certain type of documents such as tweets are collected by specifying a set of keywords. As topics of interest change with time it is beneficial t o a djust k eywords d ynamically. The challenge is that these need to be specified ahead of knowing the forthcoming documents and the underlying topics. The future topics should mimic past topics of interest yet there should be some novelty in them. We develop a keyword-based topic model that dynamically selects a subset of keywords to be used to collect future documents. The generative process first selects keywords and then the underlying documents based on the specified k eywords. The model i s t rained b y u sing a variational lower bound and stochastic gradient optimization. The inference consists of finding a subset of keywords where given a subset the model predicts the underlying topic-word matrix for the unknown forthcoming documents. We compare the keyword topic model against a benchmark model using viral predictions of tweets combined with a topic model. The keyword-based topic model outperforms this sophisticated baseline model by 67%.
引用
收藏
页码:1148 / 1154
页数:7
相关论文
共 50 条
  • [1] Digital business foresight: Keyword-based analysis and CorEx topic modeling
    Bzhalava, Levan
    Kaivo-oja, Jari
    Hassan, Sohaib S.
    [J]. FUTURES, 2024, 155
  • [2] Keyword-based Deep Web database selection
    Fan, Ju
    Zhou, Li-Zhu
    [J]. Jisuanji Xuebao/Chinese Journal of Computers, 2011, 34 (10): : 1797 - 1804
  • [3] Keyword-based Vehicle Retrieval
    Park, Eun-Ju
    Kim, Hoyoung
    Jeong, Seonghwan
    Kang, Byungkon
    Kwon, YoungMin
    [J]. 2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, CVPRW 2021, 2021, : 4215 - 4222
  • [4] Timely and keyword-based dynamic content selection for public displays
    Ribeiro, Fernando Reinaldo
    Jose, Rui
    [J]. PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON COMPLEX, INTELLIGENT AND SOFTWARE INTENSIVE SYSTEMS (CISIS 2010), 2010, : 655 - 660
  • [5] Keyword-Based Delegable Proofs of Storage
    Sengupta, Binanda
    Ruj, Sushmita
    [J]. INFORMATION SECURITY PRACTICE AND EXPERIENCE (ISPEC 2018), 2018, 11125 : 282 - 298
  • [6] Keyword-based information retrieval for the WoT
    Xylomenos, George
    Zafeiratos, Evangelos
    Prokopakis, Marios
    [J]. SEC'19: PROCEEDINGS OF THE 4TH ACM/IEEE SYMPOSIUM ON EDGE COMPUTING, 2019, : 407 - 412
  • [7] Keyword-based information retrieval for the WoT
    Xylomenos, George
    Zafeiratos, Evangelos
    Prokopakis, Marios
    [J]. Proceedings of the 4th ACM/IEEE Symposium on Edge Computing, SEC 2019, 2019, : 407 - 412
  • [8] Keyword-Based Source Code Summarization
    Zhang, Shikun
    Xie, Rui
    Ye, Wei
    Hen, Long
    [J]. Jisuanji Yanjiu yu Fazhan/Computer Research and Development, 2020, 57 (09): : 1987 - 2000
  • [9] Matching Similarity for Keyword-Based Clustering
    Rezaei, Mohammad
    Franti, Pasi
    [J]. STRUCTURAL, SYNTACTIC, AND STATISTICAL PATTERN RECOGNITION, 2014, 8621 : 193 - 202
  • [10] Keyword-Based Metadata Modeling for Experimental Omics Data Dissemination
    Hwang, Kyoung Soon
    Lee, Keon Myung
    [J]. BIG DATA APPLICATIONS AND SERVICES 2017, 2019, 770 : 139 - 150