Detecting global and local topics via mining twitter data

被引:12
|
作者
Liu, Huan [1 ]
Ge, Yong [2 ]
Zheng, Qinghua [1 ]
Lin, Rongcheng [3 ]
Li, Huayu [3 ]
机构
[1] Xi An Jiao Tong Univ, Dept Comp Sci & Technol, MOEKLINNS Lab, Xian, Shaanxi, Peoples R China
[2] Nanjing Univ Finance & Econ, Coll Informat Engn, Nanjing, Jiangsu, Peoples R China
[3] Univ North Carolina Charlotte, Dept Comp Sci, Charlotte, NC USA
基金
中国国家自然科学基金;
关键词
Social event; Probabilistic graphical model; Twitter; Global and local topic; EVENT DETECTION;
D O I
10.1016/j.neucom.2017.07.056
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Detecting topics from Twitter has been widely studied for understanding social events. There are two types of topics, i.e., global topics attracting widespread tweets with larger volume and local topics drawing attention of limited tweets of somewhere. However, most of existent works neglect the difference between them and suffer from the Long Tail Effect, resulting in the inability to detect the local one. In this paper, we distinguish global and local topics by associating each tweet with both of them simultaneously. We propose a probabilistic graphical model to extract global and local topics related to social events in a unified framework at the same time. Our model learns global topics using tweets scattered around all locations, while studies local topics merely utilizing tweets within the corresponding location. We collect two tweet datasets on Twitter from several cities in USA and evaluate our model over them. The experimental results show significant improvement of our model compared to baseline methods. (C) 2017 Elsevier B.V. All rights reserved.
引用
收藏
页码:120 / 132
页数:13
相关论文
共 50 条
  • [41] Data Mining of Twitter Retweets: A Visual and Practical Representation
    Zhan, Tiffany
    2021 IEEE 11TH ANNUAL COMPUTING AND COMMUNICATION WORKSHOP AND CONFERENCE (CCWC), 2021, : 672 - 676
  • [42] Using Twitter to engage with customers: a data mining approach
    Okazaki, Shintaro
    Diaz-Martin, Ana M.
    Rozano, Mercedes
    David Menendez-Benito, Hector
    INTERNET RESEARCH, 2015, 25 (03) : 416 - 434
  • [43] Analysis of Product Twitter Data though Opinion Mining
    Fernandes, Roshan
    D'Souza, Rio
    2016 IEEE ANNUAL INDIA CONFERENCE (INDICON), 2016,
  • [44] Detecting sentiment dynamics and clusters of Twitter users for trending topics in COVID-19 pandemic
    Ahmed, Md Shoaib
    Aurpa, Tanjim Taharat
    Anwar, Md Musfique
    PLOS ONE, 2021, 16 (08):
  • [45] Application of Data Mining For Identifying Topics at the Document Level
    Reza, Marifa Farzin
    Matin, Rizwana
    2013 INTERNATIONAL CONFERENCE ON INFORMATICS, ELECTRONICS & VISION (ICIEV), 2013,
  • [46] Mining Topics in Documents: Standing on the Shoulders of Big Data
    Chen, Zhiyuan
    Liu, Bing
    PROCEEDINGS OF THE 20TH ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING (KDD'14), 2014, : 1116 - 1125
  • [47] A Paralleled Big Data Algorithm with MapReduce Framework for Mining Twitter Data
    Li Bing
    Chan, Keith C. C.
    2014 IEEE FOURTH INTERNATIONAL CONFERENCE ON BIG DATA AND CLOUD COMPUTING (BDCLOUD), 2014, : 121 - 128
  • [48] Detecting Online Gambling Promotions on Indonesian Twitter Using Text Mining Algorithm
    Perdana, Reza Bayu
    Ardin, Indra
    Budi, Indra
    Santoso, Aris Budi
    Ramadiah, Amanah
    Putra, Prabu Kresna
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2024, 15 (08) : 942 - 949
  • [49] Detecting Research Topics via the Correlation between Graphs and Texts
    Jo, Yookyung
    Lagoze, Carl
    Giles, C. Lee
    KDD-2007 PROCEEDINGS OF THE THIRTEENTH ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2007, : 370 - +
  • [50] Sentiment Analysis of Shared Tweets on Global Warming on Twitter with Data Mining Methods: A Case Study on Turkish Language
    Kirelli, Yasin
    Arslankaya, Seher
    COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE, 2020, 2020 (2020)