Detecting global and local topics via mining twitter data

被引:12
|
作者
Liu, Huan [1 ]
Ge, Yong [2 ]
Zheng, Qinghua [1 ]
Lin, Rongcheng [3 ]
Li, Huayu [3 ]
机构
[1] Xi An Jiao Tong Univ, Dept Comp Sci & Technol, MOEKLINNS Lab, Xian, Shaanxi, Peoples R China
[2] Nanjing Univ Finance & Econ, Coll Informat Engn, Nanjing, Jiangsu, Peoples R China
[3] Univ North Carolina Charlotte, Dept Comp Sci, Charlotte, NC USA
基金
中国国家自然科学基金;
关键词
Social event; Probabilistic graphical model; Twitter; Global and local topic; EVENT DETECTION;
D O I
10.1016/j.neucom.2017.07.056
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Detecting topics from Twitter has been widely studied for understanding social events. There are two types of topics, i.e., global topics attracting widespread tweets with larger volume and local topics drawing attention of limited tweets of somewhere. However, most of existent works neglect the difference between them and suffer from the Long Tail Effect, resulting in the inability to detect the local one. In this paper, we distinguish global and local topics by associating each tweet with both of them simultaneously. We propose a probabilistic graphical model to extract global and local topics related to social events in a unified framework at the same time. Our model learns global topics using tweets scattered around all locations, while studies local topics merely utilizing tweets within the corresponding location. We collect two tweet datasets on Twitter from several cities in USA and evaluate our model over them. The experimental results show significant improvement of our model compared to baseline methods. (C) 2017 Elsevier B.V. All rights reserved.
引用
收藏
页码:120 / 132
页数:13
相关论文
共 50 条
  • [31] Local and global methods in data mining: Basic techniques and open problems
    Mannila, H
    AUTOMATA, LANGUAGES AND PROGRAMMING, 2002, 2380 : 57 - 68
  • [32] How people talk about health? Detecting Health Topics from Twitter Streams
    Comito, Carmela
    Pizzuti, Clara
    Procopio, Nicola
    INTERNATIONAL CONFERENCE ON BIG DATA AND INTERNET OF THINGS (BDIOT 2017), 2017, : 85 - 90
  • [33] Mining bursty topics from twitter text streams based on Labeled-LDA
    Dai, Guangying
    Xu, Ming
    Xu, Jian
    Ren, Yizhi
    Zhang, Haiping
    Zheng, Ning
    Journal of Computational Information Systems, 2014, 10 (11): : 4905 - 4912
  • [34] Detecting Citizen Problems and Their Locations Using Twitter Data
    Abali, Gizem
    Karaarslan, Enis
    Hurriyetoglu, Ali
    Dalkilic, Feristah
    2018 6TH INTERNATIONAL ISTANBUL SMART GRIDS AND CITIES CONGRESS AND FAIR (ICSG ISTANBUL 2018), 2018, : 30 - 33
  • [35] Detecting Hot Topics From Academic Big Data
    Wang, Beibei
    Yang, Bo
    Shan, Shuangshuang
    Chen, Hechang
    IEEE ACCESS, 2019, 7 : 185916 - 185927
  • [36] Hawkes process marked with topics and its application to Twitter data analysis
    Goda, Masatoshi
    Mizuno, Takayuki
    Yano, Ryosuke
    EPL, 2022, 140 (06)
  • [37] Using Word Embedding to Evaluate the Coherence of Topics from Twitter Data
    Fang, Anjie
    Macdonald, Craig
    Ounis, Iadh
    Habel, Philip
    SIGIR'16: PROCEEDINGS OF THE 39TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 2016, : 1057 - 1060
  • [38] Mining Twitter Data for Improved Understanding of Disaster Resilience
    Zou, Lei
    Lam, Nina S. N.
    Cai, Heng
    Qiang, Yi
    ANNALS OF THE AMERICAN ASSOCIATION OF GEOGRAPHERS, 2018, 108 (05) : 1422 - 1441
  • [39] Twitter Data Mining for the Social Awareness of Emerging Technologies
    Li Xin
    Xie Qianqian
    Huang Lucheng
    Yuan Zhou
    2017 PORTLAND INTERNATIONAL CONFERENCE ON MANAGEMENT OF ENGINEERING AND TECHNOLOGY (PICMET), 2017,
  • [40] Opinion Mining and Sentiment Analysis on a Twitter Data Stream
    Gokulakrishnan, Balakrishnan
    Priyanthan, Pavalanathan
    Ragavan, Thiruchittampalam
    Prasath, Nadarajah
    Perera, A. Shehan
    INTERNATIONAL CONFERENCE ON ADVANCES IN ICT FOR EMERGING REGIONS (ICTER2012), 2012, : 182 - 188