A probabilistic method for emerging topic tracking in Microblog stream

Cited by: 96
Authors
Huang, Jiajia [1 ]
Peng, Min [1 ]
Wang, Hua [2 ]
Cao, Jinli [3 ]
Gao, Wang [1 ]
Zhang, Xiuzhen [4 ]
Affiliations
[1] Wuhan Univ, State Key Lab Software Engn, Wuhan 430072, Peoples R China
[2] Victoria Univ, Ctr Appl Informat, Melbourne, Vic 3001, Australia
[3] La Trobe Univ, Comp Sci & Comp Engn, Bundoora, Vic 3086, Australia
[4] RMIT Univ, Sch CS&IT, GPO Box 2476, Melbourne, Vic 3001, Australia
Funding
National Science Foundation (United States);
Keywords
Microblog stream; Emerging topic; LWLR; Topic evolution; Optimization problem;
DOI
10.1007/s11280-016-0390-4
Chinese Library Classification (CLC) number
TP [Automation Technology, Computer Technology];
Discipline classification code
0812 ;
Abstract
Microblogs are a popular, open platform for discovering and sharing the latest news about social issues and daily life. Because microblog streams update quickly, effective tools for monitoring them are urgently needed. Emerging topic tracking is one such tool: it reveals which new events are currently attracting the most online attention. However, because microblog feeds change fast, are noisy, and are short, emerging topic tracking must address two challenges. One is detecting emerging topics early, long before they become hot; the other is effectively monitoring how topics evolve over time. In this study, we propose a novel emerging topic tracking method that aligns emerging word detection from a temporal perspective with coherent topic mining from a spatial perspective. Specifically, we first design a metric that estimates word novelty and fading based on locally weighted linear regression (LWLR); it highlights the novelty of words that express an emerging topic and suppresses the novelty of words that express an existing topic. We then track emerging topics by leveraging topic novelty and fading probabilities, which are learned by formulating and solving an optimization problem. We evaluate our method on a microblog stream containing over one million feeds. Experimental results show that the proposed method performs promisingly, in both effectiveness and efficiency, at detecting emerging topics and tracking topic evolution over time.
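The abstract names locally weighted linear regression (LWLR) as the basis of the word novelty and fading metric but does not give the metric itself. As a rough illustration of the underlying regression only, the sketch below fits a Gaussian-weighted line to a word's per-slot counts and reads off the local slope at the most recent time slot. The function names `lwlr_fit_at` and `local_trend`, the Gaussian kernel, and the bandwidth `tau` are assumptions made for this example; they are not taken from the paper.

```python
import math


def lwlr_fit_at(xs, ys, x_query, tau=1.0):
    """Fit a weighted line around x_query; return (prediction, local slope).

    Weights come from a Gaussian kernel of bandwidth tau (an assumption here),
    so points near x_query dominate the fit.
    """
    weights = [math.exp(-((x - x_query) ** 2) / (2.0 * tau ** 2)) for x in xs]
    total = sum(weights)
    if total == 0.0:
        raise ValueError("all kernel weights underflowed; increase tau")

    # Closed-form weighted least squares for a single predictor.
    mean_x = sum(w * x for w, x in zip(weights, xs)) / total
    mean_y = sum(w * y for w, y in zip(weights, ys)) / total
    var_x = sum(w * (x - mean_x) ** 2 for w, x in zip(weights, xs))
    if var_x == 0.0:
        return mean_y, 0.0  # degenerate case: no spread in x
    cov_xy = sum(
        w * (x - mean_x) * (y - mean_y) for w, x, y in zip(weights, xs, ys)
    )
    slope = cov_xy / var_x
    return mean_y + slope * (x_query - mean_x), slope


def local_trend(counts, tau=1.5):
    """Hypothetical helper: local slope of a word's counts at the latest slot.

    A positive value suggests rising usage, a negative value suggests fading.
    This is an illustrative signal, not the metric defined in the paper.
    """
    xs = list(range(len(counts)))
    _, slope = lwlr_fit_at(xs, counts, xs[-1], tau=tau)
    return slope


rising = local_trend([1, 1, 2, 4, 8])   # counts growing toward the present
fading = local_trend([8, 4, 2, 1, 1])   # counts shrinking toward the present
```

In this toy setup `rising` comes out positive and `fading` negative. A smaller `tau` makes the slope react more to the most recent slots, while a larger one smooths over a longer history.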
Pages: 325 - 350
Page count: 26
Related papers
50 in total
  • [1] A probabilistic method for emerging topic tracking in Microblog stream
    Jiajia Huang
    Min Peng
    Hua Wang
    Jinli Cao
    Wang Gao
    Xiuzhen Zhang
    [J]. World Wide Web, 2017, 20 : 325 - 350
  • [2] A Self-Adaptive Microblog Topic Tracking Method by User Relationship
    [J]. Zhang, Chuang (zhangchuang@iie.ac.cn), 2017, Chinese Institute of Electronics (45):
  • [3] A Refined Method for Detecting Interpretable and Real-Time Bursty Topic in Microblog Stream
    Zhang, Tao
    Zhou, Bin
    Huang, Jiuming
    Jia, Yan
    Zhang, Bing
    Li, Zhi
    [J]. WEB INFORMATION SYSTEMS ENGINEERING, WISE 2017, PT I, 2017, 10569 : 3 - 17
  • [4] A Topic Detection Method for Chinese Microblog
    Xie, Jing
    Liu, Gongshen
    Ning, Wei
    [J]. 2012 INTERNATIONAL SYMPOSIUM ON INFORMATION SCIENCE AND ENGINEERING (ISISE), 2012, : 100 - 103
  • [5] Emerging Topic Detection from Microblog Streams Based on Emerging Pattern Mining
    Peng, Min
    Ouyang, Shuang
    Zhu, Jiahui
    Huang, Jiajia
    Wang, Hua
    Yong, Jianming
    [J]. PROCEEDINGS OF THE 2018 IEEE 22ND INTERNATIONAL CONFERENCE ON COMPUTER SUPPORTED COOPERATIVE WORK IN DESIGN (CSCWD), 2018, : 259 - 264
  • [6] A Topic Detection Method Based on Microblog Weight
    Guo, Kaijie
    Shi, Liang
    [J]. 2015 INTERNATIONAL CONFERENCE ON CYBER-ENABLED DISTRIBUTED COMPUTING AND KNOWLEDGE DISCOVERY, 2015, : 209 - 212
  • [7] Identifying and tracking topic-level influencers in the microblog streams
    Su, Sen
    Wang, Yakun
    Zhang, Zhongbao
    Chang, Cheng
    Zia, Muhammad Azam
    [J]. MACHINE LEARNING, 2018, 107 (03) : 551 - 578
  • [8] Identifying and tracking topic-level influencers in the microblog streams
    Sen Su
    Yakun Wang
    Zhongbao Zhang
    Cheng Chang
    Muhammad Azam Zia
    [J]. Machine Learning, 2018, 107 : 551 - 578
  • [9] Emerging topic tracking system
    Bun, KK
    Ishizuka, M
    [J]. THIRD INTERNATIONAL WORKSHOP ON ADVANCED ISSUES OF E-COMMERCE AND WEB-BASED INFORMATION SYSTEMS, PROCEEDINGS, 2001, : 2 - 11
  • [10] Microblog Hot Spot Mining Based on PAM Probabilistic Topic Model
    Zheng, Yaxin
    Ling, Liu
    [J]. INTERNATIONAL CONFERENCE ON ENGINEERING TECHNOLOGY AND APPLICATION (ICETA 2015), 2015, 22