Event detection from real-time twitter streaming data using community detection algorithm

被引:0
|
作者
Jagrati Singh
Digvijay Pandey
Anil Kumar Singh
机构
[1] Motilal Nehru National Institute of Technology,Computer Science & Engineering
[2] Electronics & Communication Engineering,Department of Technical Education (Government of U.P)
来源
关键词
Twitter stream; Clustering; Supervised; Unsupervised technique; Semantic correlation; Keyword co-occurrence; Topic modeling;
D O I
暂无
中图分类号
学科分类号
摘要
The increasing popularity of social media services has led to more and more people using Twitter. There are millions of tweets with a high amount of noisy data that propagate daily on the Internet. Twitter acts as a source of information for events and breaking news. However, it is very challenging for any person to extract useful information related to important events manually, from the end- less stream of tweets. Hence, it is desired to automate the whole process of event detection, so that important events can be identified in real-time from a stream of tweets, as early as possible, after the actual happening. Most of the existing approaches are more focussed on “What happened”. To define any event, answers of “When” and “Where” are also required. To handle emergency events, location and time parameters play a very important role. This article proposes a faster location based event detection approach without compromising accuracy, which automatically extracts separate clusters concerning local or global events from real-time streaming data. The proposed approach consists of four major steps. In the first step, a new dynamic weighting scheme named Conditional Term Frequency-Average Inverse Window Frequency (CTF-AIWF) based on TF-IDF is proposed to capture emerging keywords from the temporal dynamics of data. Next, a new clustering algorithm named Edge Significance based Louvain Algorithm (ESBLA) is proposed to group the same event keywords. This clustering helps in improving the run-time performance up to 50% while maintaining the quality performance (F1-score) comparable to the baseline models. In the third step, a new content-based location detection technique is proposed to detect the location of the event. This technique is able to handle various issues like use of informal text, short form of a text, and misspelled keywords of microblogging data. Finally, Google Map is used to visualize the events in happening locations. This step makes the decision faster regarding the detected events. For the experimentation, tweets are collected in real-time and stored in MongoDB NoSQL database for processing.
引用
收藏
页码:23437 / 23464
页数:27
相关论文
共 50 条
  • [21] Real Time Road Traffic Event Detection Using Twitter and Spark
    Pandhare, Ketan R.
    Shah, Medha A.
    [J]. PROCEEDINGS OF THE 2017 INTERNATIONAL CONFERENCE ON INVENTIVE COMMUNICATION AND COMPUTATIONAL TECHNOLOGIES (ICICCT), 2017, : 445 - 449
  • [22] Mining Streaming Tweets for Real-Time Event Credibility Prediction in Twitter
    Zou, Jun
    Fekri, Faramarz
    McLaughlin, Steven W.
    [J]. PROCEEDINGS OF THE 2015 IEEE/ACM INTERNATIONAL CONFERENCE ON ADVANCES IN SOCIAL NETWORKS ANALYSIS AND MINING (ASONAM 2015), 2015, : 1586 - 1589
  • [23] Real-Time Bot Detection from Twitter Using the Twitterbot plus Framework
    Daouadi, Kheir Eddine
    Rebai, Rim Zghal
    Amous, Ikram
    [J]. JOURNAL OF UNIVERSAL COMPUTER SCIENCE, 2020, 26 (04) : 496 - 507
  • [24] Earthquake shakes Twitter users: Real-time event detection by social sensors
    University of Tokyo, Yayoi 2-11-16, Bunkyo-ku Tokyo, Japan
    [J]. Proc. Int. Conf. World Wide Web, WWW, (851-860):
  • [25] Real-time anomaly detection method for space imager streaming data based on HTM algorithm
    Song, Lei
    Liang, Haoran
    Zheng, Taisheng
    [J]. 201919TH IEEE INTERNATIONAL SYMPOSIUM ON HIGH ASSURANCE SYSTEMS ENGINEERING (HASE 2019), 2019, : 33 - 38
  • [26] Developing a Real-time Data Analytics Framework For Twitter Streaming Data
    Yadranjiaghdam, Babak
    Yasrobi, Seyedfaraz
    Tabrizi, Nasseh
    [J]. 2017 IEEE 6TH INTERNATIONAL CONGRESS ON BIG DATA (BIGDATA CONGRESS 2017), 2017, : 329 - 336
  • [27] TopicSketch: Real-Time Bursty Topic Detection from Twitter
    Xie, Wei
    Zhu, Feida
    Jiang, Jing
    Lim, Ee-Peng
    Wang, Ke
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2016, 28 (08) : 2216 - 2229
  • [28] Real-Time Event Detection and Feature Extraction using PMU Measurement Data
    Xu, Ti
    Overbye, Thomas
    [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON SMART GRID COMMUNICATIONS (SMARTGRIDCOMM), 2015, : 265 - 270
  • [29] Real-Time Detection of Traffic From Twitter Stream Analysis
    D'Andrea, Eleonora
    Ducange, Pietro
    Lazzerini, Beatrice
    Marcelloni, Francesco
    [J]. IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2015, 16 (04) : 2269 - 2283
  • [30] Using Federated Learning in Anomaly Detection and Analytics on Real-time Streaming Data of Healthcare
    Yogitha, M.
    Srinivas, K. S.
    [J]. PROCEEDINGS OF 2023 THE 7TH INTERNATIONAL CONFERENCE ON GRAPHICS AND SIGNAL PROCESSING, ICGSP, 2023, : 29 - 34