Event detection from real-time twitter streaming data using community detection algorithm

被引:0
|
作者
Jagrati Singh
Digvijay Pandey
Anil Kumar Singh
机构
[1] Motilal Nehru National Institute of Technology,Computer Science & Engineering
[2] Electronics & Communication Engineering,Department of Technical Education (Government of U.P)
来源
关键词
Twitter stream; Clustering; Supervised; Unsupervised technique; Semantic correlation; Keyword co-occurrence; Topic modeling;
D O I
暂无
中图分类号
学科分类号
摘要
The increasing popularity of social media services has led to more and more people using Twitter. There are millions of tweets with a high amount of noisy data that propagate daily on the Internet. Twitter acts as a source of information for events and breaking news. However, it is very challenging for any person to extract useful information related to important events manually, from the end- less stream of tweets. Hence, it is desired to automate the whole process of event detection, so that important events can be identified in real-time from a stream of tweets, as early as possible, after the actual happening. Most of the existing approaches are more focussed on “What happened”. To define any event, answers of “When” and “Where” are also required. To handle emergency events, location and time parameters play a very important role. This article proposes a faster location based event detection approach without compromising accuracy, which automatically extracts separate clusters concerning local or global events from real-time streaming data. The proposed approach consists of four major steps. In the first step, a new dynamic weighting scheme named Conditional Term Frequency-Average Inverse Window Frequency (CTF-AIWF) based on TF-IDF is proposed to capture emerging keywords from the temporal dynamics of data. Next, a new clustering algorithm named Edge Significance based Louvain Algorithm (ESBLA) is proposed to group the same event keywords. This clustering helps in improving the run-time performance up to 50% while maintaining the quality performance (F1-score) comparable to the baseline models. In the third step, a new content-based location detection technique is proposed to detect the location of the event. This technique is able to handle various issues like use of informal text, short form of a text, and misspelled keywords of microblogging data. Finally, Google Map is used to visualize the events in happening locations. This step makes the decision faster regarding the detected events. For the experimentation, tweets are collected in real-time and stored in MongoDB NoSQL database for processing.
引用
收藏
页码:23437 / 23464
页数:27
相关论文
共 50 条
  • [31] Real-time Anomaly Detection for Streaming Data using Burst Code on a Neurosynaptic Processor
    Chen, Qiuwen
    Qiu, Qinru
    [J]. PROCEEDINGS OF THE 2017 DESIGN, AUTOMATION & TEST IN EUROPE CONFERENCE & EXHIBITION (DATE), 2017, : 205 - 207
  • [32] TopicSketch: Real-time Bursty Topic Detection from Twitter
    Xie, Wei
    Zhu, Feida
    Jiang, Jing
    Lim, Ee-Peng
    Wang, Ke
    [J]. 2013 IEEE 13TH INTERNATIONAL CONFERENCE ON DATA MINING (ICDM), 2013, : 837 - 846
  • [33] Real-Time Detection and Visualization of Traffic Conditions by Mining Twitter Data
    Khetarpaul, Sonia
    Sharma, Dolly
    Jose, Jackson I.
    Saragur, Mohith
    [J]. DATABASES THEORY AND APPLICATIONS (ADC 2022), 2022, 13459 : 141 - 152
  • [34] A Framework for Real-Time Spam Detection in Twitter
    Gupta, Himank
    Jamal, Mohd. Saalim
    Madisetty, Sreekanth
    Desarkar, Maunendra Sankar
    [J]. 2018 10TH INTERNATIONAL CONFERENCE ON COMMUNICATION SYSTEMS & NETWORKS (COMSNETS), 2018, : 380 - 387
  • [35] Hot Topic Detection Using Twitter Streaming Data
    Jagic, Teodor
    Brkic, Ljiljana
    [J]. 2020 43RD INTERNATIONAL CONVENTION ON INFORMATION, COMMUNICATION AND ELECTRONIC TECHNOLOGY (MIPRO 2020), 2020, : 1730 - 1735
  • [36] Optimal Filtering for Grid Event Detection from Real-time Synchrophasor Data
    Konakalla, Sai Akhil R.
    de Callafon, Raymond
    [J]. INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE 2016 (ICCS 2016), 2016, 80 : 931 - 940
  • [37] iblueCulture: Data Streaming and Object Detection in a Real-Time Video Streaming Underwater System
    Vlachos, Apostolos
    Bargiota, Eleftheria
    Krinidis, Stelios
    Papadimitriou, Kimon
    Manglis, Angelos
    Fourkiotou, Anastasia
    Tzovaras, Dimitrios
    [J]. REMOTE SENSING, 2024, 16 (13)
  • [38] Real-Time Adaptive Event Detection in Astronomical Data Streams
    Thompson, David R.
    Burke-Spolaor, Sarah
    Deller, Adam T.
    Majid, Walid A.
    Palaniswamy, Divya
    Tingay, Steven J.
    Wagstaff, Kiri L.
    Wayth, Randall B.
    [J]. IEEE INTELLIGENT SYSTEMS, 2014, 29 (01) : 48 - 55
  • [39] Real-time adaptive event detection in astronomical data streams
    [J]. 1600, Institute of Electrical and Electronics Engineers Inc., United States (29):
  • [40] Real-time Object Detection for Streaming Perception
    Yang, Jinrong
    Liu, Songtao
    Li, Zeming
    Li, Xiaoping
    Sun, Jian
    [J]. 2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 5375 - 5385