Event detection from real-time Twitter streaming data using community detection algorithm

Cited by: 0
Authors
Jagrati Singh
Digvijay Pandey
Anil Kumar Singh
Affiliations
[1] Motilal Nehru National Institute of Technology, Computer Science & Engineering
[2] Electronics & Communication Engineering, Department of Technical Education (Government of U.P.)
Source
Multimedia Tools and Applications, 2023, 83 (8)
Keywords
Twitter stream; Clustering; Supervised; Unsupervised technique; Semantic correlation; Keyword co-occurrence; Topic modeling
DOI
Not available
Chinese Library Classification (CLC) number
Subject classification code
Abstract
The increasing popularity of social media services has led to more and more people using Twitter. Millions of tweets, many of them noisy, propagate on the Internet every day. Twitter acts as a source of information for events and breaking news. However, it is very challenging for anyone to manually extract useful information about important events from the endless stream of tweets. It is therefore desirable to automate the whole process of event detection, so that important events can be identified in real time from a stream of tweets, as soon as possible after they actually happen. Most existing approaches focus on "What happened". To define an event, answers to "When" and "Where" are also required, and in handling emergency events the location and time parameters play a very important role. This article proposes a faster location-based event detection approach that does not compromise accuracy and that automatically extracts separate clusters for local and global events from real-time streaming data. The proposed approach consists of four major steps. In the first step, a new dynamic weighting scheme based on TF-IDF, named Conditional Term Frequency-Average Inverse Window Frequency (CTF-AIWF), is proposed to capture emerging keywords from the temporal dynamics of the data. In the second step, a new clustering algorithm named Edge Significance based Louvain Algorithm (ESBLA) is proposed to group keywords belonging to the same event. This clustering improves run-time performance by up to 50% while keeping the quality of the results (F1-score) comparable to the baseline models. In the third step, a new content-based location detection technique is proposed to detect the location of the event. This technique handles issues typical of microblogging data, such as informal text, abbreviated forms, and misspelled keywords. Finally, Google Maps is used to visualize the events at the locations where they happen, which speeds up decision-making about the detected events. For the experiments, tweets are collected in real time and stored in a MongoDB NoSQL database for processing.
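The first two steps described in the abstract (weighting emerging keywords across time windows, then grouping keywords that belong to the same event) can be illustrated with a minimal Python sketch. The abstract gives neither the CTF-AIWF formula nor the ESBLA edge-significance measure, so the sketch uses plain stand-ins that are assumptions of this example: a TF-IDF-style score in which time windows play the role of documents, and a keyword co-occurrence graph whose weak edges are pruned by a threshold and whose connected components are taken as event groups (not Louvain). The function names `emerging_keywords` and `group_keywords`, the parameters `top_k` and `min_cooccurrence`, and the sample tweets are all hypothetical.

```python
import math
from collections import Counter
from itertools import combinations


def emerging_keywords(windows, top_k=4):
    """Rank terms of the latest window by a windowed TF-IDF-style score.

    windows: list of time windows, each a list of tokenised tweets.
    score(term) = frequency in latest window * log(N / windows containing term)
    This is a simplified stand-in, not the paper's CTF-AIWF scheme.
    """
    n = len(windows)
    vocab_per_window = [{t for tweet in w for t in tweet} for w in windows]
    tf = Counter(t for tweet in windows[-1] for t in tweet)
    scores = {}
    for term, freq in tf.items():
        window_freq = sum(1 for vocab in vocab_per_window if term in vocab)
        scores[term] = freq * math.log(n / window_freq)
    # Sort by descending score, breaking ties alphabetically for determinism.
    ranked = sorted(scores, key=lambda t: (-scores[t], t))
    return [t for t in ranked[:top_k] if scores[t] > 0]


def group_keywords(tweets, keywords, min_cooccurrence=2):
    """Group keywords that co-occur in at least `min_cooccurrence` tweets.

    Builds a co-occurrence graph, drops weak edges, and returns connected
    components (a stand-in for community detection such as Louvain).
    """
    keywords = set(keywords)
    edge_weight = Counter()
    for tweet in tweets:
        present = sorted(keywords & set(tweet))
        for a, b in combinations(present, 2):
            edge_weight[(a, b)] += 1

    adjacency = {k: set() for k in keywords}
    for (a, b), weight in edge_weight.items():
        if weight >= min_cooccurrence:
            adjacency[a].add(b)
            adjacency[b].add(a)

    seen, groups = set(), []
    for start in sorted(keywords):
        if start in seen:
            continue
        stack, component = [start], set()
        while stack:  # depth-first traversal of one component
            node = stack.pop()
            if node in component:
                continue
            component.add(node)
            stack.extend(adjacency[node] - component)
        seen |= component
        groups.append(sorted(component))
    return groups


if __name__ == "__main__":
    # Hypothetical tokenised tweets in three consecutive time windows.
    sample_windows = [
        [["good", "morning"], ["traffic", "morning"]],
        [["good", "coffee"], ["morning", "traffic"]],
        [["earthquake", "delhi", "morning"], ["earthquake", "delhi", "tremor"],
         ["flood", "chennai", "good"], ["flood", "chennai", "rain"]],
    ]
    keywords = emerging_keywords(sample_windows)
    print(keywords)
    print(group_keywords(sample_windows[-1], keywords))
```

In this toy data, terms that appear in every window ("morning", "good") score zero and are dropped, while terms that first appear in the latest window are kept and then split into one group per event. A real pipeline would replace both stand-ins with the weighting and clustering defined in the paper.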
Pages: 23437-23464
Page count: 27
Related papers
  • [1] Event detection from real-time twitter streaming data using community detection algorithm
    Singh, Jagrati
    Pandey, Digvijay
    Singh, Anil Kumar
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 83 (8) : 23437 - 23464