Topical Anomaly Detection From Twitter Stream

被引:0
|
作者
Anantharam, Pramod [1 ]
Thirunarayan, Krishnaprasad [1 ]
Sheth, Amit [1 ]
机构
[1] Wright State Univ, Knoesis Ctr, Dayton, OH 45435 USA
关键词
Anomaly detection; spam and off-topic content detection; binary classification; twitter stream analysis;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper, we spot topically anomalous tweets in twitter streams by analyzing the content of the document pointed to by the URLs in the tweets in preference to their textual content. Existing approaches to anomaly detection ignore such URLs thereby missing opportunities to detect off-topic tweets. Specifically, we determine the divergence of claimed topic of a tweet as reflected by the hashtags and the actual topic as reflected by the referenced document content. Our approach avoids the need for labeled samples by selecting documents from reliable sources gleaned from the URLs present in the tweets. These documents are used for comparison against documents associated with unknown URLs in incoming tweets improving reliability, scalability and adaptability to rapidly changing topics. We evaluate our approach on three events and show that it can find topical inconsistencies not detectable by existing approaches.
引用
收藏
页码:11 / 14
页数:4
相关论文
共 50 条
  • [1] Topical Event Detection on Twitter
    Cui, Lishan
    Zhang, Xiuzhen
    Zhou, Xiangmin
    Salim, Flora
    [J]. DATABASES THEORY AND APPLICATIONS, (ADC 2016), 2016, 9877 : 257 - 268
  • [2] An Anomaly Detection Framework for Twitter Data
    Kumar, Sandeep
    Khan, Muhammad Badruddin
    Abul Hasanat, Mozaherul Hoque
    Saudagar, Abdul Khader Jilani
    AlTameem, Abdullah
    AlKhathami, Mohammed
    [J]. APPLIED SCIENCES-BASEL, 2022, 12 (21):
  • [3] Gulf Stream Detection from SAR Doppler Anomaly
    Biron, Katerina
    Van Wychen, Wesley
    Vachon, Paris W.
    [J]. CANADIAN JOURNAL OF REMOTE SENSING, 2018, 44 (04) : 311 - 320
  • [4] Real-Time Detection of Traffic From Twitter Stream Analysis
    D'Andrea, Eleonora
    Ducange, Pietro
    Lazzerini, Beatrice
    Marcelloni, Francesco
    [J]. IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2015, 16 (04) : 2269 - 2283
  • [5] A survey on real-time event detection from the Twitter data stream
    Hasan, Mahmud
    Orgun, Mehmet A.
    Schwitter, Rolf
    [J]. JOURNAL OF INFORMATION SCIENCE, 2018, 44 (04) : 443 - 463
  • [6] New Word Detection and Tagging on Chinese Twitter Stream
    Liang, Yuzhi
    Yin, Pengcheng
    Yiu, S. M.
    [J]. TRANSACTIONS ON LARGE-SCALE DATA- AND KNOWLEDGE-CENTERED SYSTEMS XXXII, 2017, 10420 : 69 - 90
  • [7] Twitter Stream Event Detection for Critical Situation Management
    Bicchierai, Irene
    Brancati, Francesco
    Itria, Massimiliano L.
    Giunta, Gabriele
    Magaldi, Massimo
    [J]. INTELLIGENT ENVIRONMENTS 2018, 2018, 23 : 216 - 225
  • [8] Twitter spammer detection using data stream clustering
    Miller, Zachary
    Dickinson, Brian
    Deitrick, William
    Hu, Wei
    Wang, Alex Hai
    [J]. INFORMATION SCIENCES, 2014, 260 : 64 - 73
  • [9] A Novel Stream Clustering Framework for Spam Detection in Twitter
    Tajalizadeh, Hadi
    Boostani, Reza
    [J]. IEEE TRANSACTIONS ON COMPUTATIONAL SOCIAL SYSTEMS, 2019, 6 (03) : 525 - 534
  • [10] Automatic Unsupervised Polarity Detection on a Twitter Data Stream
    Terrana, Diego
    Augello, Agnese
    Pilato, Giovanni
    [J]. 2014 IEEE INTERNATIONAL CONFERENCE ON SEMANTIC COMPUTING (ICSC), 2014, : 128 - 134