Statistical Detection of Online Drifting Twitter Spam [Invited Paper]

Cited by: 44
Authors
Liu, Shigang [1 ]
Zhang, Jun [1 ]
Xiang, Yang [1 ]
Affiliation
[1] Deakin Univ, Sch Informat Technol, 221 Burwood Hwy, Burwood, Vic 3125, Australia
Keywords
Twitter spam detection; social network security; security data analytics
DOI
10.1145/2897845.2897928
Chinese Library Classification
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
Spam has become a critical problem in online social networks. This paper focuses on Twitter spam detection. Recent work applies machine learning techniques that make use of the statistical features of tweets. We observe that existing machine-learning-based detection methods suffer from Twitter spam drift, i.e., the statistical properties of spam tweets vary over time. One effective way to counter this problem is to train a new Twitter spam classifier every day. However, this approach is challenged by a small and imbalanced training set, because labelling spam samples is time-consuming. This paper proposes a new method to address this challenge. The method employs two new techniques, fuzzy-based redistribution and asymmetric sampling. We develop a fuzzy-based information decomposition technique to redistribute the spam class and generate more spam samples. Moreover, an asymmetric sampling technique is proposed to rebalance the sizes of the spam and non-spam samples in the training data. Finally, we apply an ensemble technique to combine the spam classifiers trained on two different training sets. A number of experiments are performed on a real-world 10-day ground-truth dataset to evaluate the new method. Experimental results show that the new method can significantly improve detection performance for drifting Twitter spam.
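The rebalancing and ensemble steps mentioned in the abstract can be illustrated with a minimal sketch. This is not the authors' method: the abstract does not specify the fuzzy-based information decomposition or the exact sampling scheme, so plain random oversampling and undersampling stand in for them here, and simple score averaging stands in for the ensemble. The names `rebalance`, `ensemble_score`, and `target_size` are hypothetical.

```python
import random


def rebalance(spam, non_spam, target_size, seed=0):
    """Return (spam_out, non_spam_out) with roughly equal class sizes.

    Assumption: generic random resampling, used only as a stand-in for the
    paper's fuzzy-based redistribution and asymmetric sampling.
    """
    rng = random.Random(seed)
    # Oversample the minority (spam) class by drawing with replacement,
    # so duplicates are expected when target_size > len(spam).
    spam_out = [rng.choice(spam) for _ in range(target_size)]
    # Undersample the majority (non-spam) class without replacement;
    # cap the draw at the number of available samples.
    non_spam_out = rng.sample(non_spam, min(target_size, len(non_spam)))
    return spam_out, non_spam_out


def ensemble_score(scores):
    """Average the spam scores produced by several classifiers.

    Assumption: unweighted averaging; the abstract does not say how the
    classifiers are combined.
    """
    return sum(scores) / len(scores)


if __name__ == "__main__":
    # Toy data: 5 labelled spam tweets versus 100 non-spam tweets.
    spam = [("spam", i) for i in range(5)]
    non_spam = [("ham", i) for i in range(100)]
    s, h = rebalance(spam, non_spam, target_size=20)
    print(len(s), len(h))
```

In this sketch, two classifiers trained on two differently resampled training sets would each emit a spam score for a tweet, and `ensemble_score` would combine them into one decision score.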
Pages: 1-10
Page count: 10
Related papers
50 in total
  • [1] Statistical Twitter Spam Detection Demystified: Performance, Stability and Scalability
    Lin, Guanjun
    Sun, Nan
    Nepal, Surya
    Zhang, Jun
    Xiang, Yang
    Hassan, Houcine
    [J]. IEEE ACCESS, 2017, 5 : 11142 - 11154
  • [2] Spam Detection on Twitter : A Survey
    Kaur, Prabhjot
    Singhal, Anuhha
    Kaur, Jasleen
    [J]. PROCEEDINGS OF THE 10TH INDIACOM - 2016 3RD INTERNATIONAL CONFERENCE ON COMPUTING FOR SUSTAINABLE GLOBAL DEVELOPMENT, 2016, : 2570 - 2573
  • [3] Online Detection and Prevention of Phishing Attacks (Invited Paper)
    Chen, Juan
    Guo, Chuanxiong
    [J]. 2006 FIRST INTERNATIONAL CONFERENCE ON COMMUNICATIONS AND NETWORKING IN CHINA, 2006,
  • [4] A generic statistical approach for spam detection in Online Social Networks
    Ahmed, Faraz
    Abulaish, Muhammad
    [J]. COMPUTER COMMUNICATIONS, 2013, 36 (10-11) : 1120 - 1129
  • [5] A Survey On Spam URLs Detection In Twitter
    Daffa, Wafaa
    Bamasag, Omaimah
    AlMansour, Amal
    [J]. 2018 1ST INTERNATIONAL CONFERENCE ON COMPUTER APPLICATIONS & INFORMATION SECURITY (ICCAIS' 2018), 2018,
  • [6] A Hybrid Approach for Spam Detection for Twitter
    Mateen, Malik
    Aleem, Muhammad
    Iqbal, Muhammad Azhar
    Islam, Muhammad Arshad
    [J]. PROCEEDINGS OF 2017 14TH INTERNATIONAL BHURBAN CONFERENCE ON APPLIED SCIENCES AND TECHNOLOGY (IBCAST), 2017, : 466 - 471
  • [7] State of the Art on Twitter Spam Detection
    Borse, Dipalee
    Borse, Swati
    [J]. Smart Innovation, Systems and Technologies, 2022, 303 SIST : 486 - 496
  • [8] "TwitterSpamDetector" A Spam Detection Framework for Twitter
    Kabakus, Abdullah Talha
    Kara, Resul
    [J]. INTERNATIONAL JOURNAL OF KNOWLEDGE AND SYSTEMS SCIENCE, 2019, 10 (03) : 1 - 14
  • [9] Sentiment Based Twitter Spam Detection
    Perveen, Nasira
    Missen, Malik M. Saad
    Rasool, Qaisar
    Akhtar, Nadeem
    [J]. INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2016, 7 (07) : 568 - 573
  • [10] Statistical Features-Based Real-Time Detection of Drifted Twitter Spam
    Chen, Chao
    Wang, Yu
    Zhang, Jun
    Xiang, Yang
    Zhou, Wanlei
    Min, Geyong
    [J]. IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2017, 12 (04) : 914 - 925