Detecting Arabic Spammers and Content Polluters on Twitter

Authors
El-Mawass, Nour [1]
Alaboodi, Saad [1]
Affiliations
[1] King Saud Univ, Coll Comp & Informat Sci, Riyadh, Saudi Arabia
Keywords
Online Social Networks; Social Spam Detection; Machine Learning; Supervised Classification; Twitter; Arabic Spam
DOI
Not available
Chinese Library Classification (CLC) number
TP301 [Theory, Methods]
Discipline classification code
081202
Abstract
Spam is thriving on Arabic Twitter. With a large online population, mounting political unrest, and an undersized and unspecialized response effort, the current state of Arabic online social networks (OSNs) offers a perfect target for the spam industry, bringing both abuse and manipulation to the scene. The result is a ubiquitous spam presence that redefines the signal-to-noise ratio and makes spam a de facto component of online social platforms. English spam on online social networks has been heavily studied in the literature. To date, however, social spam in other languages has been largely ignored. Our own analysis of spam content on Arabic trending hashtags in Saudi Arabia estimates that spam accounts for about three quarters of the total generated content. This alarming rate, backed by independent concurrent estimates, makes the development of adaptive spam detection techniques a very real and pressing need. In this study, we present a first attempt at detecting accounts that promote spam and content pollution on Arabic Twitter. Using a large crawled dataset of more than 23 million Arabic tweets and a manually labeled sample of more than 5,000 tweets, we analyze the spam content on Saudi Twitter and assess the performance of previous spam detection features on our recently gathered dataset. We also adapt the previously proposed features to respond to spammers' evasion techniques, and use these features to build a new, highly accurate, data-driven detection system.
Pages: 53-58
Page count: 6