Real-time Twitter Content Polluter Detection Based on Direct Features

被引:0
|
作者
Chen, Weiling [1 ]
Yeo, Chai Kiat [1 ]
Lau, Chiew Tong [1 ]
Lee, Bu Sung [1 ]
机构
[1] Nanyang Technol Univ, Sch Comp Engn, Singapore, Singapore
关键词
Spam detection; Twitter; Online Social Networks; feature evaluation;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Too many content polluters on social networks make it difficult for users to browse valuable contents. Some research has been done in spam and phishing detection on social networks but these are only a small part of all content polluters. What bother users most are those large amount of repeated low quality advertisements. Hence it is necessary to filter these content polluters to improve users' experiences. Moreover, most of the phishing/spam detection works are done offline and some of the features used take too much time to extract making it impossible for real-time detection. We perform a study on an extensive twitter dataset and present a definition of content polluters. We further propose some novel features and together with other commonly used features in phishing/spam detection, we classify them into two categories - direct features and indirect features. A simple random forest classifier is applied based on our proposed direct features alone for real-time content polluter detection and it achieves a reasonable high accuracy with high F1 values.
引用
收藏
页码:240 / 243
页数:4
相关论文
共 50 条
  • [1] Statistical Features-Based Real-Time Detection of Drifted Twitter Spam
    Chen, Chao
    Wang, Yu
    Zhang, Jun
    Xiang, Yang
    Zhou, Wanlei
    Min, Geyong
    [J]. IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2017, 12 (04) : 914 - 925
  • [2] Real-time Detection of Content Polluters in Partially Observable Twitter Networks
    Nasim, Mehwish
    Nguyen, Andrew
    Lothian, Nick
    Cope, Robert
    Mitchell, Lewis
    [J]. COMPANION PROCEEDINGS OF THE WORLD WIDE WEB CONFERENCE 2018 (WWW 2018), 2018, : 1331 - 1339
  • [3] Real-time trending topics detection and description from Twitter content
    Madani, Amina
    Boussaid, Omar
    Zegour, Djamel Eddine
    [J]. SOCIAL NETWORK ANALYSIS AND MINING, 2015, 5 (01) : 1 - 13
  • [4] GraphJet: Real-Time Content Recommendations at Twitter
    Sharma, Aneesh
    Jiang, Jerry
    Bommannavar, Praveen
    Larson, Brian
    Lin, Jimmy
    [J]. PROCEEDINGS OF THE VLDB ENDOWMENT, 2016, 9 (13): : 1281 - 1292
  • [5] Real-Time Entity-Based Event Detection for Twitter
    McMinn, Andrew J.
    Jose, Joemon M.
    [J]. EXPERIMENTAL IR MEETS MULTILINGUALITY, MULTIMODALITY, AND INTERACTION, 2015, 9283 : 66 - 78
  • [6] Real-time, Scalable, Content-based Twitter Users Recommendation
    Subercaze, Julien
    Gravier, Christophe
    Laforest, Frederique
    [J]. COMPANION PROCEEDINGS OF THE WORLD WIDE WEB CONFERENCE 2018 (WWW 2018), 2018, : 1367 - 1367
  • [7] Real-time, scalable, content-based Twitter users recommendation
    Subercaze, Julien
    Gravier, Christophe
    Laforest, Frederique
    [J]. WEB INTELLIGENCE, 2016, 14 (01) : 17 - 29
  • [8] A Framework for Real-Time Spam Detection in Twitter
    Gupta, Himank
    Jamal, Mohd. Saalim
    Madisetty, Sreekanth
    Desarkar, Maunendra Sankar
    [J]. 2018 10TH INTERNATIONAL CONFERENCE ON COMMUNICATION SYSTEMS & NETWORKS (COMSNETS), 2018, : 380 - 387
  • [9] TweetCred: Real-Time Credibility Assessment of Content on Twitter
    Gupta, Aditi
    Kumaraguru, Ponnurangam
    Castillo, Carlos
    Meier, Patrick
    [J]. SOCIAL INFORMATICS, SOCINFO 2014, 2014, 8851 : 228 - 243
  • [10] Real-time Event Detection in Twitter: A Case Study
    Sani, Ali Momen
    Moeini, Ali
    [J]. 2020 6TH INTERNATIONAL CONFERENCE ON WEB RESEARCH (ICWR), 2020, : 48 - 51