Threshold and Associative Based Classification for Social Spam Profile Detection on Twitter

被引:8
|
作者
Hua, Willian [1 ]
Zhang, Yanqing [2 ]
机构
[1] Coll New Jersey, Dept Comp Sci, Ewing, NJ 08628 USA
[2] Georgia State Univ, Dept Comp Sci, Atlanta, GA 30302 USA
基金
美国国家科学基金会;
关键词
Social Spam Profile (SSP); Online Social Network (OSN); Content-Based; Behavioral-Based; Graph-Based;
D O I
10.1109/SKG.2013.15
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Online Social Networks (OSNs) such as Facebook and Twitter are the fastest growing online entities. Because they do not require much authentication for a user to create an account, they are susceptible to social spam attacks. These low-quality, unsolicited, and unwanted bulk messages commonly originate from Social Spam Profiles (SSPs). Spam messages may contain harmful virus links that infect users and propagate throughout OSNs. Twitter, a fast growing OSN site, has a large number of SSPs that have the potential to harm legitimate users. In this paper, a fast and scalable approach is proposed to detect SSPs on Twitter using content, behavioral, and graph-based data. After various investigations, a threshold and associative based classifier is created. Then, the new classifier is compared with the supervised machine learning algorithm, SVM, and two other existing algorithms in terms of accuracy, precision, sensitivity, and specificity. The new classifier with an accuracy of 79.26% is better than SVM with an accuracy of 69.32%. In summary, SSPs are younger, have more statuses, more tweets in succession, and contain keywords that differentiate a spam profile from a non-spam profile.
引用
收藏
页码:113 / 120
页数:8
相关论文
共 50 条
  • [1] Twitter spam account detection based on clustering and classification methods
    Kayode Sakariyah Adewole
    Tao Han
    Wanqing Wu
    Houbing Song
    Arun Kumar Sangaiah
    [J]. The Journal of Supercomputing, 2020, 76 : 4802 - 4837
  • [2] Twitter spam account detection based on clustering and classification methods
    Adewole, Kayode Sakariyah
    Hang, Tao
    Wu, Wanqing
    Songs, Houbing
    Sangaiah, Arun Kumar
    [J]. JOURNAL OF SUPERCOMPUTING, 2020, 76 (07): : 4802 - 4837
  • [3] Lazy associative classification for content-based spam detection
    Veloso, Adriano
    Meira, Wagner, Jr.
    [J]. LA-WEB 06: FOURTH LATIN AMERICAN WEB CONGRESS, PROCEEDINGS, 2006, : 154 - +
  • [4] Social-spam Profile Detection based on Content Classification and User Behavior
    Thi-Hong Vuong
    Van-Hien Tran
    Minh-Duc Nguyen
    Cam-Van Thi Nguyen
    Thanh-Huyen Pham
    Mai-Vu Tran
    [J]. 2016 EIGHTH INTERNATIONAL CONFERENCE ON KNOWLEDGE AND SYSTEMS ENGINEERING (KSE), 2016, : 264 - 267
  • [5] Sentiment Based Twitter Spam Detection
    Perveen, Nasira
    Missen, Malik M. Saad
    Rasool, Qaisar
    Akhtar, Nadeem
    [J]. INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2016, 7 (07) : 568 - 573
  • [6] Adaptive Classification for Spam Detection on Twitter with Specific Data
    Dangkesee, Thayakorn
    Puntheeranurak, Sutheera
    [J]. 2017 21ST INTERNATIONAL COMPUTER SCIENCE AND ENGINEERING CONFERENCE (ICSEC 2017), 2017, : 243 - 246
  • [7] Spam Profile Detection in Social Networks Based on Public Features
    Al-Zoubi, Ala' M.
    Alqatawna, Ja'far
    Faris, Hossam
    [J]. 2017 8TH INTERNATIONAL CONFERENCE ON INFORMATION AND COMMUNICATION SYSTEMS (ICICS), 2017, : 130 - 135
  • [8] Tweet and Account Based Spam Detection on Twitter
    Gungor, Kubra Nur
    Erdem, O. Ayhan
    Dogru, Ibrahim Alper
    [J]. ARTIFICIAL INTELLIGENCE AND APPLIED MATHEMATICS IN ENGINEERING PROBLEMS, 2020, 43 : 898 - 905
  • [9] A hybrid classification method for Twitter spam detection based on differential evolution and random forest
    Bazzaz Abkenar, Sepideh
    Mahdipour, Ebrahim
    Jameii, Seyed Mahdi
    Haghi Kashani, Mostafa
    [J]. CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2021, 33 (21):
  • [10] Boosting Social Spam Detection via Attention Mechanisms on Twitter
    Shen, Hua
    Liu, Xinyue
    Zhang, Xianchao
    [J]. ELECTRONICS, 2022, 11 (07)