An enhanced graph-based semi-supervised learning algorithm to detect fake users on Twitter

被引:1
|
作者
M. BalaAnand
N. Karthikeyan
S. Karthik
R. Varatharajan
Gunasekaran Manogaran
C. B. Sivaparthipan
机构
[1] V.R.S. College of Engineering & Technology,Department of Computer Science & Engineering
[2] SNS College of Engineering,Department of MCA
[3] SNS College of Technology,Department of Computer Science & Engineering
[4] Anna University,Department of Computer Science & Engineering
[5] University of California,Department of Computer Science
来源
关键词
Fake user detection; Identity deception; Sock puppets; Semi-supervised learning algorithm;
D O I
暂无
中图分类号
学科分类号
摘要
Over the the past decade, social networking services (SNS) have proliferated on the web. The nature of such sites makes identity deception easy, providing a fast means for creating and managing identities, and then connecting with and deceiving others. Fake users are those accounts specifically created for purposes such as stalking or abuse of another user, for slander, or for marketing. The current system for detecting deception depends on behavioral, non-behavioral and user-generated content (UGC) information gathered from users. Although these methods have high detection accuracy, they cannot be implemented in databases with massive volumes of data. To address this issue, this paper proposes an enhanced graph-based semi-supervised learning algorithm (EGSLA) to detect fake users from a large volume of Twitter data. The proposed method encompasses four modules: data collection, feature extraction, classification and decision making. Data collected from Twitter using Scrapy is utilized for the evaluation. The performance of the proposed algorithm is tested with existing game theory, k-nearest neighbor (KNN), support vector machine (SVM) and decision tree techniques. The results show that the proposed EGSLA algorithm achieves 90.3% accuracy in spotting fake users.
引用
收藏
页码:6085 / 6105
页数:20
相关论文
共 50 条
  • [1] An enhanced graph-based semi-supervised learning algorithm to detect fake users on Twitter
    BalaAnand, M.
    Karthikeyan, N.
    Karthik, S.
    Varatharajan, R.
    Manogaran, Gunasekaran
    Sivaparthipan, C. B.
    [J]. JOURNAL OF SUPERCOMPUTING, 2019, 75 (09): : 6085 - 6105
  • [2] Graph-based semi-supervised learning
    Zhang, Changshui
    Wang, Fei
    [J]. ARTIFICIAL LIFE AND ROBOTICS, 2009, 14 (04) : 445 - 448
  • [3] Graph-based semi-supervised learning
    Changshui Zhang
    Fei Wang
    [J]. Artificial Life and Robotics, 2009, 14 (4) : 445 - 448
  • [4] Graph-based semi-supervised learning
    Subramanya, Amarnag
    Talukdar, Partha Pratim
    [J]. Synthesis Lectures on Artificial Intelligence and Machine Learning, 2014, 29 : 1 - 126
  • [5] Fairness in graph-based semi-supervised learning
    Tao Zhang
    Tianqing Zhu
    Mengde Han
    Fengwen Chen
    Jing Li
    Wanlei Zhou
    Philip S Yu
    [J]. Knowledge and Information Systems, 2023, 65 : 543 - 570
  • [6] On Consistency of Graph-based Semi-supervised Learning
    Du, Chengan
    Zhao, Yunpeng
    Wang, Feng
    [J]. 2019 39TH IEEE INTERNATIONAL CONFERENCE ON DISTRIBUTED COMPUTING SYSTEMS (ICDCS 2019), 2019, : 483 - 491
  • [7] Fairness in graph-based semi-supervised learning
    Zhang, Tao
    Zhu, Tianqing
    Han, Mengde
    Chen, Fengwen
    Li, Jing
    Zhou, Wanlei
    Yu, Philip S.
    [J]. KNOWLEDGE AND INFORMATION SYSTEMS, 2023, 65 (02) : 543 - 570
  • [8] Graph-based semi-supervised learning: A review
    Chong, Yanwen
    Ding, Yun
    Yan, Qing
    Pan, Shaoming
    [J]. NEUROCOMPUTING, 2020, 408 (408) : 216 - 230
  • [9] Fractional Graph-based Semi-Supervised Learning
    de Nigris, S.
    Bautista, E.
    Abry, P.
    Avrachenkov, K.
    Gonclaves, P.
    [J]. 2017 25TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2017, : 356 - 360
  • [10] A graph-based semi-supervised learning algorithm for web page classification
    Liu, Rong
    Zhou, Jianzhong
    Liu, Ming
    [J]. ISDA 2006: SIXTH INTERNATIONAL CONFERENCE ON INTELLIGENT SYSTEMS DESIGN AND APPLICATIONS, VOL 2, 2006, : 856 - +