A Self-Supervised Approach to Comment Spam Detection Based on Content Analysis

被引:8
|
作者
Bhattarai, A. [1 ]
Dasgupta, D. [1 ]
机构
[1] Univ Memphis, Memphis, TN 38152 USA
关键词
Self-Supervised Learning; Spam Feature Extraction; Spam Filtering; Text Analysis; Text Characteristics;
D O I
10.4018/jisp.2011010102
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
This paper studies the problems and threats posed by a type of spam in the blogosphere, called blog comment spam. It explores the challenges introduced by comment spam, generalizing the analysis substantially to any other short text type spam. The authors analyze different high-level features of spam and legitimate comments based on the content of blog postings. The authors use these features to cluster data separately for each feature using K-Means clustering algorithm. The authors also use self-supervised learning, which could classify spam and legitimate comments automatically. Compared with existing solutions, this approach demonstrates more flexibility and adaptability to the environment, as it requires minimal human intervention. The preliminary evaluation of the proposed spam detection system shows promising results.
引用
收藏
页码:14 / 32
页数:19
相关论文
共 50 条
  • [1] Hierarchical Detection of Network Anomalies : A Self-Supervised Learning Approach
    Kye, Hyoseon
    Kim, Miru
    Kwon, Minhae
    [J]. IEEE SIGNAL PROCESSING LETTERS, 2022, 29 : 1908 - 1912
  • [2] SMT: Self-supervised Approach for Multiple Animal Detection and Tracking
    Moosa, Muhammad
    Yamin, Muhammad Mudassar
    Hashmi, Ehtesham
    Beghdadi, Azeddine
    Imran, Ali Shariq
    Cheikh, Faouzi Alaya
    Ullah, Mohib
    [J]. ARTIFICIAL INTELLIGENCE APPLICATIONS AND INNOVATIONS, PT II, AIAI 2024, 2024, 712 : 307 - 317
  • [3] Hierarchical Detection of Network Anomalies : A Self-Supervised Learning Approach
    Kye, Hyoseon
    Kim, Miru
    Kwon, Minhae
    [J]. IEEE Signal Processing Letters, 2022, 29 : 1908 - 1912
  • [4] Self-supervised learning for outlier detection
    Diers, Jan
    Pigorsch, Christian
    [J]. STAT, 2021, 10 (01):
  • [5] Self-Supervised Bug Detection and Repair
    Allamanis, Miltiadis
    Jackson-Flux, Henry
    Brockschmidt, Marc
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
  • [6] Self-Supervised Multisensor Change Detection
    Saha, Sudipan
    Ebel, Patrick
    Zhu, Xiao Xiang
    [J]. IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
  • [7] A Self-supervised Approach for Adversarial Robustness
    Naseer, Muzammal
    Khan, Salman
    Hayat, Munawar
    Khan, Fahad Shahbaz
    Porikli, Fatih
    [J]. 2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, : 259 - 268
  • [8] Classification-Based Self-Supervised Learning for Anomaly Detection
    Li, Honghu
    Zhu, Yuesheng
    He, Ying
    [J]. THIRTEENTH INTERNATIONAL CONFERENCE ON DIGITAL IMAGE PROCESSING (ICDIP 2021), 2021, 11878
  • [9] Visual defect obfuscation based self-supervised anomaly detection
    Park, YeongHyeon
    Kang, Sungho
    Kim, Myung Jin
    Lee, Yeonho
    Kim, Hyeong Seok
    Yi, Juneho
    [J]. SCIENTIFIC REPORTS, 2024, 14 (01):
  • [10] Machine Anomalous Sound Detection Based on Self-Supervised Classification
    Wang, Shuxian
    Du, Jun
    Wang, Yajian
    [J]. PROCEEDINGS OF 2022 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2022, : 449 - 454