Query or Spam: Detecting fraudulent web requests using stream clustering

被引:0
|
作者
Shakiba, Tahere [1 ]
Zarifzadeh, Sajjad [1 ]
Derhami, Vali [1 ]
机构
[1] Yazd Univ, Dept Elect & Comp Engn, Yazd, Iran
关键词
Search Engin; Spain Query; Botnet; Activity Log;
D O I
暂无
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Today, we are surrounded by a huge amount of data in the Internet, especially in the World Wide Web. Search engines are developed as an important tool to facilitate the access of user to the requested information. However, a spammer tries to exploit these engines and make them work the way he wants by sending spam queries. So, detecting these spam queries which are usually sent by botnets is of great importance. In this paper, we propose a method based on a semi-supervised stream clustering algorithm which analyzes the activity log of users based on their sessions and identifies such spammers. To evaluate the method, we have used k-fold cross validation which resulted in a satisfactory accuracy.
引用
收藏
页码:853 / 859
页数:7
相关论文
共 50 条
  • [21] Detecting IoT Botnet Formation using Data Stream Clustering Algorithms
    Arimatea, Gabriel de Carvalho
    Lima Ribeiro, Admilson de Ribamar
    PROCEEDINGS OF THE 16TH INTERNATIONAL CONFERENCE ON WEB INFORMATION SYSTEMS AND TECHNOLOGIES (WEBIST), 2020, : 395 - 402
  • [22] Query clustering for boosting web page ranking
    BaezaYates, R
    Hurtado, C
    Mendoza, M
    ADVANCES IN WEB INTELLIGENCE, PROCEEDINGS, 2004, 3034 : 164 - 175
  • [23] Fast parallel PageRank technique for detecting spam web pages
    Khare, Nilay
    Dubey, Hema
    INTERNATIONAL JOURNAL OF DATA MINING MODELLING AND MANAGEMENT, 2019, 11 (04) : 350 - 365
  • [24] Spam detector using text clustering
    Sasaki, M
    Shinnou, H
    2005 INTERNATIONAL CONFERENCE ON CYBERWORLDS, PROCEEDINGS, 2005, : 316 - 319
  • [25] Detecting Structured Query Language Injections in Web Microservices Using Machine Learning
    Peralta-Garcia, Edwin
    Quevedo-Monsalbe, Juan
    Tuesta-Monteza, Victor
    Arcila-Diaz, Juan
    INFORMATICS-BASEL, 2024, 11 (02):
  • [26] A new Query Reformulation Approach using Web Result Clustering and User Profile
    Silem, Abd El Heq
    Taktak, Hajer
    Moussa, Faouzi
    KNOWLEDGE-BASED AND INTELLIGENT INFORMATION & ENGINEERING SYSTEMS (KSE 2021), 2021, 192 : 1180 - 1189
  • [27] Detecting Web Spam Based on Novel Features from Web Page Source Code
    Liu, Jiayong
    Su, Yu
    Lv, Shun
    Huang, Cheng
    SECURITY AND COMMUNICATION NETWORKS, 2020, 2020
  • [28] Detecting Fraudulent Transactions using Hybrid Fusion Techniques
    Shinde, Yashowardhan
    Chadha, Akalbir Singh
    Shitole, Ajitkumar
    2021 3RD INTERNATIONAL CONFERENCE ON ELECTRICAL, CONTROL AND INSTRUMENTATION ENGINEERING (IEEE ICECIE'2021), 2021,
  • [29] Detecting link spam using temporal information
    Shen, Guoyang
    Gao, Bin
    Liu, Tie-Yan
    Feng, Guang
    Song, Shiji
    Li, Hang
    ICDM 2006: SIXTH INTERNATIONAL CONFERENCE ON DATA MINING, PROCEEDINGS, 2006, : 1049 - 1053
  • [30] A structural, content-similarity measure for detecting spam documents on the web
    Pera, Maria Soledad
    Yiu-Kai Ng
    INTERNATIONAL JOURNAL OF WEB INFORMATION SYSTEMS, 2009, 5 (04) : 431 - 464