Query or Spam: Detecting fraudulent web requests using stream clustering

被引:0
|
作者
Shakiba, Tahere [1 ]
Zarifzadeh, Sajjad [1 ]
Derhami, Vali [1 ]
机构
[1] Yazd Univ, Dept Elect & Comp Engn, Yazd, Iran
关键词
Search Engin; Spain Query; Botnet; Activity Log;
D O I
暂无
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Today, we are surrounded by a huge amount of data in the Internet, especially in the World Wide Web. Search engines are developed as an important tool to facilitate the access of user to the requested information. However, a spammer tries to exploit these engines and make them work the way he wants by sending spam queries. So, detecting these spam queries which are usually sent by botnets is of great importance. In this paper, we propose a method based on a semi-supervised stream clustering algorithm which analyzes the activity log of users based on their sessions and identifies such spammers. To evaluate the method, we have used k-fold cross validation which resulted in a satisfactory accuracy.
引用
收藏
页码:853 / 859
页数:7
相关论文
共 50 条
  • [1] Spam query detection using stream clustering
    Shakiba, Tahere
    Zarifzadeh, Sajjad
    Derhami, Vali
    WORLD WIDE WEB-INTERNET AND WEB INFORMATION SYSTEMS, 2018, 21 (02): : 557 - 572
  • [2] Spam query detection using stream clustering
    Tahere Shakiba
    Sajjad Zarifzadeh
    Vali Derhami
    World Wide Web, 2018, 21 : 557 - 572
  • [3] Detecting Spam Tweets In Twitter Using a Data Stream Clustering Algorithm
    Eshraqi, Nasim
    Jalali, Mehrdad
    Moattar, Mohammad Hossein
    SECOND INTERNATIONAL CONGRESS ON TECHNOLOGY, COMMUNICATION AND KNOWLEDGE (ICTCK 2015), 2015, : 347 - 351
  • [4] Detecting Fraudulent Words: Using PFCM Clustering
    Singhal, Ritika
    Deepika, N.
    2016 IEEE INTERNATIONAL CONFERENCE ON RECENT TRENDS IN ELECTRONICS, INFORMATION & COMMUNICATION TECHNOLOGY (RTEICT), 2016, : 2015 - 2017
  • [5] Detecting Web Spam using a Recovering Web Links System
    Araujo, Lourdes
    Martinez-Romo, Juan
    PROCESAMIENTO DEL LENGUAJE NATURAL, 2009, (42): : 39 - 46
  • [6] Detecting Spam in Web Corpora
    Baisa, Vit
    Suchomel, Vit
    RASLAN 2012: RECENT ADVANCES IN SLAVONIC NATURAL LANGUAGE PROCESSING, 2012, : 69 - 76
  • [7] Detecting Malicious Web Requests Using an Enhanced TextCNN
    Yu, Lian
    Chen, Lihao
    Dong, Jingtao
    Li, Mengyuan
    Liu, Lijun
    Zhao, Bai
    Zhang, Chen
    2020 IEEE 44TH ANNUAL COMPUTERS, SOFTWARE, AND APPLICATIONS CONFERENCE (COMPSAC 2020), 2020, : 768 - 777
  • [8] Crafting and Detecting Adversarial Web Requests
    Gong, Xinyu
    Zhu, Huidi
    Deng, Ruofan
    Wang, Fu
    Lu, Jialiang
    4TH IEEE INTERNATIONAL CONFERENCE ON SMART CLOUD (SMARTCLOUD 2019) / 3RD INTERNATIONAL SYMPOSIUM ON REINFORCEMENT LEARNING (ISRL 2019), 2019, : 237 - 242
  • [9] Effectively Detecting Content Spam on the Web Using Topical Diversity Measures
    Dong, Cailing
    Zhou, Bin
    2012 IEEE/WIC/ACM INTERNATIONAL CONFERENCE ON WEB INTELLIGENCE AND INTELLIGENT AGENT TECHNOLOGY (WI-IAT 2012), VOL 1, 2012, : 266 - 273
  • [10] Detecting Changes in Stream Query Results
    Ghayoori, Majid
    Salmani, Khosro
    Haghjoo, Mostafa S.
    NEW CHALLENGES FOR INTELLIGENT INFORMATION AND DATABASE SYSTEMS, 2011, 351 : 13 - 24