Query or Spam: Detecting fraudulent web requests using stream clustering

被引:0
|
作者
Shakiba, Tahere [1 ]
Zarifzadeh, Sajjad [1 ]
Derhami, Vali [1 ]
机构
[1] Yazd Univ, Dept Elect & Comp Engn, Yazd, Iran
关键词
Search Engin; Spain Query; Botnet; Activity Log;
D O I
暂无
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Today, we are surrounded by a huge amount of data in the Internet, especially in the World Wide Web. Search engines are developed as an important tool to facilitate the access of user to the requested information. However, a spammer tries to exploit these engines and make them work the way he wants by sending spam queries. So, detecting these spam queries which are usually sent by botnets is of great importance. In this paper, we propose a method based on a semi-supervised stream clustering algorithm which analyzes the activity log of users based on their sessions and identifies such spammers. To evaluate the method, we have used k-fold cross validation which resulted in a satisfactory accuracy.
引用
收藏
页码:853 / 859
页数:7
相关论文
共 50 条
  • [41] A Density Based Clustering Approach to Distinguish Between Web Robot and Human Requests to a Web Server
    Zabihi, Mahdieh
    Jahan, Majid Vafaei
    Hamidzadeh, Javad
    ISECURE-ISC INTERNATIONAL JOURNAL OF INFORMATION SECURITY, 2014, 6 (01): : 77 - 89
  • [42] Detecting the Spam Review Using Tri-training
    Ji Chengzhang
    Kang, Dae-Ki
    2015 17TH INTERNATIONAL CONFERENCE ON ADVANCED COMMUNICATION TECHNOLOGY (ICACT), 2015, : 374 - 377
  • [43] Detecting Streaming of Twitter Spam Using Hybrid Method
    N. Senthil Murugan
    G. Usha Devi
    Wireless Personal Communications, 2018, 103 : 1353 - 1374
  • [44] Detecting Streaming of Twitter Spam Using Hybrid Method
    Murugan, N. Senthil
    Devi, G. Usha
    WIRELESS PERSONAL COMMUNICATIONS, 2018, 103 (02) : 1353 - 1374
  • [45] Detection using clustering query results
    Goharian, Nazli
    Platt, Alana
    INTELLIGENCE AND SECURITY INFORMATICS, PROCEEDINGS, 2006, 3975 : 671 - 673
  • [46] Retrieving web search results using Max-Max soft clustering for Hindi query
    Jain, Amita
    Tayal, Devendra K.
    Yadav, Sudesh
    INTERNATIONAL JOURNAL OF SYSTEM ASSURANCE ENGINEERING AND MANAGEMENT, 2016, 7 (01) : 70 - 81
  • [47] Query clustering using user logs
    Wen, JR
    Nie, JY
    Zhang, HJ
    ACM TRANSACTIONS ON INFORMATION SYSTEMS, 2002, 20 (01) : 59 - 81
  • [48] Clustering Algorithm of Web Click Stream Frequency Pattern
    Li Yang
    Zhang Liang
    2011 INTERNATIONAL CONFERENCE ON COMPUTERS, COMMUNICATIONS, CONTROL AND AUTOMATION (CCCA 2011), VOL III, 2010, : 388 - 391
  • [49] Spam Detection Using Clustering-Based SVM
    Pandya, Darshit
    PROCEEDINGS OF THE 2019 2ND INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND MACHINE INTELLIGENCE (MLMI 2019), 2019, : 12 - 15
  • [50] Dynamic classifier selection using clustering for spam detection
    Saeedian, Mehrnoush Famil
    Beigy, Hamid
    2009 IEEE SYMPOSIUM ON COMPUTATIONAL INTELLIGENCE AND DATA MINING, 2009, : 84 - 88