Detecting and Characterizing Web Bot Traffic in a Large E-commerce Marketplace

被引:7
|
作者
Xu, Haitao [1 ]
Li, Zhao [2 ]
Chu, Chen [2 ]
Chen, Yuanmi [2 ]
Yang, Yifan [2 ]
Lu, Haifeng [2 ]
Wang, Haining [3 ]
Stavrou, Angelos [4 ]
机构
[1] Arizona State Univ, Glendale, AZ 85306 USA
[2] Alibaba Grp, Hangzhou, Zhejiang, Peoples R China
[3] Univ Delaware, Newark, DE 19716 USA
[4] George Mason Univ, Fairfax, VA 22030 USA
来源
关键词
D O I
10.1007/978-3-319-98989-1_8
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
A certain amount of web traffic is attributed to web bots on the Internet. Web bot traffic has raised serious concerns among website operators, because they usually consume considerable resources at web servers, resulting in high workloads and longer response time, while not bringing in any profit. Even worse, the content of the pages it crawled might later be used for other fraudulent activities. Thus, it is important to detect web bot traffic and characterize it. In this paper, we first propose an efficient approach to detect web bot traffic in a large e-commerce marketplace and then perform an in-depth analysis on the characteristics of web bot traffic. Specifically, our proposed bot detection approach consists of the following modules: (1) an Expectation Maximization (EM)based feature selection method to extract the most distinguishable features, (2) a gradient based decision tree to calculate the likelihood of being a bot IP, and (3) a threshold estimation mechanism aiming to recover a reasonable amount of non-bot traffic flow. The detection approach has been applied on Taobao/Tmall platforms, and its detection capability has been demonstrated by identifying a considerable amount of web bot traffic. Based on data samples of traffic originating from web bots and normal users, we conduct a comparative analysis to uncover the behavioral patterns of web bots different from normal users. The analysis results reveal their differences in terms of active time, search queries, item and store preferences, and many other aspects. These findings provide new insights for public websites to further improve web bot traffic detection for protecting valuable web contents.
引用
收藏
页码:143 / 163
页数:21
相关论文
共 50 条
  • [31] A Voice Controlled E-Commerce Web Application
    Kandhari, Mandeep Singh
    Zulkernine, Farhana
    Isah, Haruna
    2018 IEEE 9TH ANNUAL INFORMATION TECHNOLOGY, ELECTRONICS AND MOBILE COMMUNICATION CONFERENCE (IEMCON), 2018, : 118 - 124
  • [32] Adaptive delivery of E-commerce web sites
    Gupta, Ashish
    Mathur, Ajay
    Intelligent Data Analysis, 2002, 6 (05) : 469 - 480
  • [33] A secured web browser for e-commerce transactions
    Dellisanti, B
    Dunning, LA
    Ramakrishnan, S
    Proceedings of the ISCA 20th International Conference on Computers and Their Applications, 2005, : 232 - 235
  • [34] Research on Application of Web Mining in E-Commerce
    Bin, Ning
    Lei, Yuan
    MEMS, NANO AND SMART SYSTEMS, PTS 1-6, 2012, 403-408 : 1830 - 1833
  • [35] Directions for web and e-commerce applications security
    Thuraisingham, B
    Clifton, C
    Gupta, A
    Bertino, E
    Ferrari, E
    PROCEEDINGS OF THE TENTH IEEE INTERNATIONAL WORKSHOPS ON ENABLING TECHNOLOGIES: INFRASTRUCTURE FOR COLLABORATIVE ENTERPRISES, 2001, : 200 - 204
  • [36] Web-based coordination for e-commerce
    Zhang, Xiguo
    Wang, Guihe
    Fan, Limin
    INTEGRATION AND INNOVATION ORIENT TO E-SOCIETY, VOL 1, 2007, 251 : 515 - +
  • [37] A web agent for automating e-commerce operations
    Raposo, J
    Alvarez, M
    Viña, A
    Montoto, P
    Hidalgo, J
    Pan, A
    IEEE INTERNATIONAL CONFERENCE ON E-COMMERCE, 2003, : 16 - 19
  • [38] Mining web browsing patterns for E-commerce
    Song, Qinbao
    Shepperd, Martin
    COMPUTERS IN INDUSTRY, 2006, 57 (07) : 622 - 630
  • [39] Analysis and Development of E-Commerce Web Application
    Tyagi, Shivanshu
    Yadav, Shashwat
    Singhal, Utkarsh
    Chaudhary, Himanshi
    Proceedings - 2022 5th International Conference on Computational Intelligence and Communication Technologies, CCICT 2022, 2022, : 65 - 72
  • [40] WEB data mining applications in e-commerce
    Zhao, Yonghua
    Lin, Hong
    2014 PROCEEDINGS OF THE 9TH INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE & EDUCATION (ICCSE 2014), 2014, : 557 - 559