Knockin' on Trackers' Door: Large-Scale Automatic Analysis of Web Tracking

被引:1
|
作者
Sanchez-Rola, Iskander [1 ]
Santos, Igor [1 ]
机构
[1] Univ Deusto, DeustoTech, Bilbao, Spain
关键词
Device fingerprinting; Privacy; Web tracking;
D O I
10.1007/978-3-319-93411-2_13
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper, we present the first generic large-scale analysis of different known and unknown web tracking scripts on the Internet to understand its current ecosystem and their behavior. To this end, we implemented TRACKINGINSPECTOR the first automatic method capable of detecting generically different types of web tracking scripts. This method automatically retrieves the existing scripts from a website and, through code similarity and machine learning, detects modifications of known tracking scripts and discovers unknown tracking script candidates. TRACKINGINSPECTOR analyzed the Alexa top 1M websites, computing the web tracking prevalence and its ecosystem, as well as the influence of hosting, website category, and website reputation. More than 90% websites performed some sort of tracking and more than 50% scripts were used for web tracking. Over 2,000,000 versions of known tracking scripts were found. We discovered several script renaming techniques used to avoid blacklists, performing a comprehensive analysis of them. 5,500,000 completely unknown likely tracking scripts were found, including more than 700 new different potential device fingerprinting unique scripts. Our system also automatically detected the fingerprinting behavior of a previously reported targeted fingerprinting-driven malware campaign in two different websites not previously documented.
引用
收藏
页码:281 / 302
页数:22
相关论文
共 50 条
  • [1] Large-Scale Web Data Analysis
    Leskovec, Jure
    [J]. IEEE INTELLIGENT SYSTEMS, 2011, 26 (01) : 11 - 11
  • [2] A Computer Vision Pipeline for Automatic Large-scale Inventory Tracking
    Gregory, Stephen
    Singh, Utkarsh
    Gray, Jeff
    Hobbs, Jon
    [J]. ACMSE 2021: PROCEEDINGS OF THE 2021 ACM SOUTHEAST CONFERENCE, 2021, : 100 - 107
  • [3] Laser trackers for large-scale dimensional metrology: A review
    Muralikrishnan, Bala
    Phillips, Steve
    Sawyer, Daniel
    [J]. PRECISION ENGINEERING-JOURNAL OF THE INTERNATIONAL SOCIETIES FOR PRECISION ENGINEERING AND NANOTECHNOLOGY, 2016, 44 : 13 - 28
  • [4] Large-Scale Collaborative Analysis and Extraction of Web Data
    Weigel, Felix
    Panda, Biswanath
    Riedewald, Mirek
    Gehrke, Johannes
    Calimlim, Manuel
    [J]. PROCEEDINGS OF THE VLDB ENDOWMENT, 2008, 1 (02): : 1476 - 1479
  • [5] A Large-Scale Empirical Analysis of Chinese Web Passwords
    Li, Zhigong
    Han, Weili
    Xu, Wenyuan
    [J]. PROCEEDINGS OF THE 23RD USENIX SECURITY SYMPOSIUM, 2014, : 559 - 574
  • [6] Radiation hard strip detectors for large-scale silicon trackers
    Andricek, L
    Hauff, D
    Kemmer, J
    Lükewille, P
    Lutz, G
    Moser, HG
    Richter, RH
    Rohe, T
    Stolze, K
    Viehl, A
    [J]. NUCLEAR INSTRUMENTS & METHODS IN PHYSICS RESEARCH SECTION A-ACCELERATORS SPECTROMETERS DETECTORS AND ASSOCIATED EQUIPMENT, 1999, 436 (1-2): : 262 - 271
  • [7] Automatic Acquisition of Large-scale Academic Bilingual Parallel Corpus from the Web
    Han Yong
    Li Yu
    He Xiaoning
    Yang Muyun
    Lei Guohua
    [J]. 2009 INTERNATIONAL CONFERENCE ON ASIAN LANGUAGE PROCESSING, 2009, : 318 - 321
  • [8] Experimenting with computational methods for large-scale studies of tracking technologies in web archives
    Nielsen, Janne
    [J]. INTERNET HISTORIES, 2019, 3 (3-4): : 293 - 315
  • [9] Automatic analysis of large-scale pairwise alignments of protein sequences
    Codani, JJ
    Comet, JP
    Aude, JC
    Glémet, E
    Wozniak, A
    Risler, JL
    Hénaut, A
    Slonimski, PP
    [J]. METHODS IN MICROBIOLOGY, VOL 28, 1999, 28 : 229 - 244
  • [10] Temporal Sentiment Tracking and Analysis on Large-scale Social Events
    Hazimeh, Hussein
    Harissa, Mohammad
    Mugellini, Elena
    Abou Khaled, Omar
    [J]. 2019 8TH INTERNATIONAL CONFERENCE ON SOFTWARE AND COMPUTER APPLICATIONS (ICSCA 2019), 2019, : 17 - 21