Analysis of Web Browsing Data: A Guide

被引:1
|
作者
von Hohenberg, Bernhard Clemm [1 ,10 ]
Stier, Sebastian [2 ,8 ]
Cardenal, Ana S. [3 ]
Guess, Andrew M. [4 ,5 ]
Menchen-Trevino, Ericka [6 ]
Wojcieszak, Magdalena [7 ,9 ]
机构
[1] GESIS Leibniz Inst Social Sci, Cologne, Germany
[2] GESIS Leibniz Inst Social Sci, Computat Social Sci Dept, Cologne, Germany
[3] Univ Oberta Catalunya, Barcelona, Spain
[4] Princeton Univ, Polit & Publ Affairs, Princeton, NJ USA
[5] Princeton Univ, Ctr Informat Technol Policy, Princeton, NJ USA
[6] Amer Univ, Washington, DC USA
[7] Univ Calif Davis, Davis, CA USA
[8] Univ Mannheim, Sch Social Sci, Mannheim, Germany
[9] Univ Amsterdam, Amsterdam Sch Commun Res, Amsterdam, Netherlands
[10] GESIS Leibniz Inst SocialSciences, Dept Computat Social Sci, D-50667 Cologne, Germany
基金
欧洲研究理事会;
关键词
web browsing data; digital trace data; web tracking data; computational social science; ONLINE; NEWS;
D O I
10.1177/08944393241227868
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
The use of individual-level browsing data, that is, the records of a person's visits to online content through a desktop or mobile browser, is of increasing importance for social scientists. Browsing data have characteristics that raise many questions for statistical analysis, yet to date, little hands-on guidance on how to handle them exists. Reviewing extant research, and exploring data sets collected by our four research teams spanning seven countries and several years, with over 14,000 participants and 360 million web visits, we derive recommendations along four steps: preprocessing the raw data; filtering out observations; classifying web visits; and modelling browsing behavior. The recommendations we formulate aim to foster best practices in the field, which so far has paid little attention to justifying the many decisions researchers need to take when analyzing web browsing data.
引用
收藏
页数:26
相关论文
共 50 条
  • [31] Towards Privacy-Preserving Data Trading for Web Browsing History
    Cai, Hui
    Ye, Fan
    Yang, Yuanyuan
    Zhu, Yanmin
    Li, Jie
    [J]. PROCEEDINGS OF THE IEEE/ACM INTERNATIONAL SYMPOSIUM ON QUALITY OF SERVICE (IWQOS 2019), 2019,
  • [32] Efficient Anonymous Web Browsing Preventing Traffic Analysis Attacks
    Priyanka, A. R.
    BalaSubramanian, Kannan
    [J]. 2013 IEEE INTERNATIONAL CONFERENCE ON EMERGING TRENDS IN COMPUTING, COMMUNICATION AND NANOTECHNOLOGY (ICE-CCN'13), 2013, : 426 - 431
  • [33] Interactive Visual Analysis of Browsing and Authoring Behaviors in Web Media
    Kuan, Yen-Ting
    Tai, Ming-Hung
    Kuan, Hsuan-Hao
    Ho, Tan-Chi
    Wang, Yu-Shuen
    Lin, Wen-Chieh
    Chuang, Jung-Hong
    [J]. 2015 BIG DATA VISUAL ANALYTICS (BDVA), 2015,
  • [34] A user-friendly, dynamic web environment for remote data browsing and analysis of multiparametric geophysical data within the MULTIMO project
    Carniel, R
    Di Cecca, M
    Jaquet, O
    [J]. JOURNAL OF VOLCANOLOGY AND GEOTHERMAL RESEARCH, 2006, 153 (1-2) : 80 - 96
  • [35] Secure web browsing with the OP web browser
    Grier, Chris
    Tang, Shuo
    King, Samuel T.
    [J]. PROCEEDINGS OF THE 2008 IEEE SYMPOSIUM ON SECURITY AND PRIVACY, 2008, : 402 - 416
  • [36] Extreme Web Caching for Faster Web Browsing
    Raza, Ali
    Zaki, Yasir
    Poetsch, Thomas
    Chen, Jay
    Subramanian, Lakshmi
    [J]. ACM SIGCOMM COMPUTER COMMUNICATION REVIEW, 2015, 45 (04) : 111 - 112
  • [37] Extreme Web Caching for Faster Web Browsing
    Raza, Ali
    Zaki, Yasir
    Poetsch, Thomas
    Chen, Jay
    Subramanian, Lakshmi
    [J]. SIGCOMM'15: PROCEEDINGS OF THE 2015 ACM CONFERENCE ON SPECIAL INTEREST GROUP ON DATA COMMUNICATION, 2015, : 111 - 112
  • [38] Mobile Web on the Desktop: Simpler Web Browsing
    Hoehl, Jeffery
    Lewis, Clayton
    [J]. ASSETS 11: PROCEEDINGS OF THE 13TH INTERNATIONAL ACM SIGACCESS CONFERENCE ON COMPUTERS AND ACCESSIBILITY, 2011, : 263 - 264
  • [39] Web site off-line structure reconfiguration:: A web user browsing analysis
    Rios, Sebastian A.
    Velasquez, Juan D.
    Yasuda, Hiroshi
    Aoki, Terumasa
    [J]. KNOWLEDGE-BASED INTELLIGENT INFORMATION AND ENGINEERING SYSTEMS, PT 2, PROCEEDINGS, 2006, 4252 : 371 - 378
  • [40] Browsing behaviour analysis using data mining
    Seemi, Farhana
    Aslam, Hania
    Mukhtar, Hamid
    Khattak, Sana
    [J]. International Journal of Advanced Computer Science and Applications, 2019, 10 (02): : 490 - 498