Googling the Internet: Profiling Internet Endpoints via the World Wide Web

被引:28
|
作者
Trestian, Ionut [1 ]
Ranjan, Supranamaya [2 ]
Kuzmanovic, Aleksandar [1 ]
Nucci, Antonio [2 ]
机构
[1] Northwestern Univ, Dept Elect Engn & Comp Sci, Evanston, IL 60208 USA
[2] Narus Inc, Mountain View, CA 94043 USA
关键词
Clustering; endpoint profiling; Google; traffic classification; traffic locality;
D O I
10.1109/TNET.2009.2031175
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Understanding Internet access trends at a global scale, i.e., how people use the Internet, is a challenging problem that is typically addressed by analyzing network traces. However, obtaining such traces presents its own set of challenges owing to either privacy concerns or to other operational difficulties. The key hypothesis of our work here is that most of the information needed to profile the Internet endpoints is already available around us-on the Web. In this paper, we introduce a novel approach for profiling and classifying endpoints. We implement and deploy a Google-based profiling tool, that accurately characterizes endpoint behavior by collecting and strategically combining information freely available on the Web. Our Web-based "unconstrained endpoint profiling" (UEP) approach shows advances in the following scenarios: 1) even when no packet traces are available, it can accurately infer application and protocol usage trends at arbitrary networks; 2) when network traces are available, it outperforms state-of-the-art classification tools such as BLINC; 3) when sampled flow-level traces are available, it retains high classification capabilities. We explore other complementary UEP approaches, such as p2p- and reverse-DNS-lookup-based schemes, and show that they can further improve the results of the Web-based UEP. Using this approach, we perform unconstrained endpoint profiling at a global scale: for clients in four different world regions (Asia, South and North America, and Europe). We provide the first-of-its-kind endpoint analysis that reveals fascinating similarities and differences among these regions.
引用
收藏
页码:666 / 679
页数:14
相关论文
共 50 条
  • [1] Internet and World Wide Web
    Maid, U
    [J]. ERNAHRUNGS-UMSCHAU, 1996, 43 (11): : B41 - B43
  • [2] Internet and the world wide web
    Neelima Shrikhande
    [J]. Resonance, 1997, 2 (2) : 64 - 74
  • [3] The Internet and the World Wide Web
    Weaver, AC
    [J]. IECON '97 - PROCEEDINGS OF THE 23RD INTERNATIONAL CONFERENCE ON INDUSTRIAL ELECTRONICS, CONTROL, AND INSTRUMENTATION, VOLS. 1-4, 1997, : 1529 - 1540
  • [4] The Internet and the World Wide Web
    Collen, MF
    [J]. M D COMPUTING, 1999, 16 (05): : 72 - 72
  • [5] Unconstrained endpoint profiling (Googling the Internet)
    Trestian, Ionut
    Ranjan, Supranamaya
    Kuzmanovic, Aleksandar
    Nucci, Antonio
    [J]. ACM SIGCOMM COMPUTER COMMUNICATION REVIEW, 2008, 38 (04) : 279 - 290
  • [6] Researching on the World Wide Web Internet
    Whitener, L
    [J]. JOURNAL OF RURAL HEALTH, 1997, 13 (03): : 257 - 263
  • [7] Introduction of the Internet and World Wide Web
    He, J
    [J]. EXPERIMENTAL TECHNIQUES, 1997, 21 (05) : 29 - 33
  • [8] THE WORLD WIDE WEB - INTERNET BOOMTOWN
    SEMICH, JW
    [J]. DATAMATION, 1995, 41 (01): : 37 - 41
  • [9] Introduction of the internet and world wide web
    J. He
    [J]. Experimental Techniques, 1997, 21 : 29 - 33
  • [10] Internet, World Wide Web, and creativity
    Siau, K
    [J]. JOURNAL OF CREATIVE BEHAVIOR, 1999, 33 (03): : 191 - 201