Distributed Network Monitoring and Debugging with SwitchPointer

被引:0
|
作者
Tammana, Praveen [1 ]
Agarwal, Rachit [2 ]
Lee, Myungjin [1 ]
机构
[1] Univ Edinburgh, Edinburgh, Midlothian, Scotland
[2] Cornell Univ, Ithaca, NY 14853 USA
基金
英国工程与自然科学研究理事会;
关键词
D O I
暂无
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Monitoring and debugging large-scale networks remains a challenging problem. Existing solutions operate at one of the two extremes-systems running at end-hosts (more resources but less visibility into the network) or at network switches (more visibility, but limited resources). We present SwitchPointer, a network monitoring and debugging system that integrates the best of the two worlds. SwitchPointer exploits end-host resources and programmability to collect and monitor telemetry data. The key contribution of SwitchPointer is to efficiently provide network visibility by using switch memory as a "directory service"-each switch, rather than storing the data necessary for monitoring functionalities, stores pointers to end-hosts where relevant telemetry data is stored. We demonstrate, via experiments over real-world testbeds, that SwitchPointer can efficiently monitor and debug network problems, many of which were either hard or even infeasible with existing designs.
引用
收藏
页码:453 / 466
页数:14
相关论文
共 50 条
  • [21] Replay debugging for distributed applications
    Geels, Dennis
    Altekar, Gautam
    Shenker, Scott
    Stoica, Ion
    [J]. USENIX ASSOCIATION PROCEEDINGS OF THE 2006 USENIX ANNUAL TECHNICAL CONFERENCE, 2006, : 289 - +
  • [22] Testing and debugging of distributed software
    Cunha, JC
    Krawczyk, H
    [J]. COMPUTERS AND ARTIFICIAL INTELLIGENCE, 2000, 19 (06): : 495 - 510
  • [23] Live Debugging of Distributed Systems
    Dao, Darren
    Albrecht, Jeannie
    Killian, Charles
    Vahdat, Amin
    [J]. COMPILER CONSTRUCTION, PROCEEDINGS, 2009, 5501 : 94 - +
  • [24] DEBUGGING A DISTRIBUTED COMPUTING SYSTEM
    GARCIAMOLINA, H
    GERMANO, F
    KOHLER, WH
    [J]. IEEE TRANSACTIONS ON SOFTWARE ENGINEERING, 1984, 10 (02) : 210 - 219
  • [25] WBEM based distributed network monitoring
    Bo, L
    Hui, L
    [J]. GRID AND COOPERATIVE COMPUTING GCC 2004 WORKSHOPS, PROCEEDINGS, 2004, 3252 : 351 - 357
  • [26] Automated and Distributed Network Service Monitoring
    Germoglio, Giovan
    Dias, Bruno
    Sousa, Pedro
    [J]. MANAGEMENT ENABLING THE FUTURE INTERNET FOR CHANGING BUSINESS AND NEW COMPUTING SERVICES, PROCEEDINGS, 2009, 5787 : 143 - 150
  • [27] Toward efficient distributed network monitoring
    Du, XJ
    [J]. CONFERENCE PROCEEDINGS OF THE 2004 IEEE INTERNATIONAL PERFORMANCE, COMPUTING, AND COMMUNICATIONS CONFERENCE, 2004, : 87 - 94
  • [28] Network load monitoring in distributed systems
    Islam, KMJ
    Shirazi, BA
    Welch, LR
    Tjaden, BC
    Cavanaugh, C
    Anwar, S
    [J]. PARALLEL AND DISTRIBUTED PROCESSING, PROCEEDINGS, 2000, 1800 : 800 - 807
  • [29] Strategy of deterministic replay debugging based on the event model in distributed debugging
    Li Q.-S.
    Li J.
    Ye H.
    Du L.
    [J]. Xi'an Dianzi Keji Daxue Xuebao/Journal of Xidian University, 2010, 37 (05): : 872 - 878
  • [30] GLOBAL CONDITIONS IN DEBUGGING DISTRIBUTED PROGRAMS
    MANABE, Y
    IMASE, M
    [J]. JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 1992, 15 (01) : 62 - 69