Live Debugging of Distributed Systems

被引:0
|
作者
Dao, Darren [1 ]
Albrecht, Jeannie [2 ]
Killian, Charles [3 ]
Vahdat, Amin [1 ]
机构
[1] Univ Calif San Diego, La Jolla, CA 92093 USA
[2] Williams Coll, Williamstown, MA 01267 USA
[3] Purdue Univ, W Lafayette, IN 47906 USA
来源
关键词
D O I
暂无
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Debugging distributed systems is challenging. Although incremental debugging during development finds some bugs, developers are rarely able to fully test their systems under realistic operating conditions prior to deployment. While deploying a system exposes it to realistic conditions, debugging requires the developer to: (i) detect a bug, (ii) gather the system state necessary for diagnosis, and (iii) sift through the gathered state to determine a root cause. In this paper, we present MaceODB, a tool to assist programmers with debugging deployed distributed systems. Programmers define a set of runtime properties for their system, which MaceODB checks for violations during execution. Once MaceODB detects a violation, it provides the programmer with the information to determine its root cause. We have been able to diagnose several non-trivial bugs in existing mature distributed systems using MaceODB; we discuss two of these bugs in this paper. Benchmarks indicate that the approach has low overhead and is suitable for in situ debugging of deployed systems.
引用
收藏
页码:94 / +
页数:2
相关论文
共 50 条
  • [1] Debugging on live systems
    Vicious K.
    [J]. Queue, 2011, 9 (09):
  • [2] Debugging Distributed Systems
    Beschastnikh, Ivan
    Wang, Patty
    Brun, Yuriy
    Ernst, Michael D.
    [J]. COMMUNICATIONS OF THE ACM, 2016, 59 (08) : 32 - 37
  • [3] Kode Vicious Debugging on Live Systems
    Neville-Neil, George V.
    [J]. COMMUNICATIONS OF THE ACM, 2011, 54 (12) : 32 - 33
  • [4] Debugging and monitoring distributed heterogeneous systems
    Galatenko, V.A.
    Kostyukhin, K.A.
    [J]. Programmirovanie, 2002, 28 (01): : 27 - 38
  • [5] Debugging and Monitoring Distributed Heterogeneous Systems
    V. A. Galatenko
    K. A. Kostyukhin
    [J]. Programming and Computer Software, 2002, 28 : 20 - 27
  • [6] Visualization and debugging of distributed multiagent systems
    Ndumu, DT
    Nwana, HS
    Lee, LC
    Haynes, HR
    [J]. APPLIED ARTIFICIAL INTELLIGENCE, 1999, 13 (1-2) : 187 - 208
  • [7] MULTIBUG - INTERACTIVE DEBUGGING IN DISTRIBUTED SYSTEMS
    CORSINI, P
    PRETE, CA
    [J]. IEEE MICRO, 1986, 6 (03) : 26 - 33
  • [8] Unified Debugging of Distributed Systems with Recon
    Lee, Kyu Hyung
    Sumner, Nick
    Zhang, Xiangyu
    Eugster, Patrick
    [J]. 2011 IEEE/IFIP 41ST INTERNATIONAL CONFERENCE ON DEPENDABLE SYSTEMS AND NETWORKS (DSN), 2011, : 85 - 96
  • [9] Debugging and monitoring distributed heterogeneous systems
    Galatenko, VA
    Kostyukhin, KA
    [J]. PROGRAMMING AND COMPUTER SOFTWARE, 2002, 28 (01) : 20 - 27
  • [10] A suite of tools for debugging distributed autonomous systems
    Kortenkamp, D
    Simmons, R
    Milam, T
    Fernández, JL
    [J]. FORMAL METHODS IN SYSTEM DESIGN, 2004, 24 (02) : 157 - 188