A Scalable Prescriptive Parallel Debugging Model

Cited by: 1
Authors
Jensen, Nicklas Bo [1]
Nielsen, Niklas Quarfot [1, 2, 3]
Lee, Gregory L. [3]
Karlsson, Sven [1]
Legendre, Matthew [3]
Schulz, Martin [3]
Ahn, Dong H. [3]
Affiliations
[1] Tech Univ Denmark, DTU Comp, Lyngby, Denmark
[2] Mesosphere Inc, San Francisco, CA USA
[3] Lawrence Livermore Natl Lab, Computat Directorate, Livermore, CA USA
DOI
10.1109/IPDPS.2015.15
Chinese Library Classification
TP3 [Computing technology, computer technology];
Discipline classification code
0812;
Abstract
Debugging is a critical step in the development of any parallel program. However, the traditional interactive debugging model, where users manually step through code and inspect their application, does not scale well even for current supercomputers due to its centralized nature. Lightweight debugging models, which have been proposed as an alternative, scale well, but they can currently debug only a subset of bug classes. We therefore propose a new model, which we call prescriptive debugging, to fill the gap between these two approaches. This user-guided model allows programmers to express and test their debugging intuition in a way that helps to reduce the error space. We introduce a prototype implementation of this model, DySectAPI, which allows programmers to construct probe trees for automatic, event-driven debugging at scale. In this paper we introduce the concepts behind DySectAPI and, using both experimental results and analytical modeling, show that the DySectAPI implementation can run with low overhead on current systems. The prototype scales logarithmically, and our predictions show that the overhead of the prescriptive debugging model will be small even for a large system.
Pages: 473-483
Page count: 11