A Scalable Prescriptive Parallel Debugging Model

被引:1
|
作者
Jensen, Nicklas Bo [1 ]
Nielsen, Niklas Quarfot [1 ,2 ,3 ]
Lee, Gregory L. [3 ]
Karlsson, Sven [1 ]
Legendre, Matthew [3 ]
Schulz, Martin [3 ]
Ahn, Dong H. [3 ]
机构
[1] Tech Univ Denmark, DTU Comp, Lyngby, Denmark
[2] Mesosphere Inc, San Francisco, CA USA
[3] Lawrence Livermore Natl Lab, Computat Directorate, Livermore, CA USA
关键词
D O I
10.1109/IPDPS.2015.15
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Debugging is a critical step in the development of any parallel program. However, the traditional interactive debugging model, where users manually step through code and inspect their application, does not scale well even for current supercomputers due its centralized nature. While lightweight debugging models, which have been proposed as an alternative, scale well, they can currently only debug a subset of bug classes. We therefore propose a new model, which we call prescriptive debugging, to fill this gap between these two approaches. This user-guided model allows programmers to express and test their debugging intuition in a way that helps to reduce the error space. Based on this debugging model we introduce a prototype implementation embodying this model, the DySectAPI, allowing programmers to construct probe trees for automatic, event-driven debugging at scale. In this paper we introduce the concepts behind DySectAPI and, using both experimental results and analytical modeling, we show that the DySectAPI implementation can run with a low overhead on current systems. We achieve a logarithmic scaling of the prototype and show predictions that even for a large system the overhead of the prescriptive debugging model will be small.
引用
收藏
页码:473 / 483
页数:11
相关论文
共 50 条
  • [1] Scalable Parallel Debugging with Statistical Assertions
    Minh Ngoc Dinh
    Abramson, David
    Jin, Chao
    Gontarek, Andrew
    Moench, Bob
    DeRose, Luiz
    ACM SIGPLAN NOTICES, 2012, 47 (08) : 311 - 312
  • [2] Scalable Parallel Debugging with g-Eclipse
    Koeckerbauer, Thomas
    Klausecker, Christof
    Kranzlmueller, Dieter
    TOOLS FOR HIGH PERFORMANCE COMPUTING 2009, 2010, : 115 - 123
  • [3] Extending the Eclipse Parallel Tools Platform debugger with Scalable Parallel Debugging Library
    Jin, Chao
    Ding, Liang
    Abramson, David
    2013 INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE, 2013, 18 : 1774 - 1783
  • [4] Efficient and scalable omniscient debugging for model transformations
    Corley, Jonathan
    Eddy, Brian P.
    Syriani, Eugene
    Gray, Jeff
    SOFTWARE QUALITY JOURNAL, 2017, 25 (01) : 7 - 48
  • [5] Efficient and scalable omniscient debugging for model transformations
    Jonathan Corley
    Brian P. Eddy
    Eugene Syriani
    Jeff Gray
    Software Quality Journal, 2017, 25 : 7 - 48
  • [6] Scalable omniscient debugging
    Pothier, Guillaume
    Tanter, Eric
    Piquer, Jose
    ACM SIGPLAN NOTICES, 2007, 42 (10) : 535 - 551
  • [7] Scalable Relative Debugging
    Minh Ngoc Dinh
    Abramson, David
    Jin, Chao
    IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2014, 25 (03) : 740 - 749
  • [8] Scalable Omniscient Debugging
    Pothier, Guillaume
    Tanter, Eric
    Piquer, Jose
    OOPSLA: 22ND INTERNATIONAL CONFERENCE ON OBJECT-ORIENTED PROGRAMMING, SYSTEMS, LANGUAGES, AND APPLICATIONS, PROCEEDINGS, 2007, : 535 - 551
  • [9] A MODEL FOR MONITORING AND DEBUGGING PARALLEL AND DISTRIBUTED SOFTWARE
    MARINESCU, DC
    LUMPP, JE
    CASAVANT, TL
    SIEGEL, HJ
    PROCEEDINGS : THE THIRTEENTH ANNUAL INTERNATIONAL COMPUTER SOFTWARE & APPLICATIONS CONFERENCE, 1989, : 81 - 88
  • [10] An interface to support the identification of dynamic MPI 2 processes for scalable parallel debugging
    Gottbrath, Christopher
    Barrett, Brian
    Gropp, Bill
    Lusk, Ewing 'Rusty'
    Squyres, Jeff
    RECENT ADVANCES IN PARALLEL VIRTUAL MACHINE AND MESSAGE PASSING INTERFACE, 2006, 4192 : 115 - +