Memory Debugging of MPI-Parallel Applications in Open MPI

被引:0
|
作者
Keller, Rainer [1 ]
Fan, Shiqing [1 ]
Resch, Michael [1 ]
机构
[1] Univ Stuttgart, Ctr High Performance Comp, D-7000 Stuttgart, Germany
关键词
D O I
暂无
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
In this paper we describe the implementation of memory checking functionality based on instrumentation using valgrind. The combination of valgrind based checking functions within the MPI-implementation offers superior debugging functionality, for errors that otherwise are not possible to detect with comparable MPI-debugging tools. The functionality is integrated into Open MPI as the so-called memchecker-framework. This allows other memory debuggers that offer a similar API to be integrated. The tight control of the user's memory passed to Open MPI, allows not only to find application errors. but also helps track bugs within Open MPI itself. We describe the actual checks, classes of errors being found, how memory buffers internally are being handled, show errors actually found in user's code and the performance implications of this instrumentation.
引用
下载
收藏
页码:517 / 523
页数:7
相关论文
共 50 条
  • [31] MPIGDB: A Flexible Debugging Infrastructure for MPI Programs
    Underwood, Robert
    Nicolae, Bogdan
    PROCEEDINGS OF THE 13TH WORKSHOP ON AI AND SCIENTIFIC COMPUTING AT SCALE USING FLEXIBLE COMPUTING INFRASTRUCTURES, FLEXSCIENCE 2023, 2023, : 11 - 18
  • [32] Toward Heterogeneous MPI plus MPI Programming: Comparison of OpenMP and MPI Shared Memory Models
    Szustak, Lukasz
    Wyrzykowski, Roman
    Halbiniak, Kamil
    Bratek, Pawel
    EURO-PAR 2019: PARALLEL PROCESSING WORKSHOPS, 2020, 11997 : 270 - 281
  • [33] A Channel Memory based fault tolerance for MPI applications
    Selikhov, A
    Germain, C
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2005, 21 (05): : 709 - 715
  • [34] Memory debugging in parallel and distributed applications
    Gottbrath, Chris
    TOOLS FOR HIGH PERFORMANCE COMPUTING, 2008, : 79 - 90
  • [35] MPI-SIM: Using parallel simulation to evaluate MPI programs
    Prakash, S
    Bagrodia, RL
    1998 WINTER SIMULATION CONFERENCE PROCEEDINGS, VOLS 1 AND 2, 1998, : 467 - 474
  • [36] Parallel programming with interoperable MPI
    George, WL
    Hagedorn, JG
    Devaney, JE
    DR DOBBS JOURNAL, 2004, 29 (02): : 49 - 53
  • [37] VolpexMPI: An MPI Library for Execution of Parallel Applications on Volatile Nodes
    LeBlanc, Troy
    Anand, Rakhi
    Gabriel, Edgar
    Subhlok, Jaspal
    RECENT ADVANCES IN PARALLEL VIRTUAL MACHINE AND MESSAGE PASSING INTERFACE, PROCEEDINGS, 2009, 5759 : 124 - 133
  • [38] Logically Parallel Communication for Fast MPI plus Threads Applications
    Zambre, Rohit
    Sahasrabudhe, Damodar
    Zhou, Hui
    Berzins, Martin
    Chandramowlishwaran, Aparna
    Balaji, Pavan
    IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2021, 32 (12) : 3038 - 3052
  • [39] Detection of violations to the MPI standard in hybrid OpenMP/MPI applications
    Hilbrich, Tobias
    Mueller, Matthias S.
    Krammer, Bettina
    OPENMP IN A NEW ERA OF PARALLELISM, PROCEEDINGS, 2008, 5004 : 26 - +
  • [40] Verification of MPI Applications
    Meca, Ondrej
    Boehm, Stanislav
    PROCEEDINGS OF THE INTERNATIONAL CONFERENCE OF NUMERICAL ANALYSIS AND APPLIED MATHEMATICS 2014 (ICNAAM-2014), 2015, 1648