Memory Debugging of MPI-Parallel Applications in Open MPI

Authors
Keller, Rainer [1]
Fan, Shiqing [1]
Resch, Michael [1]
Affiliations
[1] Univ Stuttgart, Ctr High Performance Comp, D-7000 Stuttgart, Germany
DOI
Not available
Chinese Library Classification (CLC) number
TP39 [Computer applications];
Discipline classification code
081203; 0835;
Abstract
In this paper we describe the implementation of memory-checking functionality based on instrumentation with Valgrind. Combining Valgrind-based checking functions with the MPI implementation offers superior debugging functionality for errors that otherwise cannot be detected with comparable MPI debugging tools. The functionality is integrated into Open MPI as the so-called memchecker framework, which allows other memory debuggers offering a similar API to be integrated. The tight control of the user's memory passed to Open MPI not only helps find application errors but also helps track down bugs within Open MPI itself. We describe the actual checks, the classes of errors found, how memory buffers are handled internally, errors actually found in users' code, and the performance implications of this instrumentation.
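The kind of buffer tracking the abstract describes can be illustrated with a toy model. The sketch below is not Open MPI's memchecker code and does not use Valgrind's API; every name in it (`ShadowBuffer`, `check_send`, `start_nonblocking_recv`, `complete_recv`) is hypothetical. It assumes, for illustration only, that a checker keeps one state per buffer element, reports undefined data in a send buffer, and makes a receive buffer inaccessible while a non-blocking transfer is pending.

```python
from enum import Enum


class State(Enum):
    UNDEFINED = "undefined"  # allocated but never written
    DEFINED = "defined"      # holds valid data
    NOACCESS = "noaccess"    # handed to a pending transfer


class ShadowBuffer:
    """Toy buffer with one shadow state per element (hypothetical model)."""

    def __init__(self, size):
        self.data = [0] * size
        self.state = [State.UNDEFINED] * size

    def write(self, index, value):
        if self.state[index] is State.NOACCESS:
            raise RuntimeError(f"write to element {index} during a pending transfer")
        self.data[index] = value
        self.state[index] = State.DEFINED

    def read(self, index):
        if self.state[index] is State.NOACCESS:
            raise RuntimeError(f"read of element {index} during a pending transfer")
        if self.state[index] is State.UNDEFINED:
            raise RuntimeError(f"read of undefined element {index}")
        return self.data[index]


def check_send(buf, count):
    """Return the indices in the first `count` elements that are not defined."""
    return [i for i in range(count) if buf.state[i] is not State.DEFINED]


def start_nonblocking_recv(buf, count):
    """Mark the posted receive region inaccessible until completion."""
    for i in range(count):
        buf.state[i] = State.NOACCESS


def complete_recv(buf, count, payload):
    """Make received elements defined; the rest of the posted region is undefined."""
    if len(payload) > count:
        raise ValueError("payload is larger than the posted receive region")
    for i in range(count):
        if i < len(payload):
            buf.data[i] = payload[i]
            buf.state[i] = State.DEFINED
        else:
            buf.state[i] = State.UNDEFINED
```

In this model, calling `check_send` on a partly initialized buffer lists the elements that were never written, and any `read` or `write` between `start_nonblocking_recv` and `complete_recv` raises an error, which mirrors the class of application mistakes a checker with tight control over user buffers could flag.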
Pages: 517-523
Page count: 7
Related papers (entries 21-30 of 50)
  • [21] MPI/CTP: A reconfigurable MPI for HPC applications
    Venkata, Manjunath Gorentla
    Bridges, Patrick G.
    RECENT ADVANCES IN PARALLEL VIRTUAL MACHINE AND MESSAGE PASSING INTERFACE, 2006, 4192 : 96 - 104
  • [22] MPI Correctness Checking for OpenMP/MPI Applications
    Tobias Hilbrich
    Matthias S. Müller
    Bettina Krammer
    International Journal of Parallel Programming, 2009, 37 : 277 - 291
  • [23] MPI Correctness Checking for OpenMP/MPI Applications
    Hilbrich, Tobias
    Mueller, Matthias S.
    Krammer, Bettina
    INTERNATIONAL JOURNAL OF PARALLEL PROGRAMMING, 2009, 37 (03) : 277 - 291
  • [24] Taking the MPI standard and the open MPI library to exascale
    Bernholdt, David E.
    Bosilca, George
    Bouteiller, Aurelien
    Brightwell, Ron
    Ciesko, Jan
    Dosanjh, Matthew G. F.
    Georgakoudis, Giorgis
    Laguna, Ignacio
    Levy, Scott
    Naughton, Thomas
    Olivier, Stephen L.
    Pritchard, Howard P.
    Schonbein, Whit
    Schuchart, Joseph
    Shehata, Amir
    INTERNATIONAL JOURNAL OF HIGH PERFORMANCE COMPUTING APPLICATIONS, 2024, 38 (05): : 491 - 507
  • [25] Open MPI: A high-performance, heterogeneous MPI
    Graham, Richard L.
    Shipman, Galen M.
    Barrett, Brian W.
    Castain, Ralph H.
    Bosilca, George
    Lumsdaine, Andrew
    2006 IEEE INTERNATIONAL CONFERENCE ON CLUSTER COMPUTING, VOLS 1 AND 2, 2006, : 621 - +
  • [26] Legio: fault resiliency for embarrassingly parallel MPI applications
    Rocco, Roberto
    Gadioli, Davide
    Palermo, Gianluca
    JOURNAL OF SUPERCOMPUTING, 2022, 78 (02): : 2175 - 2195
  • [27] Legio: fault resiliency for embarrassingly parallel MPI applications
    Roberto Rocco
    Davide Gadioli
    Gianluca Palermo
    The Journal of Supercomputing, 2022, 78 : 2175 - 2195
  • [28] Analyzing the Parallel I/O Severity of MPI Applications
    Mendez, Sandra
    Rexachs, Dolores
    Luque, Emilio
    2017 17TH IEEE/ACM INTERNATIONAL SYMPOSIUM ON CLUSTER, CLOUD AND GRID COMPUTING (CCGRID), 2017, : 953 - 962
  • [29] Efficient Execution of Parallel Applications in Grid with MPI library
    Ionescu, Felicia
    Chiru, Mihai
    Sandulescu, Vlad
    Ionescu, Mihail
    MATHEMATICAL METHODS AND APPLIED COMPUTING, VOL 1, 2009, : 384 - +
  • [30] Completely debugging indeterminate MPI/PVM programs
    Wang, F.
    An, H.
    Chen, Z.H.
    Chen, G.L.
    Ruan Jian Xue Bao/Journal of Software, 2001, 12 (03): : 334 - 339