MTOOL - AN INTEGRATED SYSTEM FOR PERFORMANCE DEBUGGING SHARED MEMORY MULTIPROCESSOR APPLICATIONS

被引:35
|
作者
GOLDBERG, AJ [1 ]
HENNESSY, JL [1 ]
机构
[1] STANFORD UNIV,COMP SYST LAB,STANFORD,CA 94305
关键词
MEMORY SYSTEM BEHAVIOR; PARALLEL PROGRAMMING; PERFORMANCE DEBUGGING; SHARED MEMORY MULTIPROCESSOR; SOFTWARE INSTRUMENTATION;
D O I
10.1109/71.205651
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
This paper describes Mtool, a software tool for analyzing performance losses in shared memory parallel programs. Mtool augments a program with low overhead instrumentation which perturbs the program's execution as little as possible while generating enough information to isolate memory and synchronization bottlenecks. After running the instrumented version of the parallel program, the programmer can use Mtool's window-based user interface to view compute time, memory, and synchronization bottlenecks at increasing levels of detail from a whole program level down to the level of individual procedures, loops, and synchronization objects. The paper describes Mtool's low overhead instrumentation methods, memory bottleneck detection technique, and attention focusing mechanisms, contrasts Mtool with other approaches, and offers a case study to demonstrate the effectiveness of Mtool.
引用
收藏
页码:28 / 40
页数:13
相关论文
共 50 条
  • [41] A SHARED-MEMORY MULTIPROCESSOR LOGIC SIMULATOR
    BEIHL, G
    EIGHTH ANNUAL INTERNATIONAL PHOENIX CONFERENCE ON COMPUTERS AND COMMUNICATIONS: 1989 CONFERENCE PROCEEDINGS, 1989, : 26 - 28
  • [42] Multitasking and multithreading on a multiprocessor with virtual shared memory
    Muller, HL
    Stallard, PWA
    Warren, DHD
    SECOND INTERNATIONAL SYMPOSIUM ON HIGH-PERFORMANCE COMPUTER ARCHITECTURE, PROCEEDINGS, 1996, : 212 - 221
  • [43] PVM in a shared-memory industrial multiprocessor
    Appiani, E
    Bologna, M
    Corvi, M
    HIGH-PERFORMANCE COMPUTING AND NETWORKING, 1995, 919 : 588 - 593
  • [44] PARALLEL GIVENS FACTORIZATION ON A SHARED MEMORY MULTIPROCESSOR
    DAOUDI, E
    LIBERT, G
    LECTURE NOTES IN COMPUTER SCIENCE, 1990, 457 : 131 - 142
  • [45] FINITE-ELEMENT APPLICATIONS ON A SHARED-MEMORY MULTIPROCESSOR - ALGORITHMS AND EXPERIMENTAL RESULTS
    NATARAJAN, R
    JOURNAL OF COMPUTATIONAL PHYSICS, 1991, 94 (02) : 352 - 381
  • [47] MACROMOLECULAR DYNAMICS ON A SHARED-MEMORY MULTIPROCESSOR
    SKEEL, RD
    JOURNAL OF COMPUTATIONAL CHEMISTRY, 1991, 12 (02) : 175 - 179
  • [48] CSP occam on shared memory multiprocessor workstations
    Vella, K
    Welch, PH
    ARCHITECTURES, LANGUAGES AND TECHNIQUES FOR CONCURRENT SYSTEMS, 1999, 57 : 87 - 119
  • [49] Low power tasks mapping for DVS capable multiprocessor system with shared memory
    Li, M
    Wu, XB
    Zhu, XH
    Wang, H
    PROCEEDINGS OF THE IASTED INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED COMPUTING AND NETWORKS, 2004, : 126 - 133
  • [50] A large-area integrated multiprocessor system for video applications
    Rudack, M
    Redeker, M
    Hilgenstock, J
    Moch, S
    Castagne, J
    IEEE DESIGN & TEST OF COMPUTERS, 2002, 19 (01): : 6 - 17