μprofiler:: Profiling user-level threads in a shared-memory programming environment

被引:0
|
作者
Buhr, PA [1 ]
Denda, R
机构
[1] Univ Waterloo, Dept Comp Sci, Waterloo, ON N2L 3G1, Canada
[2] Univ Mannheim, Fak Math & Informat, D-6800 Mannheim, Germany
关键词
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
A profiler is an important tool for understanding the dynamic behaviour of concurrent programs to locate problems and optimize performance The best way to improve profiling capabilities and reduce the time to analyze a concurrent program is to use a target-specific profiler that understands the underlying concurrent runtime environment. A profiler for understanding execution of user and kernel level threads is presented, which is target-specific for the mu C++ concurrency system. This allows the insertion of hooks into the mu C++ data structures and runtime kernel to ensure crucial operations are monitored exactly. Because the profiler is written in mu C++ and has an extendible design, it is easy for users to write new metrics and incorporate them into the profiler.
引用
收藏
页码:159 / 166
页数:8
相关论文
共 50 条
  • [21] Fast communication mechanisms - Coupling hardware distributed shared memory and user-level messaging
    Hellwagner, H
    Karl, W
    Leberecht, M
    INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED PROCESSING TECHNIQUES AND APPLICATIONS, VOLS I-III, PROCEEDINGS, 1997, : 1294 - 1301
  • [22] The design of user-level distributed shared memory system in SCI-based clusters
    Ding, KS
    Tsay, JJ
    PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED PROCESSING TECHNIQUES AND APPLICATIONS, VOLS I-V, 2000, : 587 - 593
  • [23] Parallelized Software Offloading of Low-Level Communication with User-Level Threads
    Endo, Wataru
    Taura, Kenjiro
    PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING IN ASIA-PACIFIC REGION (HPC ASIA 2018), 2018, : 289 - 298
  • [24] RTC-Threads: A user-level real-time threads package for multimedia systems
    Oikawa, S
    Tokuda, H
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 1996, E79D (10) : 1443 - 1452
  • [25] Correlation prefetching with a user-level memory thread
    Solihin, Y
    Lee, J
    Torrellas, J
    IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2003, 14 (06) : 563 - 580
  • [26] Analyzing the Performance Trade-Off in Implementing User-Level Threads
    Iwasaki, Shintaro
    Amer, Abdelhalim
    Taura, Kenjiro
    Balaji, Pavan
    IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2020, 31 (08) : 1859 - 1877
  • [27] MPI plus ULT: Overlapping Communication and Computation with User-Level Threads
    Lu, Huiwei
    Seo, Sangmin
    Balaji, Pavan
    2015 IEEE 17TH INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING AND COMMUNICATIONS, 2015 IEEE 7TH INTERNATIONAL SYMPOSIUM ON CYBERSPACE SAFETY AND SECURITY, AND 2015 IEEE 12TH INTERNATIONAL CONFERENCE ON EMBEDDED SOFTWARE AND SYSTEMS (ICESS), 2015, : 444 - 454
  • [28] OpenMP: An industry standard API for shared-memory programming
    Dagum, L
    Menon, R
    IEEE COMPUTATIONAL SCIENCE & ENGINEERING, 1998, 5 (01): : 46 - 55
  • [29] Performance Analysis of a User-level Memory Server
    Pakin, Scott
    Johnson, Greg
    2007 IEEE INTERNATIONAL CONFERENCE ON CLUSTER COMPUTING, 2007, : 249 - 258
  • [30] Memory management for user-level network interfaces
    Welsh, M
    Basu, A
    Huang, XW
    von Eicken, T
    IEEE MICRO, 1998, 18 (02) : 77 - 82