Parallel Performance Wizard: A Performance System for the Analysis of Partitioned Global-Address-Space Applications

被引:3
|
作者
Su, Hung-Hsun [1 ]
Billingsley, Max, III [1 ]
George, Alan D. [1 ]
机构
[1] Univ Florida, Dept Elect & Comp Engn, Res Lab, Gainesville, FL 32611 USA
关键词
Performance analysis tool; PGAS; UPC; SHMEM; GASP; generic-operation-type abstraction; TOOL;
D O I
10.1177/1094342010370953
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Given the complexity of high-performance parallel programs, developers often must rely on performance analysis tools to help them improve the performance of their applications. While many tools support analysis of message-passing programs, tool support is limited for applications written in programming models that present a partitioned global address space (PGAS) to the programmer such as UPC and SHMEM. Existing tools that support message-passing models are difficult to extend to support PGAS models due to differences between the two paradigms and the techniques used in their implementations. In this paper, we present our work on Parallel Performance Wizard (PPW), a performance analysis system for PGAS and MPI application analysis. We discuss new concepts, namely the generic-operation-type abstraction and GASP-enabled data collection, developed to facilitate support for multiple programming models and then give an overview of PPW's automatic analysis and visualization capabilities. Finally, to show the usefulness of our system, we present results on PPW's overhead, storage requirements and scalability before demonstrating its effectiveness via application case studies.
引用
收藏
页码:485 / 510
页数:26
相关论文
共 50 条
  • [1] Parallel performance wizard: A performance analysis tool for partitioned global-address-space programming
    Su, Hung-Hsun
    Billingsley, Max, III
    George, Alan D.
    [J]. 2008 IEEE INTERNATIONAL SYMPOSIUM ON PARALLEL & DISTRIBUTED PROCESSING, VOLS 1-8, 2008, : 3033 - 3040
  • [2] Productivity and performance using partitioned global address space languages
    University of California, Berkeley
    不详
    [J]. PASCO: Proc Int workshop Parallel Symb Comput, 2007, (24-32):
  • [3] Partitioned Global Address Space Languages
    De Wael, Mattias
    Marr, Stefan
    De Fraine, Bruno
    Van Cutsem, Tom
    De Meuter, Wolfgang
    [J]. ACM COMPUTING SURVEYS, 2015, 47 (04)
  • [4] Improving Data Locality for Irregular Partitioned Global Address Space Parallel Programs
    Larkins, D. Brian
    [J]. PROCEEDINGS OF THE 50TH ANNUAL ASSOCIATION FOR COMPUTING MACHINERY SOUTHEAST CONFERENCE, 2012,
  • [5] Performance characterization of global address space applications: a case study with NWChem
    Hammond, Jeff R.
    Krishnamoorthy, Sriram
    Shende, Sameer
    Romero, Nichols A.
    Malony, Allen D.
    [J]. CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2012, 24 (02): : 135 - 154
  • [6] Global address space, non-uniform bandwidth: A memory system performance characterization of parallel systems
    Stricker, T
    Gross, T
    [J]. THIRD INTERNATIONAL SYMPOSIUM ON HIGH-PERFORMANCE COMPUTER ARCHITECTURE - PROCEEDINGS, 1997, : 168 - 179
  • [7] A Runtime System for Finite Element Methods in a Partitioned Global Address Space
    Groth, Stefan
    Grunewald, Daniel
    Teich, Jurgen
    Hannig, Frank
    [J]. 17TH ACM INTERNATIONAL CONFERENCE ON COMPUTING FRONTIERS 2020 (CF 2020), 2020, : 39 - 48
  • [8] Efficient Runtime Support for a Partitioned Global Logical Address Space
    Larkins, D. Brian
    Snyder, John
    Dinan, James
    [J]. PROCEEDINGS OF THE 47TH INTERNATIONAL CONFERENCE ON PARALLEL PROCESSING, 2018,
  • [9] Design and evaluation of efficient global data movement in partitioned global address space
    Murai, Hitoshi
    Sato, Mitsuhisa
    [J]. PARALLEL COMPUTING, 2020, 96
  • [10] Instrumentation database system for performance analysis of parallel scientific applications
    Nesheiwat, J
    Szymanski, BK
    [J]. PARALLEL COMPUTING, 2002, 28 (10) : 1409 - 1449