Automatic Performance Analysis of OpenMP Codes on a Scalable Shared Memory System Using Periscope

Cited by: 0
Authors
Benedict, Shajulin [1]
Gerndt, Michael [1]
Affiliations
[1] Tech Univ Munich, Fak Informat I10, D-85748 Garching, Germany
Keywords
Memory accesses analysis; OpenMP; Performance analysis; Supercomputers;
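DOI
Not available
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract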
OpenMP is a successful interface for programming parallel applications on shared memory systems. It is widely applied on small-scale shared memory systems such as multicore processors, and also in hybrid programming on large supercomputers. This paper presents performance properties for OpenMP and their automatic detection by Periscope. We evaluate Periscope's OpenMP analysis strategy on the Altix 4700 supercomputer at the Leibniz Computing Center (LRZ) in Garching. On this unique machine, OpenMP scales up to 500 cores, which corresponds to one of the machine's 19 partitions. We present results for the NAS parallel benchmarks and for a large hybrid scientific application.
Pages: 452-462
Number of pages: 11
Related papers
50 in total
  • [1] SCALABILITY AND PERFORMANCE ANALYSIS OF OPENMP CODES USING THE PERISCOPE TOOLKIT
    Benedict, Shajulin
    Gerndt, Michael
    [J]. COMPUTING AND INFORMATICS, 2014, 33 (04) : 921 - 942
  • [2] UPMLIB: A runtime system for tuning the memory performance of OpenMP programs on scalable shared-memory multiprocessors
    Nikolopoulos, DS
    Papatheodorou, TS
    Polychronopoulos, CD
    Labarta, J
    Ayguadé, E
    [J]. LANGUAGES, COMPILERS, AND RUN-TIME SYSTEMS FOR SCALABLE COMPUTERS, 2000, 1915 : 85 - 99
  • [3] Automatic performance analysis with Periscope
    Gerndt, M.
    Ott, M.
    [J]. CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2010, 22 (06) : 736 - 748
  • [4] Automatic memory access analysis with Periscope
    Gerndt, Michael
    Kereku, Edmond
    [J]. COMPUTATIONAL SCIENCE - ICCS 2007, PT 2, PROCEEDINGS, 2007, 4488 : 847 - +
  • [5] Parallel PIC codes for distributed and shared memory architectures with HPF and OpenMP
    Di Martino, B
    Briguglio, S
    Fogaccia, G
    Vlad, G
    [J]. PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED PROCESSING TECHNIQUES AND APPLICATIONS, VOLS I-V, 2000, : 2233 - 2239
  • [6] Performance comparison of MPI and OpenMP on shared memory multiprocessors
    Krawezik, G
    Cappello, F
    [J]. CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2006, 18 (01) : 29 - 61
  • [7] Performance of cluster-enabled OpenMP for the SCASH software distributed shared memory system
    Ojima, Y
    Sato, M
    Harada, H
    Ishikawa, Y
    [J]. CCGRID 2003: 3RD IEEE/ACM INTERNATIONAL SYMPOSIUM ON CLUSTER COMPUTING AND THE GRID, PROCEEDINGS, 2003, : 450 - 456
  • [8] Performance Evaluation of Long Integer Multiplication Using OpenMP and MPI on Shared Memory Architecture
    Tembhurne, Jitendra V.
    Sathe, Shailesh R.
    [J]. 2014 SEVENTH INTERNATIONAL CONFERENCE ON CONTEMPORARY COMPUTING (IC3), 2014, : 283 - 288
  • [9] Performance of scalable shared-memory architectures
    Motlagh, BS
    DeMara, RF
    [J]. JOURNAL OF CIRCUITS SYSTEMS AND COMPUTERS, 2000, 10 (1-2) : 1 - 22
  • [10] Parallel molecular dynamics using OPENMP on a shared memory machine
    Couturier, R
    Chipot, C
    [J]. COMPUTER PHYSICS COMMUNICATIONS, 2000, 124 (01) : 49 - 59