Visualization of Memory Access Behavior on Hierarchical NUMA Architectures

被引:6
|
作者
Weyers, Benjamin [1 ]
Terboven, Christian [1 ]
Schmidl, Dirk [1 ]
Herber, Joachim [1 ]
Kuhlen, Torsten W. [1 ]
Uller, Matthias S. M. [1 ]
Hentschel, Bernd [1 ]
机构
[1] Rhein Westfal TH Aachen, IT Ctr, JARA High Performance Comp, D-52074 Aachen, Germany
关键词
PERFORMANCE;
D O I
10.1109/VPA.2014.12
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
The available memory bandwidth of existing high performance computing platforms turns out as being more and more the limitation to various applications. Therefore, modern microarchitectures integrate the memory controller on the processor chip, which leads to a non-uniform memory access behavior of such systems. This access behavior in turn entails major challenges in the development of shared memory parallel applications. An improperly implemented memory access functionality results in a bad ratio between local and remote memory access, and causes low performance on such architectures. To address this problem, the developers of such applications rely on tools to make these kinds of performance problems visible. This work presents a new tool for the visualization of performance data of the non-uniform memory access behavior. Because of the visual design of the tool, the developer is able to judge the severity of remote memory access in a time-dependent simulation, which is currently not possible using existing tools.
引用
收藏
页码:42 / 49
页数:8
相关论文
共 50 条
  • [21] Unfair Scheduling Patterns in NUMA Architectures
    Ben-David, Naama
    Scully, Ziv
    Blelloch, Guy E.
    [J]. 2019 28TH INTERNATIONAL CONFERENCE ON PARALLEL ARCHITECTURES AND COMPILATION TECHNIQUES (PACT 2019), 2019, : 205 - 218
  • [22] Performance bounds for modeling NUMA architectures
    Geist, R
    [J]. INFORMATION PROCESSING LETTERS, 1997, 63 (02) : 113 - 117
  • [23] Performance oriented programming for NUMA architectures
    Chapman, B
    Patil, A
    Prabhakar, A
    [J]. OPENMP SHARED MEMORY PARALLEL PROGRAMMING, PROCEEDINGS, 2001, 2104 : 137 - 154
  • [24] NUMA (Non-Uniform Memory Access): An overview: NUMA becomes more common because memory controllers get close to execution units on microprocessors
    [J]. Lameter, C, 1600, Association for Computing Machinery, 2 Penn Plaza, Suite 701, New York, NY 10121-0701, United States (11):
  • [25] Binding Nested OpenMP Programs on Hierarchical Memory Architectures
    Schmidl, Dirk
    Terboven, Christian
    Mey, Dieter An
    Buecker, Martin
    [J]. BEYOND LOOP LEVEL PARALLELISM IN OPENMP: ACCELERATORS, TASKING AND MORE, PROCEEDINGS, 2010, 6132 : 29 - +
  • [26] Analyzing and Improving Memory Access Patterns of Large Irregular Applications on NUMA Machines
    Mariano, Artur
    Diener, Matthias
    Bischof, Christian
    Navaux, Philippe O. A.
    [J]. 2016 24TH EUROMICRO INTERNATIONAL CONFERENCE ON PARALLEL, DISTRIBUTED, AND NETWORK-BASED PROCESSING (PDP), 2016, : 382 - 387
  • [27] Supporting concurrent memory access in TCF processor architectures
    Forsell, Martti
    Roivainen, Jussi
    Leppanen, Ville
    Traeff, Jesper Larsson
    [J]. MICROPROCESSORS AND MICROSYSTEMS, 2018, 63 : 226 - 236
  • [28] ForestGOMP: An Efficient OpenMP Environment for NUMA Architectures
    Broquedis, Francois
    Furmento, Nathalie
    Goglin, Brice
    Wacrenier, Pierre-Andre
    Namyst, Raymond
    [J]. INTERNATIONAL JOURNAL OF PARALLEL PROGRAMMING, 2010, 38 (5-6) : 418 - 439
  • [29] Task-Parallel Programming on NUMA Architectures
    Terboven, Christian
    Schmidl, Dirk
    Cramer, Tim
    Mey, Dieter An
    [J]. EURO-PAR 2012 PARALLEL PROCESSING, 2012, 7484 : 638 - 649
  • [30] ForestGOMP: An Efficient OpenMP Environment for NUMA Architectures
    François Broquedis
    Nathalie Furmento
    Brice Goglin
    Pierre-André Wacrenier
    Raymond Namyst
    [J]. International Journal of Parallel Programming, 2010, 38 : 418 - 439