Visualization of Memory Access Behavior on Hierarchical NUMA Architectures

被引:6
|
作者
Weyers, Benjamin [1 ]
Terboven, Christian [1 ]
Schmidl, Dirk [1 ]
Herber, Joachim [1 ]
Kuhlen, Torsten W. [1 ]
Uller, Matthias S. M. [1 ]
Hentschel, Bernd [1 ]
机构
[1] Rhein Westfal TH Aachen, IT Ctr, JARA High Performance Comp, D-52074 Aachen, Germany
关键词
PERFORMANCE;
D O I
10.1109/VPA.2014.12
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
The available memory bandwidth of existing high performance computing platforms turns out as being more and more the limitation to various applications. Therefore, modern microarchitectures integrate the memory controller on the processor chip, which leads to a non-uniform memory access behavior of such systems. This access behavior in turn entails major challenges in the development of shared memory parallel applications. An improperly implemented memory access functionality results in a bad ratio between local and remote memory access, and causes low performance on such architectures. To address this problem, the developers of such applications rely on tools to make these kinds of performance problems visible. This work presents a new tool for the visualization of performance data of the non-uniform memory access behavior. Because of the visual design of the tool, the developer is able to judge the severity of remote memory access in a time-dependent simulation, which is currently not possible using existing tools.
引用
收藏
页码:42 / 49
页数:8
相关论文
共 50 条
  • [31] An Adaptive Concurrent Priority Queue for NUMA Architectures
    Strati, Foteini
    Giannoula, Christina
    Siakavaras, Dimitrios
    Goumas, Georgios
    Koziris, Nectarios
    [J]. CF '19 - PROCEEDINGS OF THE 16TH ACM INTERNATIONAL CONFERENCE ON COMPUTING FRONTIERS, 2019, : 135 - 144
  • [32] ForestGOMP: An Efficient OpenMP Environment for NUMA Architectures
    François Broquedis
    Nathalie Furmento
    Brice Goglin
    Pierre-André Wacrenier
    Raymond Namyst
    [J]. International Journal of Parallel Programming, 2010, 38 : 418 - 439
  • [33] Balancing Shared and Distributed Heaps on NUMA Architectures
    Aljabri, Malak
    Loidl, Hans-Wolfgang
    Trinder, Phil
    [J]. TRENDS IN FUNCTIONAL PROGRAMMING, TFP 2014, 2015, 8843 : 1 - 17
  • [34] Gecko: Hierarchical Distributed View of Heterogeneous Shared Memory Architectures
    Ghane, Millad
    Chandrasekaran, Sunita
    Cheung, Margaret S.
    [J]. PROCEEDINGS OF THE TENTH INTERNATIONAL WORKSHOP ON PROGRAMMING MODELS AND APPLICATIONS FOR MULTICORES AND MANYCORES (PMAM 2019), 2019, : 21 - 30
  • [35] PERFORMANCE COMPARISONS IN HIERARCHICAL ARCHITECTURES FOR MEMORY NETWORK PATTERN CLASSIFIERS
    FAIRHURST, MC
    MAIA, MAGM
    [J]. PATTERN RECOGNITION LETTERS, 1986, 4 (02) : 121 - 124
  • [36] Measuring Memory Access Latency for Software Objects in a NUMA System-on-Chip Architecture
    Genius, Daniela
    [J]. 2013 8TH INTERNATIONAL WORKSHOP ON RECONFIGURABLE AND COMMUNICATION-CENTRIC SYSTEMS-ON-CHIP (RECOSOC), 2013,
  • [37] Exploring thread and memory placement on NUMA architectures: Solaris and Linux, UltraSPARC/FirePlane and Opteron/Hyper Transport
    Antony, Joseph
    Janes, Pete P.
    Rendell, Alistair P.
    [J]. HIGH PERFORMANCE COMPUTING - HIPC 2006, PROCEEDINGS, 2006, 4297 : 338 - +
  • [38] NumaMMA: NUMA MeMory Analyzer
    Trahay, Francois
    Selva, Manuel
    Morel, Lionel
    Marquet, Kevin
    [J]. PROCEEDINGS OF THE 47TH INTERNATIONAL CONFERENCE ON PARALLEL PROCESSING, 2018,
  • [39] NUMA Distance for Heterogeneous Memory
    Williams, Sean
    Ionkov, Latchesar
    Lang, Michael
    [J]. PROCEEDINGS OF MCHPC'17: WORKSHOP ON MEMORY CENTRIC PROGRAMMING FOR HPC, 2017, : 30 - 34
  • [40] NUMAPROF, A NUMA Memory Profiler
    Valat, Sebastien
    Bouizi, Othman
    [J]. EURO-PAR 2018: PARALLEL PROCESSING WORKSHOPS, 2019, 11339 : 159 - 170