Visualization of Memory Access Behavior on Hierarchical NUMA Architectures

被引:6
|
作者
Weyers, Benjamin [1 ]
Terboven, Christian [1 ]
Schmidl, Dirk [1 ]
Herber, Joachim [1 ]
Kuhlen, Torsten W. [1 ]
Uller, Matthias S. M. [1 ]
Hentschel, Bernd [1 ]
机构
[1] Rhein Westfal TH Aachen, IT Ctr, JARA High Performance Comp, D-52074 Aachen, Germany
关键词
PERFORMANCE;
D O I
10.1109/VPA.2014.12
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
The available memory bandwidth of existing high performance computing platforms turns out as being more and more the limitation to various applications. Therefore, modern microarchitectures integrate the memory controller on the processor chip, which leads to a non-uniform memory access behavior of such systems. This access behavior in turn entails major challenges in the development of shared memory parallel applications. An improperly implemented memory access functionality results in a bad ratio between local and remote memory access, and causes low performance on such architectures. To address this problem, the developers of such applications rely on tools to make these kinds of performance problems visible. This work presents a new tool for the visualization of performance data of the non-uniform memory access behavior. Because of the visual design of the tool, the developer is able to judge the severity of remote memory access in a time-dependent simulation, which is currently not possible using existing tools.
引用
收藏
页码:42 / 49
页数:8
相关论文
共 50 条
  • [1] Visualizing the memory access behavior of shared memory applications on NUMA architectures
    Tao, J
    Karl, W
    Schulz, M
    [J]. COMPUTATIONAL SCIENCE -- ICCS 200, PROCEEDINGS PT 2, 2001, 2074 : 861 - 870
  • [2] Nap: Persistent Memory Indexes for NUMA Architectures
    Wang, Qing
    Lu, Youyou
    Li, Junru
    Xie, Minhui
    Shu, Jiwu
    [J]. ACM TRANSACTIONS ON STORAGE, 2022, 18 (01)
  • [3] Data access collection and data partitioning for NUMA architectures
    Calidonna, CR
    Furnari, MM
    [J]. ADVANCES IN COMPUTATIONAL MECHANICS WITH HIGH PERFORMANCE COMPUTING, 1998, : 33 - 40
  • [4] Near-Optimal Placement of MPI Processes on Hierarchical NUMA Architectures
    Jeannot, Emmanuel
    Mercier, Guillaume
    [J]. EURO-PAR 2010 - PARALLEL PROCESSING, PART II, 2010, 6272 : 199 - 210
  • [5] Volume visualization on shared memory architectures
    Koning, AHJ
    Zuiderveld, KJ
    Viergever, MA
    [J]. PARALLEL COMPUTING, 1997, 23 (07) : 915 - 925
  • [6] Comparative modeling and evaluation of CC-NUMA and COMA on hierarchical ring architectures
    Zhang, XD
    Yan, Y
    [J]. IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 1995, 6 (12) : 1316 - 1331
  • [7] An Adaptive and Hierarchical CPU Allocation for Multicore NUMA Memory Organization
    Kang, Dongwoo
    Park, Heekwon
    Choi, Jongmoo
    [J]. INFORMATION-AN INTERNATIONAL INTERDISCIPLINARY JOURNAL, 2011, 14 (05): : 1595 - 1611
  • [8] Evaluation of memory performance in NUMA architectures using Stochastic Reward Nets
    Entezari-Maleki, Reza
    Cho, Younghyun
    Egger, Bernhard
    [J]. JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 2020, 144 : 172 - 188
  • [9] AdaptMD: Balancing Space and Performance in NUMA Architectures With Adaptive Memory Deduplication
    Yao, Lulu
    Li, Yongkun
    Lee, Patrick P. C.
    Wang, Xiaoyang
    Xu, Yinlong
    [J]. IEEE TRANSACTIONS ON COMPUTERS, 2024, 73 (06) : 1588 - 1602
  • [10] A performance comparison of data and memory allocation strategies for sequence aligners on NUMA architectures
    Josefina Lenis
    Miquel Angel Senar
    [J]. Cluster Computing, 2017, 20 : 1909 - 1924