Visualization of Memory Access Behavior on Hierarchical NUMA Architectures

被引:6
|
作者
Weyers, Benjamin [1 ]
Terboven, Christian [1 ]
Schmidl, Dirk [1 ]
Herber, Joachim [1 ]
Kuhlen, Torsten W. [1 ]
Uller, Matthias S. M. [1 ]
Hentschel, Bernd [1 ]
机构
[1] Rhein Westfal TH Aachen, IT Ctr, JARA High Performance Comp, D-52074 Aachen, Germany
关键词
PERFORMANCE;
D O I
10.1109/VPA.2014.12
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
The available memory bandwidth of existing high performance computing platforms turns out as being more and more the limitation to various applications. Therefore, modern microarchitectures integrate the memory controller on the processor chip, which leads to a non-uniform memory access behavior of such systems. This access behavior in turn entails major challenges in the development of shared memory parallel applications. An improperly implemented memory access functionality results in a bad ratio between local and remote memory access, and causes low performance on such architectures. To address this problem, the developers of such applications rely on tools to make these kinds of performance problems visible. This work presents a new tool for the visualization of performance data of the non-uniform memory access behavior. Because of the visual design of the tool, the developer is able to judge the severity of remote memory access in a time-dependent simulation, which is currently not possible using existing tools.
引用
收藏
页码:42 / 49
页数:8
相关论文
共 50 条
  • [41] Access-list: The structure for hierarchical snoop-based CC-NUMA multiprocessor systems
    Suh, HJ
    Jhon, CS
    [J]. INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED PROCESSING TECHNIQUES AND APPLICATIONS, VOLS I-IV, PROCEEDINGS, 1998, : 1504 - 1511
  • [42] ANALYTICAL ESTIMATION OF VECTOR ACCESS PERFORMANCE IN PARALLEL MEMORY ARCHITECTURES
    HARPER, DT
    COSTA, Y
    [J]. IEEE TRANSACTIONS ON COMPUTERS, 1993, 42 (05) : 616 - 624
  • [43] EFFICIENT ARCHITECTURES FOR DATA ACCESS IN A SHARED MEMORY-HIERARCHY
    PADMANABHAN, K
    [J]. JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 1991, 11 (04) : 314 - 327
  • [44] Memory access profiling tools for alpha-based architectures
    Balle, SM
    Steely, SC
    [J]. APPLIED PARALLEL COMPUTING: LARGE SCALE SCIENTIFIC AND INDUSTRIAL PROBLEMS, 1998, 1541 : 28 - 37
  • [45] Memory access optimization of dynamic binary translation for reconfigurable architectures
    Oh, SJ
    Kim, TG
    [J]. ICCAD-2005: INTERNATIONAL CONFERENCE ON COMPUTER AIDED DESIGN, DIGEST OF TECHNICAL PAPERS, 2005, : 1014 - 1020
  • [46] Memory-Equipped Quantum Architectures: The Power of Random Access
    Baker, Jonathan M.
    Schuster, David, I
    Chong, Frederic T.
    [J]. PACT '20: PROCEEDINGS OF THE ACM INTERNATIONAL CONFERENCE ON PARALLEL ARCHITECTURES AND COMPILATION TECHNIQUES, 2020, : 387 - 398
  • [47] Optimized Execution Strategies for Sequence Aligners on NUMA Architectures
    Lenis, Josefina
    Senar, Miquel Angel
    [J]. EURO-PAR 2016: PARALLEL PROCESSING WORKSHOPS, 2017, 10104 : 492 - 503
  • [48] Compiler Support for Selective Page Migration in NUMA Architectures
    Piccoli, Guilherme
    Santos, Henrique N.
    Rodrigues, Raphael E.
    Pousa, Christiane
    Borin, Edson
    Magno, Fernando
    Pereira, Quintao
    [J]. PROCEEDINGS OF THE 23RD INTERNATIONAL CONFERENCE ON PARALLEL ARCHITECTURES AND COMPILATION TECHNIQUES (PACT'14), 2014, : 369 - 380
  • [49] Switch cache: A framework for improving the remote memory access latency of CC-NUMA multiprocessors
    Iyer, R
    Bhuyan, LN
    [J]. FIFTH INTERNATIONAL SYMPOSIUM ON HIGH-PERFORMANCE COMPUTER ARCHITECTURE, PROCEEDINGS, 1999, : 152 - 160
  • [50] Hierarchical algorithms on hierarchical architectures
    Keyes, D. E.
    Ltaief, H.
    Turkiyyah, G.
    [J]. PHILOSOPHICAL TRANSACTIONS OF THE ROYAL SOCIETY A-MATHEMATICAL PHYSICAL AND ENGINEERING SCIENCES, 2020, 378 (2166):