Case Studies on the Impact and Challenges of Heterogeneous NUMA Architectures for HPC

Authors
Zaourar, Lilia [1]
Benazouz, Mohamed [1]
Mouhagir, Ayoub [1]
Falquez, Carlos [2]
Portero, Antoni [2]
Ho, Nam [2]
Suarez, Estela [2]
Petrakis, Polydoros [3]
Marazakis, Manolis [3]
Sgherzi, Francesco [4]
Fernandez, Ivan [4]
Dolbeau, Romain [5]
Pleiter, Dirk [6]
Affiliations
[1] Univ Paris Saclay, List, CEA, F-91120 Palaiseau, France
[2] Forschungszentrum Julich, Inst Adv Simulat, Julich Supercomp Ctr, Julich, Germany
[3] Fdn Res & Technol Hellas FORTH, Inst Comp Sci, Iraklion, Greece
[4] Barcelona Supercomp Ctr BSC, Barcelona, Spain
[5] SiPearl, Rennes, France
[6] KTH Royal Inst Technol, Stockholm, Sweden
Keywords
Non-Uniform Memory Access (NUMA); co-design; simulation; High Performance Computing (HPC); benchmarking
DOI
10.1007/978-3-031-66146-4_17
Chinese Library Classification
TP3 [Computing technology, computer technology]
Discipline classification code
0812
Abstract
The memory systems of High-Performance Computing (HPC) systems commonly feature non-uniform data paths to memory, i.e., they are non-uniform memory access (NUMA) architectures. Memory is divided into multiple regions, and each processing unit has its own local memory. For each processing unit, access to its local memory region is therefore faster than access to non-local regions. Architectures with hybrid memory technologies introduce further non-uniformity. This paper presents case studies of the performance potential and data placement implications of non-uniform and heterogeneous memory in HPC systems. Using the gem5 and VPSim simulation platforms, we model NUMA systems with processors based on the ARMv8 Neoverse V1 Reference Design. The gem5 simulator provides a cycle-accurate view, while VPSim offers greater simulation speed with a high-level view of the simulated system. We highlight the performance impact of design trade-offs regarding NUMA node organization and System Level Cache (SLC) group assignment, as well as Network-on-Chip (NoC) configuration. Our case studies provide essential input to a co-design process involving HPC processor architects and system integrators. A comparison of system configurations for different NoC bandwidths shows reduced NoC latency and a large improvement in memory bandwidth when NUMA control is enabled. Furthermore, a configuration with HBM2 memory organized as four NUMA nodes highlights the gap in memory bandwidth and the impact of NoC queuing latency when comparing local and remote memory accesses. On the other hand, NUMA can result in an unbalanced distribution of memory accesses and reduced SLC hit ratios, as shown with DDR4 memory organized as four NUMA nodes.
Pages: 251-265
Page count: 15