Case Studies on the Impact and Challenges of Heterogeneous NUMA Architectures for HPC

被引:0
|
作者
Zaourar, Lilia [1 ]
Benazouz, Mohamed [1 ]
Mouhagir, Ayoub [1 ]
Falquez, Carlos [2 ]
Portero, Antoni [2 ]
Ho, Nam [2 ]
Suarez, Estela [2 ]
Petrakis, Polydoros [3 ]
Marazakis, Manolis [3 ]
Sgherzi, Francesco [4 ]
Fernandez, Ivan [4 ]
Dolbeau, Romain [5 ]
Pleiter, Dirk [6 ]
机构
[1] Univ Paris Saclay, List, CEA, F-91120 Palaiseau, France
[2] Forschungszentrum Julich, Inst Adv Simulat, Julich Supercomp Ctr, Julich, Germany
[3] Fdn Res & Technol Hellas FORTH, Inst Comp Sci, Iraklion, Greece
[4] Barcelona Supercomp Ctr BSC, Barcelona, Spain
[5] SiPearl, Rennes, France
[6] KTH Royal Inst Technol, Stockholm, Sweden
关键词
Non-Uniform Memory Access (NUMA); co-design; simulation; High Performance Computing (HPC); benchmarking;
D O I
10.1007/978-3-031-66146-4_17
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
The memory systems of High-Performance Computing (HPC) systems commonly feature non-uniform data paths to memory, i.e. are non-uniform memory access (NUMA) architectures. Memory is divided into multiple regions, with each processing unit having its own local memory. Therefore, for each processing unit access to local memory regions is faster compared to accessing memory at non-local regions. Architectures with hybrid memory technologies result in further non-uniformity. This paper presents case studies of the performance potential and data placement implications of non-uniform and heterogeneous memory in HPC systems. Using the gem5 and VPSim simulation platforms, we model NUMA systems with processors based on the ARMv8 Neoverse V1 Reference Design. The gem5 simulator provides a cycle-accurate view, while VPSim offers greater simulation speed, with a high-level view of the simulated system. We highlight the performance impact of design trade-offs regarding NUMA node organization and System Level Cache (SLC) group assignment, as well as Networkon-Chip (NoC) configuration. Our case studies provide essential input to a co-design process involving HPC processor architects and system integrators. A comparison of system configurations for different NoC bandwidths shows reduced NoC latency and high memory bandwidth improvement when NUMA control is enabled. Furthermore, a configuration with HBM2 memory organized as four NUMA nodes highlights the memory bandwidth performance gap and NoC queuing latency impact when comparing local vs. remote memory accesses. On the other hand, NUMA can result in an unbalanced distribution of memory accesses and reduced SLC hit ratios, as shown with DDR4 memory organized as four NUMA nodes.
引用
收藏
页码:251 / 265
页数:15
相关论文
共 50 条
  • [31] Device and Circuit Level Performance Comparison of Tunnel FET Architectures and Impact of Heterogeneous Gate Dielectric
    Narang, Rakhi
    Saxena, Manoj
    Gupta, R. S.
    Gupta, Mridula
    JOURNAL OF SEMICONDUCTOR TECHNOLOGY AND SCIENCE, 2013, 13 (03) : 224 - 236
  • [32] Virtual machine security challenges: case studies
    Rehman, Amjad
    Alqahtani, Sultan
    Altameem, Ayman
    Saba, Tanzila
    INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2014, 5 (05) : 729 - 742
  • [33] Unravelling heritage challenges: three case studies
    Perovic, Miljenka
    Coffey, Vaughan
    Kajewski, Stephen
    Madan, Ashok
    JOURNAL OF CULTURAL HERITAGE MANAGEMENT AND SUSTAINABLE DEVELOPMENT, 2016, 6 (03) : 330 - 344
  • [34] ISSUES AND CHALLENGES IN BUSINESS INTELLIGENCE CASE STUDIES
    Abu Hasan, Nooradilla
    Rahman, Azizah Abdul
    Lahad, Norminshah A.
    JURNAL TEKNOLOGI, 2016, 78 (8-2): : 171 - 178
  • [35] Contemporary ethical challenges. Case studies
    Jutras, France
    REVUE DES SCIENCES DE L EDUCATION, 2011, 37 (03): : 657 - 658
  • [36] Transdisciplinarity and its challenges: the case of urban studies
    Ramadier, T
    FUTURES, 2004, 36 (04) : 423 - 439
  • [37] Biomedical Visual Computing: Case Studies and Challenges
    Johnson, Chris R.
    COMPUTING IN SCIENCE & ENGINEERING, 2012, 14 (01) : 12 - 20
  • [38] Virtual machine security challenges: case studies
    Amjad Rehman
    Sultan Alqahtani
    Ayman Altameem
    Tanzila Saba
    International Journal of Machine Learning and Cybernetics, 2014, 5 : 729 - 742
  • [39] Case studies in mutational analysis: Challenges remain
    Cooksley, Renee
    Sandberg, Sherri
    Score, Paul
    Diethelm-Okita, Brenda
    Erickson, David
    Whitley, Chet
    MOLECULAR GENETICS AND METABOLISM, 2009, 96 (02) : S18 - S18
  • [40] The haptic paradigm in education: Challenges and case studies
    Hamza-Lup, Felix G.
    Stanescu, Ioana A.
    INTERNET AND HIGHER EDUCATION, 2010, 13 (1-2): : 78 - 81