Challenges of Memory Management on Modern NUMA Systems

Cited by: 30

Authors:

Gaud, Fabien [1]

Lepers, Baptiste [2]

Funston, Justin [3]

Dashti, Mohammad [3]

Fedorova, Alexandra [4]

Quema, Vivien [5]

Lachaize, Renaud [6]

Roth, Mark

Affiliations:

[1] Coho Data, Focusing Performance & Scalabil, Palo Alto, CA 94303 USA

[2] Ecole Polytech Fed Lausanne, Lausanne, Switzerland

[3] Univ British Columbia, Vancouver, BC V5Z 1M9, Canada

[4] Univ British Columbia, ECE Dept, Vancouver, BC V5Z 1M9, Canada

[5] Grenoble INP ENSIMAG, Grenoble, France

[6] Univ Grenoble, Grenoble, France

Source:

COMMUNICATIONS OF THE ACM | 2015 / Vol. 58 / No. 12

Keywords:

DOI:

10.1145/2814328

Chinese Library Classification (CLC) number:

TP3 [Computing technology, computer technology];

Discipline classification code:

0812;

Abstract:

Memory access latency is non-uniform because it depends on where a request originates and where it is destined to go. Such systems are referred to as non-uniform memory access (NUMA) systems. Current x86 NUMA systems are cache coherent (called ccNUMA), which means programs can transparently access memory on local and remote nodes without changes to the code or special operating system support. Experiments have shown that congestion occurs when the rate of requests to memory controllers or the rate of traffic over interconnects is too high, which causes excessive delays for memory accesses. Congestion can be alleviated by balancing the traffic among multiple memory controllers and interconnect links. The other factor in NUMA performance is locality, which is what previous NUMA algorithms have focused on. As NUMA systems grow and the number of cores issuing memory requests increases, NUMA effects will continue to be a concern. Carrefour demonstrates a collection of techniques that effectively mitigate these effects.

Citation:

Pages: 59-66

Page count: 8
