A Practical Approach For Workload-Aware Data Movement in Disaggregated Memory Systems

被引：1

作者：

Puri, Amit ^{[1
]}

Bellamkonda, Kartheek ^{[1
]}

Narreddy, Kailash ^{[1
]}

Jose, John ^{[1
]}

Venkatesh, Tamarapalli ^{[1
]}

机构：

[1] IIT Guwahati, Dept CSE, Gauhati, Assam, India

来源：

2023 IEEE 35TH INTERNATIONAL SYMPOSIUM ON COMPUTER ARCHITECTURE AND HIGH PERFORMANCE COMPUTING, SBAC-PAD | 2023年

关键词：

Data centers; Page migration; Memory disaggregation;

D O I：

10.1109/SBAC-PAD59825.2023.00017

中图分类号：

TP3 [计算技术、计算机技术];

学科分类号：

0812 ;

摘要：

Memory disaggregation is a solid alternative to traditional server systems that can overcome memory scalability issues in next-generation HPC data centers. In a rack-level disaggregated system, multiple compute nodes with small local memory rely on remote memory pools (memory nodes) to fulfill their memory demands. An in-network memory manager manages remote memory address space and allocates it to compute nodes which can access the memory at cache-line granularity using coherent interconnects such as CXL (or GenZ). However, the memory access cost is significantly increased due to the presence of the network. Even though a page migration system can exploit the locality of memory accesses, accessing a remote page starves the block-level requests. Further, page migrations introduce additional overheads which combined with starvation may even degrade the performance. All these issues require systematic evaluation of disaggregated memory systems to achieve improved designs. This paper presents a hardware mechanism for workload-aware data movement between compute and memory pools that significantly reduces the memory access cost. Firstly, our design enables centralized hot-page migration in a multi-tiered disaggregated memory that is aware of access patterns for individual compute nodes. Secondly, we analyze the complexities of accessing a remote memory page and propose a novel solution to eliminate starvation by serving all the remote memory requests at cache block granularity and by sharing bandwidth between page and block memory requests. Lastly, we add extra hardware support to get rid of additional overheads in a page migration system. We evaluate our designs over a variety of multi-threaded benchmarks using a cycle-level simulator which is specially designed to simulate a disaggregated memory system. Our design performs 10% to 100% better than traditional RDMA-based disaggregated systems that access remote memory at page granularity and 5% to 35% better than baseline disaggregated systems that use coherent interconnects for block-level access.

引用

页码：78 / 88

页数：11

共 50 条

[21] Dynamic workload-aware DVFS for multicore systems using machine learning
Manjari Gupta
Lava Bhargava
S. Indu
Computing, 2021, 103 : 1747 - 1769
[22] LSched A Workload-Aware Learned Query Scheduler for Analytical Database Systems
Sabek, Ibrahim
Ukyab, Tenzin Samten
Kraska, Tim
PROCEEDINGS OF THE 2022 INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA (SIGMOD '22), 2022, : 1228 - 1242
[23] Workload-Aware Scheduling Across Geo-distributed Data Centers
Jin, Yibo
Gao, Yuan
Qian, Zhuzhong
Zhai, Mingyu
Peng, Hui
Lu, Sanglu
2016 IEEE TRUSTCOM/BIGDATASE/ISPA, 2016, : 1455 - 1462
[24] Workload-Aware Revenue Maximization in SDN-Enabled Data Center
Yuan, Haitao
Bi, Jing
Zhang, Jia
Tan, Wei
Huang, Keman
2017 IEEE 10TH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING (CLOUD), 2017, : 18 - 25
[25] WatCache: a workload-aware temporary cache on the compute side of HPC systems
Jie Yu
Guangming Liu
Wenrui Dong
Xiaoyong Li
The Journal of Supercomputing, 2019, 75 : 554 - 586
[26] ChewAnalyzer: Workload-Aware Data Management Across Differentiated Storage Pools
Ge, Xiongzi
Xie, Xuchao
Du, David H. C.
Ganesan, Pradeep
Hahn, Dennis
2018 IEEE 26TH INTERNATIONAL SYMPOSIUM ON MODELING, ANALYSIS, AND SIMULATION OF COMPUTER AND TELECOMMUNICATION SYSTEMS (MASCOTS), 2018, : 94 - 101
[27] Dynamic workload-aware DVFS for multicore systems using machine learning
Gupta, Manjari
Bhargava, Lava
Indu, S.
COMPUTING, 2021, 103 (08) : 1747 - 1769
[28] NBTI and Power Reduction Using a Workload-Aware Supply Voltage Assignment Approach
Yu, Yang
Liang, Jie
Yang, Zhiming
Peng, Xiyuan
JOURNAL OF ELECTRONIC TESTING-THEORY AND APPLICATIONS, 2018, 34 (01): : 27 - 41
[29] A Data- and Workload-Aware Algorithm for Range Queries Under Differential Privacy
Li, Chao
Hay, Michael
Miklau, Gerome
Wang, Yue
PROCEEDINGS OF THE VLDB ENDOWMENT, 2014, 7 (05): : 341 - 352
[30] NBTI and Power Reduction Using a Workload-Aware Supply Voltage Assignment Approach
Yang Yu
Jie Liang
Zhiming Yang
Xiyuan Peng
Journal of Electronic Testing, 2018, 34 : 27 - 41

← 1 2 3 4 5 →