A Practical Approach For Workload-Aware Data Movement in Disaggregated Memory Systems

被引:1
|
作者
Puri, Amit [1 ]
Bellamkonda, Kartheek [1 ]
Narreddy, Kailash [1 ]
Jose, John [1 ]
Venkatesh, Tamarapalli [1 ]
机构
[1] IIT Guwahati, Dept CSE, Gauhati, Assam, India
关键词
Data centers; Page migration; Memory disaggregation;
D O I
10.1109/SBAC-PAD59825.2023.00017
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Memory disaggregation is a solid alternative to traditional server systems that can overcome memory scalability issues in next-generation HPC data centers. In a rack-level disaggregated system, multiple compute nodes with small local memory rely on remote memory pools (memory nodes) to fulfill their memory demands. An in-network memory manager manages remote memory address space and allocates it to compute nodes which can access the memory at cache-line granularity using coherent interconnects such as CXL (or GenZ). However, the memory access cost is significantly increased due to the presence of the network. Even though a page migration system can exploit the locality of memory accesses, accessing a remote page starves the block-level requests. Further, page migrations introduce additional overheads which combined with starvation may even degrade the performance. All these issues require systematic evaluation of disaggregated memory systems to achieve improved designs. This paper presents a hardware mechanism for workload-aware data movement between compute and memory pools that significantly reduces the memory access cost. Firstly, our design enables centralized hot-page migration in a multi-tiered disaggregated memory that is aware of access patterns for individual compute nodes. Secondly, we analyze the complexities of accessing a remote memory page and propose a novel solution to eliminate starvation by serving all the remote memory requests at cache block granularity and by sharing bandwidth between page and block memory requests. Lastly, we add extra hardware support to get rid of additional overheads in a page migration system. We evaluate our designs over a variety of multi-threaded benchmarks using a cycle-level simulator which is specially designed to simulate a disaggregated memory system. Our design performs 10% to 100% better than traditional RDMA-based disaggregated systems that access remote memory at page granularity and 5% to 35% better than baseline disaggregated systems that use coherent interconnects for block-level access.
引用
收藏
页码:78 / 88
页数:11
相关论文
共 50 条
  • [21] Dynamic workload-aware DVFS for multicore systems using machine learning
    Manjari Gupta
    Lava Bhargava
    S. Indu
    Computing, 2021, 103 : 1747 - 1769
  • [22] LSched A Workload-Aware Learned Query Scheduler for Analytical Database Systems
    Sabek, Ibrahim
    Ukyab, Tenzin Samten
    Kraska, Tim
    PROCEEDINGS OF THE 2022 INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA (SIGMOD '22), 2022, : 1228 - 1242
  • [23] Workload-Aware Scheduling Across Geo-distributed Data Centers
    Jin, Yibo
    Gao, Yuan
    Qian, Zhuzhong
    Zhai, Mingyu
    Peng, Hui
    Lu, Sanglu
    2016 IEEE TRUSTCOM/BIGDATASE/ISPA, 2016, : 1455 - 1462
  • [24] Workload-Aware Revenue Maximization in SDN-Enabled Data Center
    Yuan, Haitao
    Bi, Jing
    Zhang, Jia
    Tan, Wei
    Huang, Keman
    2017 IEEE 10TH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING (CLOUD), 2017, : 18 - 25
  • [25] WatCache: a workload-aware temporary cache on the compute side of HPC systems
    Jie Yu
    Guangming Liu
    Wenrui Dong
    Xiaoyong Li
    The Journal of Supercomputing, 2019, 75 : 554 - 586
  • [26] ChewAnalyzer: Workload-Aware Data Management Across Differentiated Storage Pools
    Ge, Xiongzi
    Xie, Xuchao
    Du, David H. C.
    Ganesan, Pradeep
    Hahn, Dennis
    2018 IEEE 26TH INTERNATIONAL SYMPOSIUM ON MODELING, ANALYSIS, AND SIMULATION OF COMPUTER AND TELECOMMUNICATION SYSTEMS (MASCOTS), 2018, : 94 - 101
  • [27] Dynamic workload-aware DVFS for multicore systems using machine learning
    Gupta, Manjari
    Bhargava, Lava
    Indu, S.
    COMPUTING, 2021, 103 (08) : 1747 - 1769
  • [28] NBTI and Power Reduction Using a Workload-Aware Supply Voltage Assignment Approach
    Yu, Yang
    Liang, Jie
    Yang, Zhiming
    Peng, Xiyuan
    JOURNAL OF ELECTRONIC TESTING-THEORY AND APPLICATIONS, 2018, 34 (01): : 27 - 41
  • [29] A Data- and Workload-Aware Algorithm for Range Queries Under Differential Privacy
    Li, Chao
    Hay, Michael
    Miklau, Gerome
    Wang, Yue
    PROCEEDINGS OF THE VLDB ENDOWMENT, 2014, 7 (05): : 341 - 352
  • [30] NBTI and Power Reduction Using a Workload-Aware Supply Voltage Assignment Approach
    Yang Yu
    Jie Liang
    Zhiming Yang
    Xiyuan Peng
    Journal of Electronic Testing, 2018, 34 : 27 - 41