A Practical Approach For Workload-Aware Data Movement in Disaggregated Memory Systems

被引:1
|
作者
Puri, Amit [1 ]
Bellamkonda, Kartheek [1 ]
Narreddy, Kailash [1 ]
Jose, John [1 ]
Venkatesh, Tamarapalli [1 ]
机构
[1] IIT Guwahati, Dept CSE, Gauhati, Assam, India
关键词
Data centers; Page migration; Memory disaggregation;
D O I
10.1109/SBAC-PAD59825.2023.00017
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Memory disaggregation is a solid alternative to traditional server systems that can overcome memory scalability issues in next-generation HPC data centers. In a rack-level disaggregated system, multiple compute nodes with small local memory rely on remote memory pools (memory nodes) to fulfill their memory demands. An in-network memory manager manages remote memory address space and allocates it to compute nodes which can access the memory at cache-line granularity using coherent interconnects such as CXL (or GenZ). However, the memory access cost is significantly increased due to the presence of the network. Even though a page migration system can exploit the locality of memory accesses, accessing a remote page starves the block-level requests. Further, page migrations introduce additional overheads which combined with starvation may even degrade the performance. All these issues require systematic evaluation of disaggregated memory systems to achieve improved designs. This paper presents a hardware mechanism for workload-aware data movement between compute and memory pools that significantly reduces the memory access cost. Firstly, our design enables centralized hot-page migration in a multi-tiered disaggregated memory that is aware of access patterns for individual compute nodes. Secondly, we analyze the complexities of accessing a remote memory page and propose a novel solution to eliminate starvation by serving all the remote memory requests at cache block granularity and by sharing bandwidth between page and block memory requests. Lastly, we add extra hardware support to get rid of additional overheads in a page migration system. We evaluate our designs over a variety of multi-threaded benchmarks using a cycle-level simulator which is specially designed to simulate a disaggregated memory system. Our design performs 10% to 100% better than traditional RDMA-based disaggregated systems that access remote memory at page granularity and 5% to 35% better than baseline disaggregated systems that use coherent interconnects for block-level access.
引用
收藏
页码:78 / 88
页数:11
相关论文
共 50 条
  • [1] Workload-Aware Placement Strategies to Leverage Disaggregated Resources in the Datacenter
    Call, Aaron
    Polo, Jorda
    Carrera, David
    [J]. IEEE SYSTEMS JOURNAL, 2022, 16 (01): : 1697 - 1708
  • [2] FORESEER: Workload-aware Data Storage for MapReduce
    Zou, Jia
    Shi, Juwei
    Liu, Tongping
    Cao, Zhao
    Wang, Chen
    [J]. 2015 IEEE 35th International Conference on Distributed Computing Systems, 2015, : 746 - 747
  • [3] Workload-aware Power Management of Cluster Systems
    Liu, Zhuo
    Liang, Aihua
    Xiao, Limin
    Ruan, Li
    [J]. PROCEEDINGS OF THE NINTH INTERNATIONAL SYMPOSIUM ON DISTRIBUTED COMPUTING AND APPLICATIONS TO BUSINESS, ENGINEERING AND SCIENCE (DCABES 2010), 2010, : 603 - 608
  • [4] WISE: Workload-Aware Partitioning for RDF Systems
    Guo, Xintong
    Gao, Hong
    Zou, Zhaonian
    [J]. BIG DATA RESEARCH, 2020, 22
  • [5] SWORD: workload-aware data placement and replica selection for cloud data management systems
    Kumar, K. Ashwin
    Quamar, Abdul
    Deshpande, Amol
    Khuller, Samir
    [J]. VLDB JOURNAL, 2014, 23 (06): : 845 - 870
  • [6] SWORD: workload-aware data placement and replica selection for cloud data management systems
    K. Ashwin Kumar
    Abdul Quamar
    Amol Deshpande
    Samir Khuller
    [J]. The VLDB Journal, 2014, 23 : 845 - 870
  • [7] A Workload-Aware Change Data Capture Framework for Data Warehousing
    Qu, Weiping
    Liu, Xiufeng
    Dessloch, Stefan
    [J]. BIG DATA ANALYTICS AND KNOWLEDGE DISCOVERY (DAWAK 2021), 2021, 12925 : 222 - 231
  • [8] Workload-Aware Electromigration Analysis in Emerging Spintronic Memory Arrays
    Nair S.M.
    Mayahinia M.
    Tahoori M.B.
    Perumkunnil M.
    Zahedmanesh H.
    Croes K.
    Garello K.
    Marinelli T.
    Evenblij T.
    Kar G.S.
    Catthoor F.
    [J]. Nair, Sarath Mohanachandran (sarath.nair@kit.edu), 2021, Institute of Electrical and Electronics Engineers Inc. (21) : 258 - 266
  • [9] Workload-Aware Runtime Energy Management for HPC Systems
    Basireddy, Karunakar R.
    Wachter, Eduardo W.
    Al-Hashimi, Bashir M.
    Merrett, Geoff V.
    [J]. PROCEEDINGS 2018 INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING & SIMULATION (HPCS), 2018, : 292 - 299
  • [10] Workload-Aware Aggregate Maintenance in Columnar In-Memory Databases
    Mueller, Stephan
    Butzmann, Lars
    Klauck, Stefan
    Plattner, Hasso
    [J]. 2013 IEEE INTERNATIONAL CONFERENCE ON BIG DATA, 2013,