A Practical Approach For Workload-Aware Data Movement in Disaggregated Memory Systems

被引:1
|
作者
Puri, Amit [1 ]
Bellamkonda, Kartheek [1 ]
Narreddy, Kailash [1 ]
Jose, John [1 ]
Venkatesh, Tamarapalli [1 ]
机构
[1] IIT Guwahati, Dept CSE, Gauhati, Assam, India
关键词
Data centers; Page migration; Memory disaggregation;
D O I
10.1109/SBAC-PAD59825.2023.00017
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Memory disaggregation is a solid alternative to traditional server systems that can overcome memory scalability issues in next-generation HPC data centers. In a rack-level disaggregated system, multiple compute nodes with small local memory rely on remote memory pools (memory nodes) to fulfill their memory demands. An in-network memory manager manages remote memory address space and allocates it to compute nodes which can access the memory at cache-line granularity using coherent interconnects such as CXL (or GenZ). However, the memory access cost is significantly increased due to the presence of the network. Even though a page migration system can exploit the locality of memory accesses, accessing a remote page starves the block-level requests. Further, page migrations introduce additional overheads which combined with starvation may even degrade the performance. All these issues require systematic evaluation of disaggregated memory systems to achieve improved designs. This paper presents a hardware mechanism for workload-aware data movement between compute and memory pools that significantly reduces the memory access cost. Firstly, our design enables centralized hot-page migration in a multi-tiered disaggregated memory that is aware of access patterns for individual compute nodes. Secondly, we analyze the complexities of accessing a remote memory page and propose a novel solution to eliminate starvation by serving all the remote memory requests at cache block granularity and by sharing bandwidth between page and block memory requests. Lastly, we add extra hardware support to get rid of additional overheads in a page migration system. We evaluate our designs over a variety of multi-threaded benchmarks using a cycle-level simulator which is specially designed to simulate a disaggregated memory system. Our design performs 10% to 100% better than traditional RDMA-based disaggregated systems that access remote memory at page granularity and 5% to 35% better than baseline disaggregated systems that use coherent interconnects for block-level access.
引用
收藏
页码:78 / 88
页数:11
相关论文
共 50 条
  • [41] Federated workload-aware quantized framework for secure learning in data-sensitive applications
    Narula, Manu
    Meena, Jasraj
    Vishwakarma, Dinesh Kumar
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2025, 168
  • [42] An adaptive workload-aware power consumption measuring method for servers in cloud data centers
    Weiwei Lin
    Yufeng Zhang
    Wentai Wu
    Simon Fong
    Ligang He
    Jia Chang
    Computing, 2023, 105 : 515 - 538
  • [43] Progressive Data Stream Mining and Transaction Classification for Workload-Aware Incremental Database Repartitioning
    Kamal, Joarder Mohammad Mustafa
    Murshed, Manzur
    Gaber, Mohamed Medhat
    2014 IEEE/ACM INTERNATIONAL SYMPOSIUM ON BIG DATA COMPUTING (BDC), 2014, : 8 - 15
  • [44] LogStore: A Workload-Aware, Adaptable Key-Value Store on Hybrid Storage Systems
    Menon, Prashanth
    Qadah, Thamir M.
    Rabl, Tilmann
    Sadoghi, Mohammad
    Jacobsen, Hans-Arno
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2022, 34 (08) : 3867 - 3882
  • [45] Workload-Aware Reviewer Recommendation using a Multi-objective Search-Based Approach
    Al-Zubaidi, Wisam Haitham Abbood
    Thongtanunam, Patanamon
    Hoa Khanh Dam
    Tantithamthavorn, Chakkrit
    Ghose, Aditya
    PROCEEDINGS OF THE 16TH ACM INTERNATIONAL CONFERENCE ON PREDICTIVE MODELS AND DATA ANALYTICS IN SOFTWARE ENGINEERING, PROMISE 2020, 2020, : 21 - 30
  • [46] Novel Workload-Aware Approach to Mobile User Reallocation in Crowded Mobile Edge Computing Environment
    Xiao, Xuan
    Ma, Yong
    Xia, Yunni
    Zhou, Mengchu
    Luo, Xin
    Wang, Xu
    Fu, Xiaodong
    Wei, Wei
    Jiang, Ning
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2022, 23 (07) : 8846 - 8856
  • [47] Workload-aware request routing in cloud data center using software-defined networking
    Haitao Yuan
    Jing Bi
    Bohu Li
    Journal of Systems Engineering and Electronics, 2015, 26 (01) : 151 - 160
  • [48] A workload-aware flash translation layer enhancing performance and lifespan of TLC/SLC dual-mode flash memory in embedded systems
    Liu, Duo
    Yao, Lei
    Long, Linbo
    Shao, Zili
    Guan, Yong
    MICROPROCESSORS AND MICROSYSTEMS, 2017, 52 : 343 - 354
  • [49] Workload-aware request routing in cloud data center using software-defined networking
    Yuan, Haitao
    Bi, Jing
    Li, Bohu
    JOURNAL OF SYSTEMS ENGINEERING AND ELECTRONICS, 2015, 26 (01) : 151 - 160
  • [50] TLC-FTL: Workload-aware Flash Translation Layer for TLC/SLC Dual-Mode Flash Memory in Embedded Systems
    Yao, Lei
    Liu, Duo
    Zhong, Kan
    Long, Linbo
    Shao, Zili
    2015 IEEE 17TH INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING AND COMMUNICATIONS, 2015 IEEE 7TH INTERNATIONAL SYMPOSIUM ON CYBERSPACE SAFETY AND SECURITY, AND 2015 IEEE 12TH INTERNATIONAL CONFERENCE ON EMBEDDED SOFTWARE AND SYSTEMS (ICESS), 2015, : 831 - 834