A Practical Approach For Workload-Aware Data Movement in Disaggregated Memory Systems

被引：1

作者：

Puri, Amit ^{[1
]}

Bellamkonda, Kartheek ^{[1
]}

Narreddy, Kailash ^{[1
]}

Jose, John ^{[1
]}

Venkatesh, Tamarapalli ^{[1
]}

机构：

[1] IIT Guwahati, Dept CSE, Gauhati, Assam, India

来源：

2023 IEEE 35TH INTERNATIONAL SYMPOSIUM ON COMPUTER ARCHITECTURE AND HIGH PERFORMANCE COMPUTING, SBAC-PAD | 2023年

关键词：

Data centers; Page migration; Memory disaggregation;

D O I：

10.1109/SBAC-PAD59825.2023.00017

中图分类号：

TP3 [计算技术、计算机技术];

学科分类号：

0812 ;

摘要：

Memory disaggregation is a solid alternative to traditional server systems that can overcome memory scalability issues in next-generation HPC data centers. In a rack-level disaggregated system, multiple compute nodes with small local memory rely on remote memory pools (memory nodes) to fulfill their memory demands. An in-network memory manager manages remote memory address space and allocates it to compute nodes which can access the memory at cache-line granularity using coherent interconnects such as CXL (or GenZ). However, the memory access cost is significantly increased due to the presence of the network. Even though a page migration system can exploit the locality of memory accesses, accessing a remote page starves the block-level requests. Further, page migrations introduce additional overheads which combined with starvation may even degrade the performance. All these issues require systematic evaluation of disaggregated memory systems to achieve improved designs. This paper presents a hardware mechanism for workload-aware data movement between compute and memory pools that significantly reduces the memory access cost. Firstly, our design enables centralized hot-page migration in a multi-tiered disaggregated memory that is aware of access patterns for individual compute nodes. Secondly, we analyze the complexities of accessing a remote memory page and propose a novel solution to eliminate starvation by serving all the remote memory requests at cache block granularity and by sharing bandwidth between page and block memory requests. Lastly, we add extra hardware support to get rid of additional overheads in a page migration system. We evaluate our designs over a variety of multi-threaded benchmarks using a cycle-level simulator which is specially designed to simulate a disaggregated memory system. Our design performs 10% to 100% better than traditional RDMA-based disaggregated systems that access remote memory at page granularity and 5% to 35% better than baseline disaggregated systems that use coherent interconnects for block-level access.

引用

页码：78 / 88

页数：11

共 50 条

[41] Federated workload-aware quantized framework for secure learning in data-sensitive applications
Narula, Manu
Meena, Jasraj
Vishwakarma, Dinesh Kumar
FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2025, 168
[42] An adaptive workload-aware power consumption measuring method for servers in cloud data centers
Weiwei Lin
Yufeng Zhang
Wentai Wu
Simon Fong
Ligang He
Jia Chang
Computing, 2023, 105 : 515 - 538
[43] Progressive Data Stream Mining and Transaction Classification for Workload-Aware Incremental Database Repartitioning
Kamal, Joarder Mohammad Mustafa
Murshed, Manzur
Gaber, Mohamed Medhat
2014 IEEE/ACM INTERNATIONAL SYMPOSIUM ON BIG DATA COMPUTING (BDC), 2014, : 8 - 15
[44] LogStore: A Workload-Aware, Adaptable Key-Value Store on Hybrid Storage Systems
Menon, Prashanth
Qadah, Thamir M.
Rabl, Tilmann
Sadoghi, Mohammad
Jacobsen, Hans-Arno
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2022, 34 (08) : 3867 - 3882
[45] Workload-Aware Reviewer Recommendation using a Multi-objective Search-Based Approach
Al-Zubaidi, Wisam Haitham Abbood
Thongtanunam, Patanamon
Hoa Khanh Dam
Tantithamthavorn, Chakkrit
Ghose, Aditya
PROCEEDINGS OF THE 16TH ACM INTERNATIONAL CONFERENCE ON PREDICTIVE MODELS AND DATA ANALYTICS IN SOFTWARE ENGINEERING, PROMISE 2020, 2020, : 21 - 30
[46] Novel Workload-Aware Approach to Mobile User Reallocation in Crowded Mobile Edge Computing Environment
Xiao, Xuan
Ma, Yong
Xia, Yunni
Zhou, Mengchu
Luo, Xin
Wang, Xu
Fu, Xiaodong
Wei, Wei
Jiang, Ning
IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2022, 23 (07) : 8846 - 8856
[47] Workload-aware request routing in cloud data center using software-defined networking
Haitao Yuan
Jing Bi
Bohu Li
Journal of Systems Engineering and Electronics, 2015, 26 (01) : 151 - 160
[48] A workload-aware flash translation layer enhancing performance and lifespan of TLC/SLC dual-mode flash memory in embedded systems
Liu, Duo
Yao, Lei
Long, Linbo
Shao, Zili
Guan, Yong
MICROPROCESSORS AND MICROSYSTEMS, 2017, 52 : 343 - 354
[49] Workload-aware request routing in cloud data center using software-defined networking
Yuan, Haitao
Bi, Jing
Li, Bohu
JOURNAL OF SYSTEMS ENGINEERING AND ELECTRONICS, 2015, 26 (01) : 151 - 160
[50] TLC-FTL: Workload-aware Flash Translation Layer for TLC/SLC Dual-Mode Flash Memory in Embedded Systems
Yao, Lei
Liu, Duo
Zhong, Kan
Long, Linbo
Shao, Zili
2015 IEEE 17TH INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING AND COMMUNICATIONS, 2015 IEEE 7TH INTERNATIONAL SYMPOSIUM ON CYBERSPACE SAFETY AND SECURITY, AND 2015 IEEE 12TH INTERNATIONAL CONFERENCE ON EMBEDDED SOFTWARE AND SYSTEMS (ICESS), 2015, : 831 - 834

← 1 2 3 4 5 →