Shared Last-level Cache Management for GPGPUs with Hybrid Main Memory

被引：0

作者：

Wang, Guan ^{[1
]}

Cai, Xiaojun ^{[1
]}

Ju, Lei ^{[1
]}

Zang, Chuanqi ^{[1
]}

Zhao, Mengying ^{[1
]}

Jia, Zhiping ^{[1
]}

机构：

[1] Shandong Univ, Sch Comp Sci & Technol, Jinan, Shandong, Peoples R China

来源：

PROCEEDINGS OF THE 2017 DESIGN, AUTOMATION & TEST IN EUROPE CONFERENCE & EXHIBITION (DATE) | 2017年

基金：

中国国家自然科学基金;

关键词：

HIGH-PERFORMANCE;

D O I：

暂无

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Memory intensive workloads become increasingly popular on general purpose graphics processing units (GPGPUs), and impose great challenges on the GPGPU memory subsystem design. On the other hand, with the recent development of nonvolatile memory (NVM) technologies, hybrid memory combining both DRAM and NVM achieves high performance, low power and high density simultaneously, which provides a promising main memory design for GPGPUs. In this work, we explore the shared last-level cache management for GPGPUs with consideration of the underlying hybrid main memory. In order to improve the overall memory subsystem performance, we exploit the characteristics of both the asymmetric read/write latency of the hybrid main memory architecture, as well as the memory coalescing feature of GPGPU. In particular, to reduce the average cost of L2 cache misses, we prioritize cache blocks from DRAM or NVM based on observation that operations to NVM part of main memory have large impact on the system performance. Furthermore, the cache management scheme also integrates the GPU memory coalescing and cache bypassing techniques to improve the overall cache hit ratio. Experimental results show that in the context of a hybrid main memory system, our proposed L2 cache management policy improves performance against the traditional LRU policy and a state-ofthe-art GPU cache strategy EABP [20] by up to 27.76% and 14%, respectively.

引用

页码：25 / 30

页数：6

共 50 条

[41] NOVELLA: Nonvolatile Last-Level Cache Bypass for Optimizing Off-Chip Memory Energy
Bagchi, Aritra
Rishabh, Ohm
Panda, Preeti Ranjan
IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS, 2024, 43 (11) : 3913 - 3924
[42] Access Pattern Characterization of Last-level Cache for Effective Replacement
Anik, Shafayat Mowla
Lee, Byeong Kil
2023 INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE AND COMPUTATIONAL INTELLIGENCE, CSCI 2023, 2023, : 1113 - 1116
[43] Discrete Cache Insertion Policies for Shared Last Level Cache Management on Large Multicores
Sridharan, Aswinkumar
Seznec, Andre
2016 IEEE 30TH INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM (IPDPS 2016), 2016, : 822 - 831
[44] Last-Level Cache Side-Channel Attacks are Practical
Liu, Fangfei
Yarom, Yuval
Ge, Qian
Heiser, Gernot
Lee, Ruby B.
2015 IEEE SYMPOSIUM ON SECURITY AND PRIVACY SP 2015, 2015, : 605 - 622
[45] SRAM- and STT-RAM-based hybrid, shared last-level cache for on-chip CPU-GPU heterogeneous architectures
Gao, Lan
Wang, Rui
Xu, Yunlong
Yang, Hailong
Luan, Zhongzhi
Qian, Depei
Zhang, Han
Cai, Jihong
JOURNAL OF SUPERCOMPUTING, 2018, 74 (07): : 3388 - 3414
[46] Buffer Filter: A Last-level Cache Management Policy for CPU-GPGPU Heterogeneous System
Li, Songyuan
Meng, Jinglei
Yu, Licheng
Ma, Jianliang
Chen, Tianzhou
Wu, Minghui
2015 IEEE 17TH INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING AND COMMUNICATIONS, 2015 IEEE 7TH INTERNATIONAL SYMPOSIUM ON CYBERSPACE SAFETY AND SECURITY, AND 2015 IEEE 12TH INTERNATIONAL CONFERENCE ON EMBEDDED SOFTWARE AND SYSTEMS (ICESS), 2015, : 266 - 271
[47] Design of an area and energy-efficient last-level cache memory using STT-MRAM
Saha, Rajesh
Pundir, Yogendra Pratap
Pal, Pankaj Kumar
JOURNAL OF MAGNETISM AND MAGNETIC MATERIALS, 2021, 529
[48] Last-Level Cache Insertion and Promotion Policy in the Presence of Aggressive Prefetching
Jimenez, Daniel A. A.
Teran, Elvira
Gratz, Paul V. V.
IEEE COMPUTER ARCHITECTURE LETTERS, 2023, 22 (01) : 17 - 20
[49] LA-LLC: Inter-Core Locality-Aware Last-Level Cache to Exploit Many-to-Many Traffic in GPGPUs
Zhao, Xia
Liu, Yuxi
Adileh, Almutaz
Eeckhout, Lieven
IEEE COMPUTER ARCHITECTURE LETTERS, 2017, 16 (01) : 42 - 45
[50] Taming Process Variations in CNFET for Efficient Last-Level Cache Design
Xu, Dawen
Feng, Zhuangyu
Liu, Cheng
Li, Li
Wang, Ying
Li, Huawei
Li, Xiaowei
IEEE TRANSACTIONS ON VERY LARGE SCALE INTEGRATION (VLSI) SYSTEMS, 2022, 30 (04) : 418 - 431

← 1 2 3 4 5 →