Shared Last-level Cache Management for GPGPUs with Hybrid Main Memory

被引:0
|
作者
Wang, Guan [1 ]
Cai, Xiaojun [1 ]
Ju, Lei [1 ]
Zang, Chuanqi [1 ]
Zhao, Mengying [1 ]
Jia, Zhiping [1 ]
机构
[1] Shandong Univ, Sch Comp Sci & Technol, Jinan, Shandong, Peoples R China
基金
中国国家自然科学基金;
关键词
HIGH-PERFORMANCE;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Memory intensive workloads become increasingly popular on general purpose graphics processing units (GPGPUs), and impose great challenges on the GPGPU memory subsystem design. On the other hand, with the recent development of nonvolatile memory (NVM) technologies, hybrid memory combining both DRAM and NVM achieves high performance, low power and high density simultaneously, which provides a promising main memory design for GPGPUs. In this work, we explore the shared last-level cache management for GPGPUs with consideration of the underlying hybrid main memory. In order to improve the overall memory subsystem performance, we exploit the characteristics of both the asymmetric read/write latency of the hybrid main memory architecture, as well as the memory coalescing feature of GPGPU. In particular, to reduce the average cost of L2 cache misses, we prioritize cache blocks from DRAM or NVM based on observation that operations to NVM part of main memory have large impact on the system performance. Furthermore, the cache management scheme also integrates the GPU memory coalescing and cache bypassing techniques to improve the overall cache hit ratio. Experimental results show that in the context of a hybrid main memory system, our proposed L2 cache management policy improves performance against the traditional LRU policy and a state-ofthe-art GPU cache strategy EABP [20] by up to 27.76% and 14%, respectively.
引用
收藏
页码:25 / 30
页数:6
相关论文
共 50 条
  • [41] NOVELLA: Nonvolatile Last-Level Cache Bypass for Optimizing Off-Chip Memory Energy
    Bagchi, Aritra
    Rishabh, Ohm
    Panda, Preeti Ranjan
    IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS, 2024, 43 (11) : 3913 - 3924
  • [42] Access Pattern Characterization of Last-level Cache for Effective Replacement
    Anik, Shafayat Mowla
    Lee, Byeong Kil
    2023 INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE AND COMPUTATIONAL INTELLIGENCE, CSCI 2023, 2023, : 1113 - 1116
  • [43] Discrete Cache Insertion Policies for Shared Last Level Cache Management on Large Multicores
    Sridharan, Aswinkumar
    Seznec, Andre
    2016 IEEE 30TH INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM (IPDPS 2016), 2016, : 822 - 831
  • [44] Last-Level Cache Side-Channel Attacks are Practical
    Liu, Fangfei
    Yarom, Yuval
    Ge, Qian
    Heiser, Gernot
    Lee, Ruby B.
    2015 IEEE SYMPOSIUM ON SECURITY AND PRIVACY SP 2015, 2015, : 605 - 622
  • [45] SRAM- and STT-RAM-based hybrid, shared last-level cache for on-chip CPU-GPU heterogeneous architectures
    Gao, Lan
    Wang, Rui
    Xu, Yunlong
    Yang, Hailong
    Luan, Zhongzhi
    Qian, Depei
    Zhang, Han
    Cai, Jihong
    JOURNAL OF SUPERCOMPUTING, 2018, 74 (07): : 3388 - 3414
  • [46] Buffer Filter: A Last-level Cache Management Policy for CPU-GPGPU Heterogeneous System
    Li, Songyuan
    Meng, Jinglei
    Yu, Licheng
    Ma, Jianliang
    Chen, Tianzhou
    Wu, Minghui
    2015 IEEE 17TH INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING AND COMMUNICATIONS, 2015 IEEE 7TH INTERNATIONAL SYMPOSIUM ON CYBERSPACE SAFETY AND SECURITY, AND 2015 IEEE 12TH INTERNATIONAL CONFERENCE ON EMBEDDED SOFTWARE AND SYSTEMS (ICESS), 2015, : 266 - 271
  • [47] Design of an area and energy-efficient last-level cache memory using STT-MRAM
    Saha, Rajesh
    Pundir, Yogendra Pratap
    Pal, Pankaj Kumar
    JOURNAL OF MAGNETISM AND MAGNETIC MATERIALS, 2021, 529
  • [48] Last-Level Cache Insertion and Promotion Policy in the Presence of Aggressive Prefetching
    Jimenez, Daniel A. A.
    Teran, Elvira
    Gratz, Paul V. V.
    IEEE COMPUTER ARCHITECTURE LETTERS, 2023, 22 (01) : 17 - 20
  • [49] LA-LLC: Inter-Core Locality-Aware Last-Level Cache to Exploit Many-to-Many Traffic in GPGPUs
    Zhao, Xia
    Liu, Yuxi
    Adileh, Almutaz
    Eeckhout, Lieven
    IEEE COMPUTER ARCHITECTURE LETTERS, 2017, 16 (01) : 42 - 45
  • [50] Taming Process Variations in CNFET for Efficient Last-Level Cache Design
    Xu, Dawen
    Feng, Zhuangyu
    Liu, Cheng
    Li, Li
    Wang, Ying
    Li, Huawei
    Li, Xiaowei
    IEEE TRANSACTIONS ON VERY LARGE SCALE INTEGRATION (VLSI) SYSTEMS, 2022, 30 (04) : 418 - 431