Adaptive Cache Management for Energy-efficient GPU Computing

被引:92
|
作者
Chen, Xuhao [1 ,2 ,3 ]
Chang, Li-Wen [3 ]
Rodrigues, Christopher I. [3 ]
Lv, Jie [3 ]
Wang, Zhiying [1 ,2 ]
Hwu, Wen-Mei [3 ]
机构
[1] Natl Univ Def Technol, State Key Lab High Performance Comp, Changsha, Hunan, Peoples R China
[2] Natl Univ Def Technol, Sch Comp, Changsha, Hunan, Peoples R China
[3] Univ Illinois, Dept Elect & Comp Engn, Urbana, IL USA
关键词
GPGPU; cache management; bypass; warp throttling; REPLACEMENT;
D O I
10.1109/MICRO.2014.11
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
With the SIMT execution model, GPUs can hide memory latency through massive multithreading for many applications that have regular memory access patterns. To support applications with irregular memory access patterns, cache hierarchies have been introduced to GPU architectures to capture temporal and spatial locality and mitigate the effect of irregular accesses. However, GPU caches exhibit poor efficiency due to the mismatch of the throughput-oriented execution model and its cache hierarchy design, which limits system performance and energy-efficiency. The massive amount of memory requests generated by GPUs cause cache contention and resource congestion. Existing CPU cache management policies that are designed for multicore systems, can be suboptimal when directly applied to GPU caches. We propose a specialized cache management policy for GPGPUs. The cache hierarchy is protected from contention by the bypass policy based on reuse distance. Contention and resource congestion are detected at runtime. To avoid over-saturating on-chip resources, the bypass policy is coordinated with warp throttling to dynamically control the active number of warps. We also propose a simple predictor to dynamically estimate the optimal number of active warps that can take full advantage of the cache space and on-chip resources. Experimental results show that cache efficiency is significantly improved and on-chip resources are better utilized for cache-sensitive benchmarks. This results in a harmonic mean IPC improvement of 74% and 17% (maximum 661% and 44% IPC improvement), compared to the baseline GPU architecture and optimal static warp throttling, respectively.
引用
收藏
页码:343 / 355
页数:13
相关论文
共 50 条
  • [21] Energy-Efficient Cloud Computing
    Berl, Andreas
    Gelenbe, Erol
    Di Girolamo, Marco
    Giuliani, Giovanni
    De Meer, Hermann
    Dang, Minh Quan
    Pentikousis, Kostas
    COMPUTER JOURNAL, 2010, 53 (07): : 1045 - 1051
  • [22] Energy-efficient quantum computing
    Ikonen, Joni
    Salmilehto, Juha
    Mottonen, Mikko
    NPJ QUANTUM INFORMATION, 2017, 3
  • [23] Toward Energy-Efficient Computing
    Brown, David J.
    Reams, Charles
    COMMUNICATIONS OF THE ACM, 2010, 53 (03) : 50 - 58
  • [24] Energy-efficient quantum computing
    Joni Ikonen
    Juha Salmilehto
    Mikko Möttönen
    npj Quantum Information, 3
  • [25] Energy-Efficient Cache Partitioning For Future CMPs
    Sundararajan, Karthik T.
    Jones, Timothy M.
    Topham, Nigel P.
    PROCEEDINGS OF THE 21ST INTERNATIONAL CONFERENCE ON PARALLEL ARCHITECTURES AND COMPILATION TECHNIQUES (PACT'12), 2012, : 465 - 466
  • [26] The Tag Filter Cache: An Energy-Efficient Approach
    Valls, Joan J.
    Sahuquillo, Julio
    Ros, Alberto
    Gomez, Maria E.
    23RD EUROMICRO INTERNATIONAL CONFERENCE ON PARALLEL, DISTRIBUTED, AND NETWORK-BASED PROCESSING (PDP 2015), 2015, : 182 - 189
  • [27] An Energy-efficient Buffer Cache Replacement Algorithm
    Yue, Jianhui
    Zhu, Yifeng
    Cai, Zhao
    2008 IEEE INTERNATIONAL SYMPOSIUM ON MODELING, ANALYSIS & SIMULATION OF COMPUTER AND TELECOMMUNICATION SYSTEMS (MASCOTS), 2008, : 355 - 356
  • [28] Energy-efficient cache architecture for multimedia applications
    Yang, CL
    Lee, CH
    Tseng, HW
    2005 Emerging Information Technology Conference (EITC), 2005, : 165 - 166
  • [29] TCache: An Energy-Efficient DRAM Cache Design
    He, Jiacong
    Callenes-Sloan, Joseph
    2017 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS (ISCAS), 2017,
  • [30] An energy-efficient replacement policy for data cache
    Musalappa, S
    Sundaram, S
    Chu, Y
    PROCEEDINGS OF THE IEEE SOUTHEASTCON 2004: EXCELLENCE IN ENGINEERING, SCIENCE, AND TECHNOLOGY, 2005, : 599 - 602