Adaptive Cache Management for Energy-efficient GPU Computing

被引:92
|
作者
Chen, Xuhao [1 ,2 ,3 ]
Chang, Li-Wen [3 ]
Rodrigues, Christopher I. [3 ]
Lv, Jie [3 ]
Wang, Zhiying [1 ,2 ]
Hwu, Wen-Mei [3 ]
机构
[1] Natl Univ Def Technol, State Key Lab High Performance Comp, Changsha, Hunan, Peoples R China
[2] Natl Univ Def Technol, Sch Comp, Changsha, Hunan, Peoples R China
[3] Univ Illinois, Dept Elect & Comp Engn, Urbana, IL USA
关键词
GPGPU; cache management; bypass; warp throttling; REPLACEMENT;
D O I
10.1109/MICRO.2014.11
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
With the SIMT execution model, GPUs can hide memory latency through massive multithreading for many applications that have regular memory access patterns. To support applications with irregular memory access patterns, cache hierarchies have been introduced to GPU architectures to capture temporal and spatial locality and mitigate the effect of irregular accesses. However, GPU caches exhibit poor efficiency due to the mismatch of the throughput-oriented execution model and its cache hierarchy design, which limits system performance and energy-efficiency. The massive amount of memory requests generated by GPUs cause cache contention and resource congestion. Existing CPU cache management policies that are designed for multicore systems, can be suboptimal when directly applied to GPU caches. We propose a specialized cache management policy for GPGPUs. The cache hierarchy is protected from contention by the bypass policy based on reuse distance. Contention and resource congestion are detected at runtime. To avoid over-saturating on-chip resources, the bypass policy is coordinated with warp throttling to dynamically control the active number of warps. We also propose a simple predictor to dynamically estimate the optimal number of active warps that can take full advantage of the cache space and on-chip resources. Experimental results show that cache efficiency is significantly improved and on-chip resources are better utilized for cache-sensitive benchmarks. This results in a harmonic mean IPC improvement of 74% and 17% (maximum 661% and 44% IPC improvement), compared to the baseline GPU architecture and optimal static warp throttling, respectively.
引用
收藏
页码:343 / 355
页数:13
相关论文
共 50 条
  • [41] Real-Time GPU Computing: Cache or No Cache?
    Huangfu, Yijie
    Zhang, Wei
    2015 IEEE 18th International Symposium on Real-Time Distributed Computing (ISORC), 2015, : 182 - 189
  • [42] CPU-GPU Utilization Aware Energy-Efficient Scheduling Algorithm on Heterogeneous Computing Systems
    Tang, Xiaoyong
    Fu, Zhuojun
    IEEE ACCESS, 2020, 8 (08): : 58948 - 58958
  • [43] Voltage-Driven Adaptive Spintronic Neuron for Energy-Efficient Neuromorphic Computing
    陈亚博
    杨晓阔
    闫涛
    危波
    崔焕卿
    李成
    刘嘉豪
    宋明旭
    蔡理
    Chinese Physics Letters, 2020, (07) : 174 - 178
  • [44] An adaptive, lightweight and energy-efficient context discovery protocol for ubiquitous computing environments
    Yau, SS
    Chandrasekar, D
    Huang, DZ
    10TH IEEE INTERNATIONAL WORKSHOP ON FUTURE TRENDS OF DISTRIBUTED COMPUTING SYSTEMS, PROCEEDINGS, 2004, : 261 - 267
  • [45] Energy-Efficient DNN Computing on GPUs Through Register File Management
    Wang, Xin
    Zhang, Wei
    2018 IEEE HIGH PERFORMANCE EXTREME COMPUTING CONFERENCE (HPEC), 2018,
  • [46] Adaptive DRL-Based Task Scheduling for Energy-Efficient Cloud Computing
    Kang, Kaixuan
    Ding, Ding
    Xie, Huamao
    Yin, Qian
    Zeng, Jing
    IEEE TRANSACTIONS ON NETWORK AND SERVICE MANAGEMENT, 2022, 19 (04): : 4948 - 4961
  • [47] Adaptive Scheduling of Stochastic Task Sequence for Energy-Efficient Mobile Cloud Computing
    Jiang, Qi
    Leung, Victor C. M.
    Tang, Hao
    Xi, Hong-Sheng
    IEEE SYSTEMS JOURNAL, 2019, 13 (03): : 3022 - 3025
  • [48] Backpropagation for Energy-Efficient Neuromorphic Computing
    Esser, Steve K.
    Appuswamy, Rathinakumar
    Merolla, Paul A.
    Arthur, John V.
    Modha, Dharmendra S.
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 28 (NIPS 2015), 2015, 28
  • [49] Voltage-Driven Adaptive Spintronic Neuron for Energy-Efficient Neuromorphic Computing
    陈亚博
    杨晓阔
    闫涛
    危波
    崔焕卿
    李成
    刘嘉豪
    宋明旭
    蔡理
    Chinese Physics Letters, 2020, 37 (07) : 174 - 178
  • [50] Energy-Efficient Computing in Nanoscale CMOS
    De, Vivek
    IEEE DESIGN & TEST, 2016, 33 (02) : 68 - 75