Adaptive Cache Management for Energy-efficient GPU Computing

被引:92
|
作者
Chen, Xuhao [1 ,2 ,3 ]
Chang, Li-Wen [3 ]
Rodrigues, Christopher I. [3 ]
Lv, Jie [3 ]
Wang, Zhiying [1 ,2 ]
Hwu, Wen-Mei [3 ]
机构
[1] Natl Univ Def Technol, State Key Lab High Performance Comp, Changsha, Hunan, Peoples R China
[2] Natl Univ Def Technol, Sch Comp, Changsha, Hunan, Peoples R China
[3] Univ Illinois, Dept Elect & Comp Engn, Urbana, IL USA
关键词
GPGPU; cache management; bypass; warp throttling; REPLACEMENT;
D O I
10.1109/MICRO.2014.11
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
With the SIMT execution model, GPUs can hide memory latency through massive multithreading for many applications that have regular memory access patterns. To support applications with irregular memory access patterns, cache hierarchies have been introduced to GPU architectures to capture temporal and spatial locality and mitigate the effect of irregular accesses. However, GPU caches exhibit poor efficiency due to the mismatch of the throughput-oriented execution model and its cache hierarchy design, which limits system performance and energy-efficiency. The massive amount of memory requests generated by GPUs cause cache contention and resource congestion. Existing CPU cache management policies that are designed for multicore systems, can be suboptimal when directly applied to GPU caches. We propose a specialized cache management policy for GPGPUs. The cache hierarchy is protected from contention by the bypass policy based on reuse distance. Contention and resource congestion are detected at runtime. To avoid over-saturating on-chip resources, the bypass policy is coordinated with warp throttling to dynamically control the active number of warps. We also propose a simple predictor to dynamically estimate the optimal number of active warps that can take full advantage of the cache space and on-chip resources. Experimental results show that cache efficiency is significantly improved and on-chip resources are better utilized for cache-sensitive benchmarks. This results in a harmonic mean IPC improvement of 74% and 17% (maximum 661% and 44% IPC improvement), compared to the baseline GPU architecture and optimal static warp throttling, respectively.
引用
收藏
页码:343 / 355
页数:13
相关论文
共 50 条
  • [1] IACM: Integrated Adaptive Cache Management for High-Performance and Energy-Efficient GPGPU Computing
    Kim, Kyu Yeun
    Park, Jinsu
    Baek, Woongki
    PROCEEDINGS OF THE 34TH IEEE INTERNATIONAL CONFERENCE ON COMPUTER DESIGN (ICCD), 2016, : 380 - 383
  • [2] CNT-Cache: an Energy-Efficient Carbon Nanotube Cache with Adaptive Encoding
    Xu, Dawen
    Chu, Kexin
    Liu, Cheng
    Wang, Ying
    Zhang, Lei
    Li, Huawei
    PROCEEDINGS OF THE 2020 DESIGN, AUTOMATION & TEST IN EUROPE CONFERENCE & EXHIBITION (DATE 2020), 2020, : 963 - 966
  • [3] Energy-Efficient Adaptive Computing With Multifunctional Memory
    Qian, Wenchao
    Chen, Pai-Yu
    Karam, Robert
    Gao, Ligang
    Bhunia, Swarup
    Yu, Shimeng
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS II-EXPRESS BRIEFS, 2017, 64 (02) : 191 - 195
  • [4] Energy-Efficient Resource Management for Federated Edge Learning With CPU-GPU Heterogeneous Computing
    Zeng, Qunsong
    Du, Yuqing
    Huang, Kaibin
    Leung, Kin K.
    IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2021, 20 (12) : 7947 - 7962
  • [5] Supporting Energy-Efficient Computing on Heterogeneous CPU-GPU Architectures
    Siehl, Kyle
    Zhao, Xinghui
    2017 IEEE 5TH INTERNATIONAL CONFERENCE ON FUTURE INTERNET OF THINGS AND CLOUD (FICLOUD 2017), 2017, : 134 - 141
  • [6] HAECubie: A Highly Adaptive and Energy-Efficient Computing Demonstrator
    Eichhorn, Franz
    Dargie, Waltenegus
    Moebius, Christoph
    Rybina, Kateryna
    24TH INTERNATIONAL CONFERENCE ON COMPUTER COMMUNICATIONS AND NETWORKS ICCCN 2015, 2015,
  • [7] Multicopy Cache: A Highly Energy-Efficient Cache Architecture
    Chakraborty, Arup
    Homayoun, Houman
    Khajeh, Amin
    Dutt, Nikil
    Eltawil, Ahmed
    Kurdahi, Fadi
    ACM TRANSACTIONS ON EMBEDDED COMPUTING SYSTEMS, 2014, 13
  • [8] Energy-Efficient Resource Allocation for Cache-Assisted Mobile Edge Computing
    Cui, Ying
    He, Wen
    Ni, Chun
    Guo, Chengjun
    Liu, Zhi
    2017 IEEE 42ND CONFERENCE ON LOCAL COMPUTER NETWORKS (LCN), 2017, : 640 - 648
  • [9] Holistic Management for a more Energy-Efficient Cloud Computing
    Ayguade, Eduard
    Torres, Jordi
    ERCIM NEWS, 2010, (83): : 29 - 30
  • [10] Energy-Efficient Resource Management in Mobile Cloud Computing
    Jin, Xiaomin
    Liu, Yuanan
    Fan, Wenhao
    Wu, Fan
    Tang, Bihua
    IEICE TRANSACTIONS ON COMMUNICATIONS, 2018, E101B (04) : 1010 - 1020