Quantifying the performance and energy efficiency of advanced cache indexing for GPGPU computing

被引:7
|
作者
Kim, Kyu Yeun [1 ]
Baek, Woongki [1 ]
机构
[1] UNIST, Sch ECE, 50 UNIST Gil, Ulsan, South Korea
基金
新加坡国家研究基金会;
关键词
Advanced cache indexing; GPGPU computing; High performance; Energy efficiency;
D O I
10.1016/j.micpro.2016.01.003
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
To achieve higher performance and energy efficiency, GPGPU architectures have recently begun to employ hardware caches. Adding caches to GPGPUs, however, does not always guarantee improved performance and energy efficiency due to the thrashing in small caches shared by thousands of threads. While prior work has proposed warp-scheduling and cache-bypassing techniques to address this issue, relatively little work has been done in the context of advanced cache indexing (ACI). To bridge this gap, this work investigates the effectiveness of ACI for high-performance and energy efficient GPGPU computing. We discuss the design and implementation of static and adaptive cache indexing schemes for GPGPUs. We then quantify the effectiveness of the ACI schemes based on a cycle accurate GPGPU simulator. Our quantitative evaluation demonstrates that the ACI schemes are effective in that they provide significant performance and energy-efficiency gains over the conventional indexing scheme. Further, we investigate the performance sensitivity of ACI to key architectural parameters (e.g., indexing latency and cache associativity). Our experimental results show that the ACI schemes are promising in that they continue to provide significant performance gains even when additional indexing latency occurs due to the hardware complexity and the baseline cache is enhanced with high associativity or large capacity. (C) 2016 Elsevier B.V. All rights reserved.
引用
收藏
页码:81 / 94
页数:14
相关论文
共 50 条
  • [1] Improving the Performance and Energy Efficiency of GPGPU Computing through Integrated Adaptive Cache Management
    Kim, Kyu Yeun
    Park, Jinsu
    Baek, Woongki
    [J]. IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2019, 30 (03) : 630 - 645
  • [2] Applying Victim Cache in High Performance GPGPU Computing
    Fan, Fengfeng
    Wang, Jianfei
    Jiang, Li
    Liang, Xiaoyao
    Jing, Naifeng
    [J]. 2016 15TH INTERNATIONAL SYMPOSIUM ON PARALLEL AND DISTRIBUTED COMPUTING (ISPDC), 2016, : 24 - 29
  • [3] Incorporating selective victim cache into GPGPU for high-performance computing
    Wang, Jianfei
    Fan, Fengfeng
    Jiang, Li
    Liang, Xiaoyao
    Jing, Naifeng
    [J]. CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2017, 29 (24):
  • [4] IACM: Integrated Adaptive Cache Management for High-Performance and Energy-Efficient GPGPU Computing
    Kim, Kyu Yeun
    Park, Jinsu
    Baek, Woongki
    [J]. PROCEEDINGS OF THE 34TH IEEE INTERNATIONAL CONFERENCE ON COMPUTER DESIGN (ICCD), 2016, : 380 - 383
  • [5] Optimizing GPGPU Kernel Summation for Performance and Energy Efficiency
    Wang, Jiajun
    Khawaja, Ahmed
    Biros, George
    Gerstlauer, Andreas
    John, Lizy K.
    [J]. PROCEEDINGS OF 45TH INTERNATIONAL CONFERENCE ON PARALLEL PROCESSING WORKSHOPS (ICPPW 2016), 2016, : 123 - 132
  • [6] Quantifying the energy efficiency challenges of achieving exascale computing
    Mair, Jason
    Huang, Zhiyi
    Eyers, David
    Chen, Yawen
    [J]. 2015 15TH IEEE/ACM INTERNATIONAL SYMPOSIUM ON CLUSTER, CLOUD AND GRID COMPUTING, 2015, : 943 - 950
  • [7] Performance and Energy Efficiency in Distributed Computing
    Luntovskyy, Andriy
    [J]. HARD AND SOFT COMPUTING FOR ARTIFICIAL INTELLIGENCE, MULTIMEDIA AND SECURITY, 2017, 534 : 281 - 292
  • [8] A Cache-Aware Performance Prediction Framework for GPGPU Computations
    Poeppl, Alexander
    Herz, Alexander
    [J]. EURO-PAR 2015: PARALLEL PROCESSING WORKSHOPS, 2015, 9523 : 749 - 760
  • [9] Measuring the Energy Efficiency of Transactional Loads on GPGPU
    von Kistowski, Joakim
    Pais, Johann
    Wahl, Tobias
    Lange, Klaus-Dieter
    Block, Hansfried
    Beckett, John
    Kounev, Samuel
    [J]. PROCEEDINGS OF THE 2019 ACM/SPEC INTERNATIONAL CONFERENCE ON PERFORMANCE ENGINEERING (ICPE '19), 2019, : 219 - 230
  • [10] Efficient Management of Cache Accesses to Boost GPGPU Memory Subsystem Performance
    Candel, Francisco
    Valero, Alejandro
    Petit, Salvador
    Sahuquillo, Julio
    [J]. IEEE TRANSACTIONS ON COMPUTERS, 2019, 68 (10) : 1442 - 1454