Restructuring computations for temporal data cache locality

Cited by: 5
Authors
Pingali, VK [1]
McKee, SA
Hsieh, WC
Carter, JB
Affiliations
[1] Univ So Calif, Inst Informat Sci, Marina Del Rey, CA 90292 USA
[2] Cornell Univ, Ithaca, NY 14853 USA
[3] Univ Utah, Sch Comp, Salt Lake City, UT 84112 USA
Funding
National Science Foundation (USA)
Keywords
memory performance; data structures; optimization
DOI
10.1023/A:1024556711058
Chinese Library Classification number
TP301 [Theory and methods]
Discipline classification code
081202
Abstract
Data access costs contribute significantly to the execution time of applications with complex data structures. As the latency of memory accesses becomes high relative to processor cycle times, application performance is increasingly limited by memory performance. In some situations it is useful to trade increased computation costs for reduced memory costs. The contributions of this paper are three-fold: we provide a detailed analysis of the memory performance of seven memory-intensive benchmarks; we describe Computation Regrouping, a source-level approach to improving the performance of memory-bound applications by increasing temporal locality to eliminate cache and TLB misses; and we demonstrate significant performance improvement by applying Computation Regrouping to our suite of seven benchmarks. Using Computation Regrouping, we observe a geometric mean speedup of 1.90, with individual speedups ranging from 1.26 to 3.03. Most of this improvement comes from eliminating memory stall time.
Pages: 305-338
Page count: 34
Related papers
50 in total
  • [1] Restructuring Computations for Temporal Data Cache Locality
    Venkata K. Pingali
    Sally A. McKee
    Wilson C. Hsieh
    John B. Carter
    [J]. International Journal of Parallel Programming, 2003, 31 : 305 - 338
  • [2] In-Cache MapReduce: Leverage Tiling to Boost Temporal Locality-Sensitive MapReduce Computations
    Magro, Daniel
    Paulino, Herve
    [J]. 2016 IEEE INTERNATIONAL CONFERENCE ON CLUSTER COMPUTING (CLUSTER), 2016, : 374 - 383
  • [3] Restructuring I/O-intensive computations for locality
    Kandemir, M
    Choudhary, A
    Ramanujam, J
    [J]. HIGH-PERFORMANCE COMPUTING AND NETWORKING, PROCEEDINGS, 1999, 1593 : 1097 - 1106
  • [4] Data Locality Exploitation in Cache Compression
    Zeng, Qi
    Jha, Rakesh
    Chen, Shigang
    Peir, Jih-Kwon
    [J]. 2018 IEEE 24TH INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED SYSTEMS (ICPADS 2018), 2018, : 347 - 354
  • [5] Cache resident data locality analysis
    Samdani, QG
    Thornton, MA
    [J]. 8TH INTERNATIONAL SYMPOSIUM ON MODELING, ANALYSIS AND SIMULATION OF COMPUTER AND TELECOMMUNICATION SYSTEMS, PROCEEDINGS, 2000, : 539 - 546
  • [6] New tiling techniques to improve cache temporal locality
    Song, YH
    Li, ZY
    [J]. ACM SIGPLAN NOTICES, 1999, 34 (05) : 215 - 228
  • [7] Scalar metric for temporal locality and estimation of cache performance
    Alakarhu, J
    Niittylahti, J
    [J]. DESIGN, AUTOMATION AND TEST IN EUROPE CONFERENCE AND EXHIBITION, VOLS 1 AND 2, PROCEEDINGS, 2004, : 730 - 731
  • [8] A new cache architecture based on temporal and spatial locality
    Lee, JH
    Lee, JS
    Kim, SD
    [J]. JOURNAL OF SYSTEMS ARCHITECTURE, 2000, 46 (15) : 1451 - 1467
  • [9] A schedule cache for data parallel unstructured computations
    Brandes, T
    Germain-Renaud, C
    [J]. PARALLEL COMPUTING, 2000, 26 (13-14) : 1807 - 1823
  • [10] Temporal locality and its impact on Web proxy cache performance
    Mahanti, A
    Eager, D
    Williamson, C
    [J]. PERFORMANCE EVALUATION, 2000, 42 (2-3) : 187 - 203