MEMORY ACCESS COALESCING - A TECHNIQUE FOR ELIMINATING REDUNDANT MEMORY ACCESSES

被引:0
|
作者
DAVIDSON, JW [1 ]
JINTURKAR, S [1 ]
机构
[1] UNIV VIRGINIA,DEPT COMP SCI,CHARLOTTESVILLE,VA 22903
来源
SIGPLAN NOTICES | 1994年 / 29卷 / 06期
关键词
D O I
暂无
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
As microprocessor speeds increase, memory bandwidth is increasingly the performance bottleneck for microprocessors. This has occurred because innovation and technological improvements in processor design have outpaced advances in memory design. Most attempts at addressing this problem have involved hardware solutions. Unfortunately, these solutions do little to help the situation with respect to current microprocessors. In previous work, we developed, implemented, and evaluated an algorithm that exploited the ability of newer machines with wide-buses to load/store multiple floating-point operands in a single memory reference. This paper describes a general code improvement algorithm that transforms code to better exploit the available memory bandwidth on existing microprocessors as well as wide-bus machines. Where possible and advantageous, the algorithm coalesces narrow memory references into wide ones. An interesting characteristic of the algorithm is that some decisions about the applicability of the transformation are made at run time. This dynamic analysis significantly increases the probability of the transformation being applied. The code improvement transformation was implemented and added to the repertoire of code improvements of an existing retargetable optimizing back end. Using three current architectures as evaluation platforms, the effectiveness of the transformation was measured on a set of compute- and memory-intensive programs. Interestingly, the effectiveness of the transformation varied significantly with respect to the instruction-set architecture of the tested platform. For one of the tested architectures, improvements in execution speed ranging from 5 to 40 percent were observed. For another, the improvements in execution speed ranged from 5 to 20 percent, while for vet another, the transformation resulted in slower code for all programs.
引用
收藏
页码:186 / 195
页数:10
相关论文
共 50 条
  • [41] A Unified Approach to Eliminate Memory Accesses Early
    Islam, Mafijul Md.
    Stenstrom, Per
    [J]. PROCEEDINGS OF THE PROCEEDINGS OF THE 14TH INTERNATIONAL CONFERENCE ON COMPILERS, ARCHITECTURES AND SYNTHESIS FOR EMBEDDED SYSTEMS (CASES '11), 2011, : 55 - 64
  • [42] FPGA SAR processor with window memory accesses
    Dou, Yong
    Zhou, Jie
    Lei, Yuanwu
    Zhou, Xingming
    [J]. 2007 IEEE INTERNATIONAL CONFERENCE ON APPLICATION-SPECIFIC SYSTEMS, ARCHITECTURES, AND PROCESSORS, 2007, : 95 - 100
  • [43] Optimization of data accesses in reflective memory systems
    Ryou, Myung Seon
    Lee, Chang Won
    Park, Ki Woung
    Song, Seung-Whan
    Choi, Changho
    [J]. TENCON 2006 - 2006 IEEE REGION 10 CONFERENCE, VOLS 1-4, 2006, : 1344 - +
  • [44] Parallel memory architecture for arbitrary stride accesses
    Aho, Eero
    Vanne, Jarno
    Hamalainen, Timo D.
    [J]. PROCEEDINGS OF THE 2006 IEEE WORKSHOP ON DESIGN AND DIAGNOSTICS OF ELECTRONIC CIRCUITS AND SYSTEMS, 2006, : 65 - +
  • [45] Response time distribution of flash memory accesses
    Harrison, Peter G.
    Patel, Naresh M.
    Zertal, Soraya
    [J]. PERFORMANCE EVALUATION, 2010, 67 (04) : 248 - 259
  • [46] SPEED SYSTEM MEMORY BY INTERLEAVING DRAM ACCESSES
    MEKHIEL, N
    [J]. ELECTRONIC DESIGN, 1989, 37 (21) : 65 - &
  • [47] Memory-access-aware data structure transformations for embedded software with dynamic data accesses
    Daylight, EG
    Atienza, D
    Vandecappelle, A
    Catthoor, F
    Mendias, JA
    [J]. IEEE TRANSACTIONS ON VERY LARGE SCALE INTEGRATION (VLSI) SYSTEMS, 2004, 12 (03) : 269 - 280
  • [48] Eliminating the memory blocking effect
    Leynes, P. Andrew
    Rass, Olga
    Landau, Joshua D.
    [J]. MEMORY, 2008, 16 (08) : 852 - 872
  • [49] Scheduling memory accesses through a shared bus
    Almog, E
    Shachnai, H
    [J]. PERFORMANCE EVALUATION, 2001, 46 (2-3) : 193 - 218
  • [50] AUTOMATIC GENERATION OF MAPS OF MEMORY ACCESSES FOR ENERGY-AWARE MEMORY MANAGEMENT
    Balasa, Florin
    Luican, Ilie I.
    Zhu, Hongwei
    Nasui, Doru V.
    [J]. 2009 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1- 8, PROCEEDINGS, 2009, : 629 - +