Memory renaming: Fast, early and accurate processing of memory communication

被引:7
|
作者
Tyson, GS [1 ]
Austin, TM [1 ]
机构
[1] Univ Michigan, Ann Arbor, MI 48109 USA
关键词
memory; pipeline; prediction; renaming; speculation;
D O I
10.1023/A:1018734923512
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
As processors continue to exploit more instruction level parallelism, greater demands are placed on the performance of the memory system. In this paper, we introduce a novel modification of the processor pipeline called memory renaming. Memory renaming applies register access techniques to load and store instructions to speed the processing of memory traffic. The approach works by accurately predicting memory communication early in the pipeline and then re-mapping the communication to fast physical registers. This work extends previous studies of data value and dependence speculation. When memory renaming is added to the processor pipeline, renaming can be applied to 30-50% of all memory references, translating to an overall improvement in execution time of up to 14% for current pipeline configurations. As store forward delay times grow larger, renaming support can lead to performance improvements of as much as 42%. Furthermore, this improvement is seen across all memory segments-including the heap segment which has often been difficult to manage efficiently.
引用
收藏
页码:357 / 380
页数:24
相关论文
共 50 条
  • [1] Memory Renaming: Fast, Early and Accurate Processing of Memory Communication
    Gary S. Tyson
    Todd M. Austin
    [J]. International Journal of Parallel Programming, 1999, 27 : 357 - 380
  • [2] Improved the accuracy and performance of memory communication through renaming
    Tyson, GS
    Austin, TM
    [J]. THIRTIETH ANNUAL IEEE/ACM INTERNATIONAL SYMPOSIUM ON MICROARCHITECTURE, PROCEEDINGS, 1997, : 218 - 227
  • [3] Randomized Renaming in Shared Memory Systems
    Berenbrink, Petra
    Brinkmann, Andre
    Elsaesser, Robert
    Friedetzky, Tom
    Nagel, Lars
    [J]. 2015 IEEE 29TH INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM (IPDPS), 2015, : 542 - 549
  • [4] FAME: A Fast and Accurate Memory Emulator for New Memory System Architecture Exploration
    Malladi, Krishna T.
    Chang, Mu-Tien
    Ping, John
    Zheng, Hongzhong
    [J]. 2015 IEEE 23rd International Symposium on Modeling, Analysis, and Simulation of Computer and Telecommunication Systems (MASCOTS 2015), 2015, : 43 - 46
  • [5] Two contributions of distinctive processing to accurate memory
    Hunt, RR
    [J]. JOURNAL OF MEMORY AND LANGUAGE, 2003, 48 (04) : 811 - 825
  • [6] Randomized renaming in shared memory systems
    Berenbrink, Petra
    Brinkmann, Andre
    Elsaesser, Robert
    Friedetzky, Tom
    Nagel, Lars
    [J]. JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 2021, 150 : 112 - 120
  • [7] DHS: Adaptive Memory Layout Organization of Sketch Slots for Fast and Accurate Data Stream Processing
    Zhao, Bohan
    Li, Xiang
    Tian, Boyu
    Mei, Zhiyu
    Wu, Wenfei
    [J]. KDD '21: PROCEEDINGS OF THE 27TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY & DATA MINING, 2021, : 2285 - 2293
  • [8] Fast and Accurate Neural Machine Translation with Translation Memory
    He, Qiuxiang
    Huang, Guoping
    Cui, Qu
    Li, Li
    Liu, Lemao
    [J]. 59TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS AND THE 11TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (ACL-IJCNLP 2021), VOL 1, 2021, : 3170 - 3180
  • [9] A fast and accurate approach to analyze cache memory Behavior
    Vera, X
    Llosa, J
    González, A
    Bermudo, N
    [J]. EURO-PAR 2000 PARALLEL PROCESSING, PROCEEDINGS, 2000, 1900 : 194 - 198
  • [10] BANDWIDTH OF FAST MEMORY IN MULTI-PROCESSING
    LI, HF
    [J]. PROCEEDINGS OF THE IEEE, 1980, 68 (05) : 630 - 632