Reducing data TLB power via compiler-directed address generation

被引:9
|
作者
Kadayif, Ismail
Nath, Partho
Kandemir, Mahmut
Sivasubramaniam, Anand
机构
[1] Canakkale Onsekiz Mart Univ, Dept Comp Engn, TR-17100 Canakkale, Turkey
[2] Penn State Univ, Dept Comp Sci & Engn, University Pk, PA 16802 USA
关键词
address translation; compiler optimizations; embedded systems design; low power; translation lookaside buffers (TLBs);
D O I
10.1109/TCAD.2006.882599
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Address translation using the translation lookaside buffer (TLB) consumes as much as 16 % of the chip power on some processors because of its high associativity and access frequency. While prior work has looked into optimizing this structure at the circuit and architectural levels, this paper takes a different approach to optimizing its power by reducing the number of data TLB (dTLB) lookups for data references. The main idea is to keep translations in a set of translation registers (TRs) and intelligently use them in software to directly generate the physical addresses without going through the dTLB. The software has to work within the confines of the TRs provided by the hardware and has to maximize the reuse of such translations to be effective. The authors propose strategies and code transformations for achieving this in array-based and pointer-based codes, looking to optimize data accesses. Results with a suite of Spec95 array-based and pointer-based codes show dTLB energy savings of up to 73% and 88%, respectively, compared to directly using the dTLB for all references. Despite the small increase in instructions executed with the mechanisms, the approach can, in fact, provide performance benefits in certain cache-addressing strategies.
引用
收藏
页码:312 / 324
页数:13
相关论文
共 39 条
  • [1] Compiler-directed physical address generation for reducing dTLB power
    Kadayif, I
    Nath, P
    Kandemir, M
    Sivasubramaniam, A
    ISPASS: 2004 IEEE INTERNATIONAL SYMPOSIUM ON PERFORMANCE ANALYSIS OF SYSTEMS AND SOFTWARE, 2004, : 161 - 168
  • [2] Compiler-directed code restructuring for reducing data TLB energy
    Kandemir, M
    Kadayif, I
    Chen, G
    INTERNATIONAL CONFERENCE ON HARDWARE/SOFTWARE CODESIGN AND SYSTEM SYNTHESIS, 2004, : 98 - 103
  • [3] Compiler-directed early load-address generation
    Cheng, BC
    Connors, DA
    Hwu, WMW
    31ST ANNUAL ACM/IEEE INTERNATIONAL SYMPOSIUM ON MICROARCHITECTURE, PROCEEDINGS, 1998, : 138 - 147
  • [4] Reducing power through compiler-directed barrier synchronization elimination
    Kandemir, Mahmut
    Son, Seung Woo
    ISLPED '06: PROCEEDINGS OF THE 2006 INTERNATIONAL SYMPOSIUM ON LOW POWER ELECTRONICS AND DESIGN, 2006, : 354 - 357
  • [5] Compiler-directed voltage scaling on communication links for reducing power consumption
    Li, F
    Chen, G
    Kandemir, M
    ICCAD-2005: INTERNATIONAL CONFERENCE ON COMPUTER AIDED DESIGN, DIGEST OF TECHNICAL PAPERS, 2005, : 456 - 460
  • [6] Compiler-Directed Power Management for Superscalars
    Haj-Yihia, Jawad
    Ben Asher, Yosi
    Rotem, Efraim
    Yasin, Ahmad
    Ginosar, Ran
    ACM TRANSACTIONS ON ARCHITECTURE AND CODE OPTIMIZATION, 2014, 11 (04)
  • [7] Compiler-directed data cache leakage reduction
    Zhang, W
    VLSI 2004: IEEE COMPUTER SOCIETY ANNUAL SYMPOSIUM ON VLSI, PROCEEDINGS, 2004, : 305 - 306
  • [8] Compiler-directed data partitioning for multicluster processors
    Chu, Michael L.
    Mahlke, Scott A.
    CGO 2006: 4TH INTERNATIONAL SYMPOSIUM ON CODE GENERATION AND OPTIMIZATION, 2006, : 208 - +
  • [9] Performance potentials of compiler-directed data speculation
    Wu, YF
    Chen, LL
    Ju, R
    Fang, J
    ISPASS: 2003 IEEE INTERNATIONAL SYMPOSIUM ON PERFORMANCE ANALYSIS OF SYSTEMS AND SOFTWARE, 2003, : 22 - 31
  • [10] A Compiler-Directed Data Prefetching Scheme for Chip Multiprocessors
    Son, Seung Woo
    Kandemir, Mahmut
    Karakoy, Mustafa
    Chakrabarti, Dhruva
    ACM SIGPLAN NOTICES, 2009, 44 (04) : 209 - 218