An energy efficient instruction window for scalable processor architecture

Authors
Choi, Min [1]
Maeng, Seungryoul [1]
Affiliation
[1] Korea Adv Inst Sci & Technol, Dept EECS, Seoul, South Korea
Source
IEICE TRANSACTIONS ON ELECTRONICS | 2008 / Vol. E91C / No. 09
Keywords
instruction window; superscalar; low-power microarchitecture; reorder buffer; issue queue;
DOI
10.1093/ietele/e91-c.9.1427
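Chinese Library Classification
TM [Electrical engineering]; TN [Electronics and communication technology];
Discipline classification code
0808 ; 0809 ;
Abstract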
Modern microprocessors achieve high application performance at an acceptable level of power dissipation. In terms of the power-performance trade-off, the instruction window is particularly important: enlarging the window improves performance, but naive scaling of the conventional instruction window can severely increase complexity and power consumption. In this paper, we propose low-power instruction window techniques for contemporary microprocessors. First, the small reorder buffer (SROB) reduces power dissipation through deferred allocation and early release. Deferred allocation delays the SROB allocation of an instruction until all of its data dependencies are resolved. The instructions are then executed in program order and released from the SROB sooner. This results in higher resource utilization and lower power consumption. Second, we replace the conventional issue queue with a direct lookup table (DLT) that uses an efficient tag translation technique. The translation scheme resolves instruction dependencies, especially in the case of one producer with multiple consumers. Its efficiency stems from the fact that the vast majority of instruction dependencies exist within a basic block. Experimental results show that the proposed design significantly reduces power consumption on the SPEC2000 benchmarks.
Pages: 1427-1436
Page count: 10
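
To give a rough sense of why deferred allocation and early release, as described in the abstract, can shrink buffer occupancy, here is a minimal toy model. It is not the paper's SROB design: the `Instr` type, the `occupancy` function, the one-dispatch-per-cycle assumption, and the absence of structural hazards are all illustrative assumptions. It only compares how many entry-cycles a buffer is held when an entry is allocated at dispatch and freed at in-order commit, versus allocated once operands are ready and freed at completion.

```python
from dataclasses import dataclass
from typing import List, Tuple


@dataclass
class Instr:
    srcs: Tuple[int, ...]  # indices of earlier producer instructions (hypothetical encoding)
    latency: int           # execution latency in cycles


def occupancy(program: List[Instr]) -> Tuple[int, int]:
    """Return (conventional, deferred) buffer occupancy in entry-cycles.

    Toy assumptions: one instruction is dispatched per cycle, execution
    units are unlimited, and the buffer itself never fills up.
    """
    done: List[int] = []   # completion cycle of each instruction
    conventional = 0
    deferred = 0
    last_commit = 0
    for i, ins in enumerate(program):
        dispatch = i
        # Cycle at which every source operand has been produced.
        ready = max((done[s] for s in ins.srcs), default=0)
        start = max(dispatch, ready)
        finish = start + ins.latency
        done.append(finish)
        # Conventional buffer: entry held from dispatch to in-order commit.
        commit = max(finish, last_commit)
        last_commit = commit
        conventional += commit - dispatch
        # Deferred allocation + early release: entry held only while executing.
        deferred += finish - start
    return conventional, deferred


# One long-latency producer feeding two consumers, then an independent instruction.
program = [Instr((), 3), Instr((0,), 1), Instr((0,), 1), Instr((), 1)]
print(occupancy(program))  # (9, 6)
```

In this model the deferred figure can never exceed the conventional one, because an entry is allocated no earlier than dispatch and released no later than commit. How much is actually saved in a real design depends on details the abstract does not give.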