Augmenting modern superscalar architectures with configurable extended instructions

被引:0
|
作者
Zhou, XF [1 ]
Martonosi, M [1 ]
机构
[1] Princeton Univ, Dept Elect Engn, Princeton, NJ 08544 USA
关键词
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
The instruction sets of general-purpose microprocessors are designed to offer good performance across a wide range of programs, The size and complexity of the instruction sets, however, are limited by a need for generality and for streamlined implementation. The particular needs of one application are balanced against the needs of the full range of applications considered. For this reason, one can "design" a better instruction set when considering only a single application than when considering a general collection of applications. Configurable hardware gives us the opportunity to explore this option. This paper examines the potential for automatically identifying application-specific extended instructions and implementing them in programmable functional units based on configurable hardware. Adding fine-grained reconfigurable hardware to the datapath of an out-of-order issue superscalar processor allows 4-44% speedups on the MediaBench benchmarks [1]. As a key contribution of our work, we present a selective algorithm for choosing extended instructions to minimize reconfiguration costs within loops. Our selective algorithm constrains instruction choices so that significant speedups are achieved with as few as 4 moderately sized programmable functional units, typically containing less than 150 look-up tables each.
引用
收藏
页码:941 / 950
页数:10
相关论文
共 50 条
  • [1] INSTRUCTION SCHEDULING FOR SUPERSCALAR ARCHITECTURES
    LAM, MS
    ANNUAL REVIEW OF COMPUTER SCIENCE, 1989, 4 : 173 - 201
  • [2] VLIW compilation techniques for superscalar architectures
    Stümpel, E
    Thies, M
    Kastens, U
    COMPILER CONSTRUCTION, 1998, 1383 : 234 - 248
  • [3] TRADEOFFS IN PROCESSOR DESIGN FOR SUPERSCALAR ARCHITECTURES
    MURAKAMI, K
    KUGA, M
    GWUN, O
    TOMITA, S
    IEICE TRANSACTIONS ON COMMUNICATIONS ELECTRONICS INFORMATION AND SYSTEMS, 1991, 74 (11): : 3883 - 3893
  • [4] PALMED: Throughput Characterization for Superscalar Architectures
    Derumigny, Nicolas
    Bastian, Theophile
    Gruber, Fabian
    Iooss, Guillaume
    Guillon, Christophe
    Pouchet, Louis-Noel
    Rastello, Fabrice
    CGO '22: PROCEEDINGS OF THE 2022 IEEE/ACM INTERNATIONAL SYMPOSIUM ON CODE GENERATION AND OPTIMIZATION (CGO), 2022, : 106 - 117
  • [5] Modeling assembly in superscalar instruction timing architectures
    Beltrame, G
    Brandolese, C
    Fornaciari, W
    Salice, F
    Sciuto, D
    Trianni, V
    ISSS'02: 15TH INTERNATIONAL SYMPOSIUM ON SYSTEM SYNTHESIS, 2002, : 132 - 137
  • [6] Color-Aware Instructions for Embedded Superscalar Processors
    Kim, Jongmyon
    Wills, Linda M.
    Wills, D. Scott
    JOURNAL OF SIGNAL PROCESSING SYSTEMS FOR SIGNAL IMAGE AND VIDEO TECHNOLOGY, 2011, 64 (03): : 335 - 350
  • [7] Analyzing instruction prefetch schemes in superscalar architectures
    dos Santos, TGS
    Bampi, S
    PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED PROCESSING TECHNIQUES AND APPLICATIONS, VOLS I-V, 2000, : 2109 - 2115
  • [8] An evaluation of asynchronous and synchronous design for superscalar architectures
    Davey, A
    Lloyd, D
    INTERNATIONAL CONFERENCE ON COMPUTER DESIGN - VLSI IN COMPUTERS AND PROCESSORS, PROCEEDINGS, 1997, : 295 - 300
  • [9] Java']Java Optimization for Superscalar and Vector Architectures
    Lyon, Douglas
    JOURNAL OF OBJECT TECHNOLOGY, 2005, 4 (02): : 27 - 39
  • [10] Handling 16 instructions per cycle in a superscalar processor
    Goossens, B
    FUTURE GENERATION COMPUTER SYSTEMS, 2001, 17 (06) : 699 - 709