A Systematic Approach for Optimized Bypass Configurations for Application-Specific Embedded Processors

Authors
Jungeblut, Thorsten [1]
Huebener, Boris [2]
Porrmann, Mario [2]
Rueckert, Ulrich [1]
Affiliations
[1] Univ Bielefeld, Cognit Interact Technol Ctr Excellence, Bielefeld, Germany
[2] Univ Paderborn, Heinz Nixdorf Inst, Paderborn, Germany
Keywords
Algorithms; Design; Pipeline bypass; forwarding; optimized; VLIW; pipeline; application specific; CoreVA; multifrequency; DFS
DOI
10.1145/2514641.2514645
Chinese Library Classification number
TP3 [Computing technology, computer technology]
Discipline classification code
0812
Abstract
The diversity of today's mobile applications requires embedded processor cores with high resource efficiency; that is, the devices should provide high performance at low area requirements and power consumption. The fine-grained parallelism supported by the multiple functional units of VLIW architectures offers high throughput at reasonably low clock frequencies compared to single-core RISC processors. To utilize the processor pipeline efficiently, common system architectures have to cope with data hazards caused by data dependencies between consecutive operations. On the one hand, such hazards can be resolved by complex forwarding circuits (i.e., a pipeline bypass) that forward intermediate results to a subsequent instruction. On the other hand, the pipeline bypass can strongly affect or even dominate the total resource requirements and degrade the maximum clock frequency. In this work, the CoreVA VLIW architecture is used for the development and analysis of application-specific bypass configurations. It is shown that many paths of a comprehensive bypass system are rarely used and may not be required for certain applications. For this reason, several strategies have been implemented to enhance the efficiency of the total system by introducing application-specific bypass configurations. The configuration can be carried out statically, by implementing only the required paths, or at runtime, by dynamically reconfiguring the hardware. An algorithm is proposed that derives an optimized configuration by iteratively disabling single bypass paths. Applying these application-specific bypass configurations reduces the critical path by 26%. As a result, execution time and energy requirements could be reduced by up to 21.5%. Combining Dynamic Frequency Scaling (DFS) with dynamic deactivation and reactivation of bypass paths allows the bypass system to be reconfigured at runtime. This ensures the highest efficiency while processing varying applications.
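The idea of iteratively disabling single bypass paths can be illustrated with a minimal Python sketch. It is not the algorithm from the paper, whose details the abstract does not give. It is a greedy toy model under stated assumptions: the `BypassPath` fields, the cost model (clock period set by the slowest enabled path, a fixed stall penalty for every use of a disabled path), and all numbers are hypothetical and chosen only for illustration.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class BypassPath:
    """Hypothetical description of one forwarding path (not from the paper)."""
    name: str
    delay_ns: float    # assumed clock period needed while this path is enabled
    uses: int          # assumed number of times the application forwards over it
    stall_cycles: int  # assumed stall penalty per use once the path is disabled


def execution_time_ns(enabled, all_paths, base_cycles, base_period_ns):
    """Toy cost model: (cycles + stalls) * clock period."""
    period = max([base_period_ns] + [p.delay_ns for p in enabled])
    stalls = sum(p.uses * p.stall_cycles for p in all_paths if p not in enabled)
    return (base_cycles + stalls) * period


def optimize_bypass(all_paths, base_cycles, base_period_ns):
    """Greedily disable one path per iteration while execution time does not grow."""
    enabled = set(all_paths)
    best = execution_time_ns(enabled, all_paths, base_cycles, base_period_ns)
    while enabled:
        # Evaluate the effect of disabling each remaining path on its own.
        candidates = [
            (execution_time_ns(enabled - {p}, all_paths, base_cycles, base_period_ns), p)
            for p in enabled
        ]
        time_ns, path = min(candidates, key=lambda c: (c[0], c[1].name))
        if time_ns > best:
            break  # every further removal would slow the application down
        enabled.remove(path)
        best = time_ns
    return enabled, best


# Hypothetical three-path bypass and workload profile.
paths = [
    BypassPath("EX->EX", delay_ns=2.0, uses=900, stall_cycles=1),
    BypassPath("MEM->EX", delay_ns=2.3, uses=40, stall_cycles=2),
    BypassPath("WB->EX", delay_ns=2.7, uses=5, stall_cycles=3),
]
kept, best_ns = optimize_bypass(paths, base_cycles=10_000, base_period_ns=2.0)
print(sorted(p.name for p in kept), best_ns)
```

In this toy profile the rarely used slow paths are dropped because the shorter clock period outweighs the added stall cycles, while the heavily used path is kept. A real flow would take path usage from application profiling and timing from synthesis results rather than from fixed constants.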
Pages: 25