Improving Resilience to Timing Errors by Exposing Variability Effects to Software in Tightly-Coupled Processor Clusters

被引:6
|
作者
Rahimi, Abbas [1 ]
Cesarini, Daniele [2 ]
Marongiu, Andrea [2 ,3 ]
Gupta, Rajesh K. [1 ]
Benini, Luca [2 ,3 ]
机构
[1] Univ Calif San Diego, Dept Comp Sci & Engn, La Jolla, CA 92093 USA
[2] Univ Bologna, Dept Elect Elect & Informat Engn, I-40136 Bologna, Italy
[3] Swiss Fed Inst Technol, Dept Informat Technol & Elect Engn, CH-8092 Zurich, Switzerland
基金
美国国家科学基金会;
关键词
Cross-layer variability management; OpenMP; processor clusters; recovery; robust system design; scheduling; timing errors; variations; ADAPTIVE APPROACH; ENERGY-EFFICIENT; OPENMP TASKING; DESIGN; MANAGEMENT; CORE; ARCHITECTURE; CIRCUITS;
D O I
10.1109/JETCAS.2014.2315883
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Manufacturing and environmental variations cause timing errors in microelectronic processors that are typically avoided by ultra-conservative multi-corner design margins or corrected by error detection and recovery mechanisms at the circuit-level. In contrast, we present here runtime software support for cost-effective countermeasures against hardware timing failures during system operation. We propose a variability-aware OpenMP (VOMP) programming environment, suitable for tightly-coupled shared memory processor clusters, that relies upon modeling across the hardware/software interface. VOMP is implemented as an extension to the OpenMP v3.0 programming model that covers various parallel constructs, including,, and. Using the notion of work-unit vulnerability (WUV) proposed here, we capture timing errors caused by circuit-level variability as high-level software knowledge. WUV consists of descriptive metadata to characterize the impact of variability on different work-unit types running on various cores. As such, WUV provides a useful abstraction of hardware variability to efficiently allocate a given work-unit to a suitable core for execution. VOMP enables hardware/software collaboration with online variability monitors in hardware and runtime scheduling in software. The hardware provides online per-core characterization of WUV metadata. This metadata is made available by carefully placing key data structures in a shared L1 memory and is used by VOMP schedulerss. Our results show that VOMP greatly reduces the cost of timing error recovery compared to the baseline schedulers of OpenMP, yielding speedup of 3%-36% for tasks, and 26%-49% for sections. Further, VOMP reaches energy saving of 2%-46% and 15%-50% for tasks, and sections, respectively.
引用
收藏
页码:216 / 229
页数:14
相关论文
共 3 条
  • [1] Variation-tolerant OpenMP Tasking on Tightly-coupled Processor Clusters
    Rahimi, Abbas
    Marongiu, Andrea
    Burgio, Paolo
    Gupta, Rajesh K.
    Benini, Luca
    DESIGN, AUTOMATION & TEST IN EUROPE, 2013, : 541 - 546
  • [2] A Highly Efficient, Thread-Safe Software Cache Implementation for Tightly-Coupled Multicore Clusters
    Pinto, Christian
    Benini, Luca
    PROCEEDINGS OF THE 2013 IEEE 24TH INTERNATIONAL CONFERENCE ON APPLICATION-SPECIFIC SYSTEMS, ARCHITECTURES AND PROCESSORS (ASAP 13), 2013, : 281 - 288
  • [3] System Integration of Tightly-Coupled Reconfigurable Processor Arrays and Evaluation of Buffer Size Effects on Their Performance
    Lari, Vahid
    Hannig, Frank
    Teich, Juergen
    2009 INTERNATIONAL CONFERENCE ON PARALLEL PROCESSING WORKSHOPS (ICPPW 2009), 2009, : 528 - 534