PPU: A Control Error-Tolerant Processor for Streaming Applications with Formal Guarantees

Cited by: 0
Authors
Golnari, Pareesa Ameneh [1]
Yetim, Yavuz [2, 4]
Martonosi, Margaret [3]
Vizel, Yakir [1]
Malik, Sharad [1]
Affiliations
[1] Princeton Univ, Dept Elect Engn, Princeton, NJ 08544 USA
[2] Princeton Univ, Princeton, NJ 08544 USA
[3] Princeton Univ, Dept Comp Sci, Princeton, NJ 08544 USA
[4] Google, 345 Spear St, San Francisco, CA 94105 USA
Funding
U.S. National Science Foundation
Keywords
Error-tolerant computing; streaming applications; reliability requirements; progress; control flow; verification; APPROXIMATE; RELIABILITY; POWER; SAFE
DOI
10.1145/2990502
Chinese Library Classification number
TP3 [Computing technology, computer technology]
Discipline classification code
0812
Abstract
With increasing technology scaling and design complexity, threats from device and circuit failures are growing. This is expected to worsen with post-CMOS devices. Current error-resilient solutions ensure reliability of circuits through protection mechanisms such as redundancy, error correction, and recovery. However, the costs of these solutions may be high, rendering them impractical. In contrast, error-tolerant solutions allow errors in the computation and are suited to error-tolerant applications such as media applications. For such programmable error-tolerant processors, the instruction set architecture (ISA) no longer serves as a specification, since it is acceptable for the processor to allow errors during the execution of instructions. In this work, we address this specification gap by defining the basic requirements an error-tolerant processor must meet to provide acceptable results. Furthermore, we formally define properties that capture these requirements. Based on this, we propose the Partially Protected Uniprocessor (PPU), an error-tolerant processor that aims to meet these requirements with low-cost microarchitectural support. Instead of ensuring instruction-level or byte-level correctness, its protection mechanisms convert potentially fatal control errors into potentially tolerable data errors. The protection mechanisms in PPU protect the system against crashes, unresponsiveness, and external device corruption. They also provide support for achieving acceptable result quality. We further provide a methodology that formally proves the specification properties on PPU using model checking. This methodology uses models for the hardware and software that are integrated with the fault and recovery models. Finally, we experimentally demonstrate the results of model checking and the application-level quality of results for PPU.
Pages: 29