PPU: A Control Error-Tolerant Processor for Streaming Applications with Formal Guarantees

被引:0
|
作者
Golnari, Pareesa Ameneh [1 ]
Yetim, Yavuz [2 ,4 ]
Martonosi, Margaret [3 ]
Vizel, Yakir [1 ]
Malik, Sharad [1 ]
机构
[1] Princeton Univ, Dept Elect Engn, Princeton, NJ 08544 USA
[2] Princeton Univ, Princeton, NJ 08544 USA
[3] Princeton Univ, Dept Comp Sci, Princeton, NJ 08544 USA
[4] Google, 345 Spear St, San Francisco, CA 94105 USA
基金
美国国家科学基金会;
关键词
Error-tolerant computing; streaming applications; reliability requirements; progress; control flow; verification; APPROXIMATE; RELIABILITY; POWER; SAFE;
D O I
10.1145/2990502
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
With increasing technology scaling and design complexity there are increasing threats from device and circuit failures. This is expected to worsen with post-CMOS devices. Current error-resilient solutions ensure reliability of circuits through protection mechanisms such as redundancy, error correction, and recovery. However, the costs of these solutions may be high, rendering them impractical. In contrast, error-tolerant solutions allow errors in the computation and are positioned to be suitable for error-tolerant applications such as media applications. For such programmable error-tolerant processors, the Instruction-Set-Architecture (ISA) no longer serves as a specification since it is acceptable for the processor to allow for errors during the execution of instructions. In this work, we address this specification gap by defining the basic requirements needed for an error-tolerant processor to provide acceptable results. Furthermore, we formally define properties that capture these requirements. Based on this, we propose the Partially Protected Uniprocessor (PPU), an error-tolerant processor that aims to meet these requirements with low-cost microarchitectural support. These protection mechanisms convert potentially fatal control errors to potentially tolerable data errors instead of ensuring instruction-level or byte-level correctness. The protection mechanisms in PPU protect the system against crashes, unresponsiveness, and external device corruption. In addition, they also provide support for achieving acceptable result quality. Additionally, we provide a methodology that formally proves the specification properties on PPU using model checking. This methodology uses models for the hardware and software that are integrated with the fault and recovery models. Finally, we experimentally demonstrate the results of model checking and the application-level quality of results for PPU.
引用
收藏
页数:29
相关论文
共 49 条
  • [41] Approximate MRAM: High-Performance and Power-Efficient Computing With MRAM Chips for Error-Tolerant Applications
    Ferdaus, Farah
    Talukder, B. M. S. Bahar
    Rahman, Md Tauhidur
    IEEE TRANSACTIONS ON COMPUTERS, 2023, 72 (03) : 668 - 681
  • [42] Novel approximate Booth multipliers (ABm-eRx) based on efficient encoding and reduction for error-tolerant applications
    Moses, Jayasheela
    Balasubramani, Sukanya
    Krishnamoorthy, Umapathi
    ANALOG INTEGRATED CIRCUITS AND SIGNAL PROCESSING, 2025, 123 (02)
  • [43] Lack-tolerant and error-tolerant control of five-phase induction motor based on effective time carrier type PWM
    Li Le
    Ye Zhi-hao
    Xia Yi-hui
    INTERNATIONAL CONFERENCE ON SENSORS AND INSTRUMENTS (ICSI 2021), 2021, 11887
  • [44] Majority logic based area-delay efficient 1-bit approximate adder for error-tolerant applications
    Parameshwara, M. C.
    Maroof, Naeem
    Khan, Angshuman
    ENGINEERING RESEARCH EXPRESS, 2022, 4 (02):
  • [45] Error-Tolerant and Energy-Efficient Coverage Control Based on Biological Attractor Selection Model in Wireless Sensor Networks
    Iwai, Takuya
    Wakamiya, Naoki
    Murata, Masayuki
    INTERNATIONAL JOURNAL OF DISTRIBUTED SENSOR NETWORKS, 2012,
  • [46] A 32kb SRAM for Error-Free and Error-Tolerant Applications with Dynamic Energy-Quality Management in 28nm CMOS
    Frustaci, Fabio
    Khayatzadeh, Mahmood
    Blaauw, David
    Sylvester, Dennis
    Alioto, Massimo
    2014 IEEE INTERNATIONAL SOLID-STATE CIRCUITS CONFERENCE DIGEST OF TECHNICAL PAPERS (ISSCC), 2014, 57 : 244 - +
  • [47] Energy Consumption and Lifetime Improvement of Coarse-Grained Reconfigurable Architectures Targeting Low-Power Error-Tolerant Applications
    Afzali-Kusha, Hassan
    Akbari, Omid
    Kamal, Mehdi
    Pedram, Massoud
    PROCEEDINGS OF THE 2018 GREAT LAKES SYMPOSIUM ON VLSI (GLSVLSI'18), 2018, : 431 - 434
  • [48] X-Rel: Energy-Efficient and Low-Overhead Approximate Reliability Framework for Error-Tolerant Applications Deployed in Critical Systems
    Vafaei, Jafar
    Akbari, Omid
    Shafique, Muhammad
    Hochberger, Christian
    IEEE TRANSACTIONS ON VERY LARGE SCALE INTEGRATION (VLSI) SYSTEMS, 2023, 31 (07) : 1051 - 1064
  • [49] DNN Engine: A 28-nm Timing-Error Tolerant Sparse Deep Neural Network Processor for IoT Applications
    Whatmough, Paul N.
    Lee, Sae Kyu
    Brooks, David
    Wei, Gu-Yeon
    IEEE JOURNAL OF SOLID-STATE CIRCUITS, 2018, 53 (09) : 2722 - 2731