A Software-Only Approach to Enable Diverse Redundancy on Intel GPUs for Safety-Related Kernels

Cited by: 0
Authors
Andriotis, Nikolaos [1, 2]
Serrano-Cases, Alejandro [1]
Alcaide, Sergi [1, 2]
Abella, Jaume [1]
Cazorla, Francisco J. [1]
Peng, Yang [3]
Baldovin, Andrea [3]
Paulitsch, Michael [3]
Tsymbal, Vladimir [3]
Affiliations
[1] Barcelona Supercomp Ctr BSC, Barcelona, Spain
[2] Univ Politecn Catalunya UPC, Barcelona, Spain
[3] Intel Corp, Neubiberg, Germany
Keywords
redundancy; diversity; safety; GPU
DOI
10.1145/3555776.3577610
Chinese Library Classification (CLC) number
TP39 [Computer applications]
Discipline classification codes
081203; 0835
Abstract
Autonomous Driving (AD) systems rely on object detection and tracking algorithms that require processing high volumes of data at high frequency. High-performance graphics processing units (GPUs) have been shown to provide the required computing performance. AD also carries functional safety requirements such as diverse redundancy for critical software tasks like object detection. This implies that software must be executed redundantly (in a single GPU for efficiency reasons), and with some form of diversity so that a single fault does not cause the same error in both redundant executions. Unfortunately, high-performance GPUs lack explicit hardware means for diverse redundancy, and software-based solutions with limited guarantees have only been provided for NVIDIA GPUs. This paper presents a software-only solution to enable diverse redundancy on Intel GPUs achieving, for the first time, strong guarantees on the diversity provided. By smartly tailoring workload geometry and managing workload allocation to execution units with thread-level wrappers, we guarantee that redundant threads use physically diverse execution units, hence meeting diverse redundancy requirements with affordable performance overheads.
Pages: 451-460
Page count: 10
Related papers
4 items in total
  • [1] Software-only Diverse Redundancy on GPUs for Autonomous Driving Platforms
    Alcaide, Sergi
    Kosmidis, Leonidas
    Hernandez, Carles
    Abella, Jaume
    2019 IEEE 25TH INTERNATIONAL SYMPOSIUM ON ON-LINE TESTING AND ROBUST SYSTEM DESIGN (IOLTS 2019), 2019: 90-96
  • [2] Software-only Triple Diverse Redundancy on GPUs for Autonomous Driving Platforms
    Alcaide, Sergi
    Kosmidis, Leonidas
    Hernandez, Carles
    Abella, Jaume
    2020 50TH ANNUAL IEEE-IFIP INTERNATIONAL CONFERENCE ON DEPENDABLE SYSTEMS AND NETWORKS-SUPPLEMENTAL VOLUME (DSN-S), 2020: 82-88
  • [3] Software-only based Diverse Redundancy for ASIL-D Automotive Applications on Embedded HPC Platforms
    Alcaide, Sergi
    Kosmidis, Leonidas
    Hernandez, Carles
    Abella, Jaume
    2020 33RD IEEE INTERNATIONAL SYMPOSIUM ON DEFECT AND FAULT TOLERANCE IN VLSI AND NANOTECHNOLOGY SYSTEMS (DFT), 2020
  • [4] An Approach of Creating Component Design Specification for Safety-Related Software in Railway
    Chen, Yin
    Linder, Sven
    Wigstein, Jonas
    2019 ANNUAL RELIABILITY AND MAINTAINABILITY SYMPOSIUM (RAMS 2019) - R & M IN THE SECOND MACHINE AGE - THE CHALLENGE OF CYBER PHYSICAL SYSTEMS, 2019