Neutrons Sensitivity of Deep Reinforcement Learning Policies on EdgeAI Accelerators

Cited by: 1
Authors
Bodmann, Pablo R. [1]
Saveriano, Matteo [2]
Kritikakou, Angeliki [3]
Rech, Paolo [2]
Affiliations
[1] Univ Fed Rio Grande do Sul, Informat Inst, BR-91501970 Porto Alegre, Brazil
[2] Univ Trento, Dept Ind Engn, I-38123 Trento, Italy
[3] INRIA, F-35042 Rennes, France
Keywords
Robots; Reliability; Neutrons; Particle beams; Internet; Transient analysis; Task analysis; Artificial intelligence; EdgeAI; reinforcement learning (RL); reliability; ROBOT; SAFETY;
DOI
10.1109/TNS.2024.3387087
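Chinese Library Classification (CLC)
TM [Electrical Engineering]; TN [Electronics and Communication Technology];
Discipline classification code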
0808 ; 0809 ;
Abstract
Autonomous robots and their applications are becoming popular in several different fields, including tasks where robots closely interact with humans. Therefore, the reliability of computation must be paramount. In this work, we measure the reliability of Google's Coral Edge tensor processing unit (TPU) executing three deep reinforcement learning (DRL) models through an accelerated neutron beam. We experimentally collect data that, when scaled to the natural neutron flux, account for more than 5 million years. Based on our extensive evaluation, we quantify and qualify the radiation-induced corruption on the correctness of DRL. Crucially, our data show that the Edge TPU executing DRL has an error rate that is up to 18 times higher than the limit imposed by international reliability standards. We found that, despite the feedback and intrinsic redundancy of DRL, the propagation of the fault induces the model to fail in the vast majority of cases, or the model manages to finish but reports wrong metrics (i.e., speed, final position, and reward). We provide insights on how radiation corrupts the model, on how the fault propagates in the computation, and on the failure characteristics of the controlled robot.
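The abstract's scaling from beam exposure to years of natural exposure follows the usual accelerated-testing arithmetic. The sketch below is illustrative only and is not taken from the paper: the function names are hypothetical, and the reference flux of about 13 n/(cm^2 h) for high-energy neutrons at sea level is an assumed value (the figure commonly quoted from JEDEC JESD89A). The paper's own fluence and error counts are not given in this record, so none are used here.

```python
# Illustrative sketch (not the authors' code): converting accelerated-beam
# results into natural-exposure equivalents.

# ASSUMPTION: reference sea-level flux of >10 MeV neutrons, about 13 n/(cm^2 h).
NATURAL_FLUX_N_PER_CM2_H = 13.0
HOURS_PER_YEAR = 24 * 365.25


def equivalent_natural_years(fluence_n_per_cm2: float) -> float:
    """Years of natural exposure that deliver the same fluence as the beam."""
    return fluence_n_per_cm2 / NATURAL_FLUX_N_PER_CM2_H / HOURS_PER_YEAR


def cross_section_cm2(observed_errors: int, fluence_n_per_cm2: float) -> float:
    """Error cross section: observed errors per unit of received fluence."""
    return observed_errors / fluence_n_per_cm2


def fit_rate(cross_section: float) -> float:
    """Failures In Time: expected errors per 10^9 device-hours at natural flux."""
    return cross_section * NATURAL_FLUX_N_PER_CM2_H * 1e9


if __name__ == "__main__":
    # Hypothetical inputs, chosen only to exercise the formulas.
    fluence = 1e10  # n/cm^2
    errors = 10
    sigma = cross_section_cm2(errors, fluence)
    print(f"equivalent exposure: {equivalent_natural_years(fluence):.3e} years")
    print(f"cross section: {sigma:.3e} cm^2, FIT: {fit_rate(sigma):.3f}")
```

A FIT value computed this way is what would be compared against a limit set by a reliability standard, which is the kind of comparison the abstract reports.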
Pages: 1480 - 1486
Page count: 7
Related papers
50 in total
  • [41] Example-guided learning of stochastic human driving policies using deep reinforcement learning
    Ran Emuna
    Rotem Duffney
    Avinoam Borowsky
    Armin Biess
    Neural Computing and Applications, 2023, 35 : 16791 - 16804
  • [42] Ergodic Approximate Deep Learning Accelerators
    van Lijssel, Tim
    Balatsoukas-Stimming, Alexios
    FIFTY-SEVENTH ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS & COMPUTERS, IEEECONF, 2023, : 734 - 738
  • [43] AdequateDL: Approximating Deep Learning Accelerators
    Sentieys, Olivier
    Filip, Silviu
    Briand, David
    Novo, David
    Dupuis, Etienne
    O'Connor, Ian
    Bosio, Alberto
    2021 24TH INTERNATIONAL SYMPOSIUM ON DESIGN AND DIAGNOSTICS OF ELECTRONIC CIRCUITS & SYSTEMS (DDECS), 2021, : 37 - 40
  • [44] From Reinforcement Learning to Deep Reinforcement Learning: An Overview
    Agostinelli, Forest
    Hocquet, Guillaume
    Singh, Sameer
    Baldi, Pierre
    BRAVERMAN READINGS IN MACHINE LEARNING: KEY IDEAS FROM INCEPTION TO CURRENT STATE, 2018, 11100 : 298 - 328
  • [45] Enforcing Hard State-Dependent Action Bounds on Deep Reinforcement Learning Policies
    De Cooman, Bram
    Suykens, Johan
    Ortseifen, Andreas
    MACHINE LEARNING, OPTIMIZATION, AND DATA SCIENCE, LOD 2022, PT II, 2023, 13811 : 193 - 218
  • [46] Frame-Correlation Transfers Trigger Economical Attacks on Deep Reinforcement Learning Policies
    Qu, Xinghua
    Ong, Yew-Soon
    Gupta, Abhishek
    IEEE TRANSACTIONS ON CYBERNETICS, 2022, 52 (08) : 7577 - 7590
  • [47] Using Deep Reinforcement Learning to Learn High-Level Policies on the ATRIAS Biped
    Li, Tianyu
    Geyer, Hartmut
    Atkeson, Christopher G.
    Rai, Akshara
    2019 INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2019, : 263 - 269
  • [48] Adaptable control policies for variable liquid chromatography columns using deep reinforcement learning
    Andersson, David
    Edlund, Christoffer
    Corbett, Brandon
    Sjogren, Rickard
    SCIENTIFIC REPORTS, 2023, 13 (01)
  • [49] Active Queue-Management Policies for Undersea Networking via Deep Reinforcement Learning
    Forero, Pedro A.
    Zhang, P.
    Radosevic, D.
    OCEANS 2021: SAN DIEGO - PORTO, 2021,
  • [50] Deep Reinforcement Learning Based Mobility Load Balancing Under Multiple Behavior Policies
    Xu, Yue
    Xu, Wenjun
    Wang, Zhi
    Lin, Jiaru
    Cui, Shuguang
    ICC 2019 - 2019 IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS (ICC), 2019,