Backdoor Attacks on Safe Reinforcement Learning-Enabled Cyber-Physical Systems

被引:0
|
作者
Jiang, Shixiong [1 ]
Liu, Mengyu [1 ]
Kong, Fanxin [1 ]
机构
[1] Univ Notre Dame, Dept Comp Sci & Engn, Notre Dame, IN 46556 USA
关键词
Training; Integrated circuits; Design automation; Navigation; Bridge circuits; Reinforcement learning; Safety; Backdoor attack; cyber-physical systems; safe reinforcement learning;
D O I
10.1109/TCAD.2024.3447468
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Safe reinforcement learning (RL) aims to derive a control policy that navigates a safety-critical system while avoiding unsafe explorations and adhering to safety constraints. While safe RL has been extensively studied, its vulnerabilities during the policy training have barely been explored in an adversarial setting. This article bridges this gap and investigates the training time vulnerability of formal language-guided safe RL. Such vulnerability allows a malicious adversary to inject backdoor behavior into the learned control policy. First, we formally define backdoor attacks for safe RL and divide them into active and passive ones depending on whether to manipulate the observation. Second, we propose two novel algorithms to synthesize the two kinds of attacks, respectively. Both algorithms generate backdoor behaviors that may go unnoticed after deployment but can be triggered when specific states are reached, leading to safety violations. Finally, we conduct both theoretical analysis and extensive experiments to show the effectiveness and stealthiness of our methods.
引用
收藏
页码:4093 / 4104
页数:12
相关论文
共 50 条
  • [1] Causal Repair of Learning-Enabled Cyber-Physical Systems
    Lu, Pengyuan
    Ruchkin, Ivan
    Cleaveland, Matthew
    Sokolsky, Oleg
    Lee, Insup
    2023 IEEE INTERNATIONAL CONFERENCE ON ASSURED AUTONOMY, ICAA, 2023, : 1 - 10
  • [2] Verification Approaches for Learning-Enabled Autonomous Cyber-Physical Systems
    Tran, Hoang-Dung
    Xiang, Weiming
    Johnson, Taylor T.
    IEEE DESIGN & TEST, 2022, 39 (01) : 24 - 34
  • [3] Vulnerability Analysis for Safe Reinforcement Learning in Cyber-Physical Systems
    Jiang, Shixiong
    Li, Mengyu
    Kong, Fanxin
    PROCEEDINGS 15TH ACM/IEEE INTERNATIONAL CONFERENCE ON CYBER-PHYSICAL SYSTEMS, ICCPS 2024, 2024, : 77 - 86
  • [4] Curating Naturally Adversarial Datasets for Learning-Enabled Medical Cyber-Physical Systems
    Pugh, Sydney
    Ruchkin, Ivan
    Weimer, James
    Lee, Insup
    PROCEEDINGS 15TH ACM/IEEE INTERNATIONAL CONFERENCE ON CYBER-PHYSICAL SYSTEMS, ICCPS 2024, 2024, : 212 - 223
  • [5] INVITED: Reasoning about Safety of Learning-Enabled Components in Autonomous Cyber-physical Systems
    Tuncali, Cumhur Erkan
    Kapinski, James
    Ito, Hisahiro
    Deshmukh, Jyotirmoy V.
    2018 55TH ACM/ESDA/IEEE DESIGN AUTOMATION CONFERENCE (DAC), 2018,
  • [6] Ensemble Learning-Enabled Security Anomaly Identification for IoT Cyber-Physical Power Systems
    Zhao, Hongjun
    Li, Changjun
    Yin, Xin
    Li, Xiujun
    Zhou, Rui
    Fu, Rong
    ELECTRONICS, 2022, 11 (23)
  • [7] Double Deep Q-Network Next-Generation Cyber-Physical Systems: A Reinforcement Learning-Enabled Anomaly Detection Framework for Next-Generation Cyber-Physical Systems
    Zhang, Yinjun
    Jamjoom, Mona
    Ullah, Zahid
    ELECTRONICS, 2023, 12 (17)
  • [8] Reinforcement Learning Solution for Cyber-Physical Systems Security Against Replay Attacks
    Yu, Yan
    Yang, Wen
    Ding, Wenjie
    Zhou, Jiayu
    IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2023, 18 : 2583 - 2595
  • [9] Falsification of Cyber-Physical Systems with Reinforcement Learning
    Kato, Koki
    Ishikawa, Fuyuki
    Honiden, Shinichi
    2018 IEEE 3RD WORKSHOP ON MONITORING AND TESTING OF CYBER-PHYSICAL SYSTEMS (MT-CPS 2018), 2018, : 5 - 6
  • [10] Testing Learning-Enabled Cyber-Physical Systems with Large-Language Models: A Formal Approach
    Zheng, Xi
    Mok, Aloysius K.
    Piskac, Ruzica
    Lee, Yong Jae
    Krishnamachari, Bhaskar
    Zhu, Dakai
    Sokolsky, Oleg
    Lee, Insup
    COMPANION PROCEEDINGS OF THE 32ND ACM INTERNATIONAL CONFERENCE ON THE FOUNDATIONS OF SOFTWARE ENGINEERING, FSE COMPANION 2024, 2024, : 467 - 471