Backdoor Attacks on Safe Reinforcement Learning-Enabled Cyber-Physical Systems

被引:0
|
作者
Jiang, Shixiong [1 ]
Liu, Mengyu [1 ]
Kong, Fanxin [1 ]
机构
[1] Univ Notre Dame, Dept Comp Sci & Engn, Notre Dame, IN 46556 USA
关键词
Training; Integrated circuits; Design automation; Navigation; Bridge circuits; Reinforcement learning; Safety; Backdoor attack; cyber-physical systems; safe reinforcement learning;
D O I
10.1109/TCAD.2024.3447468
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Safe reinforcement learning (RL) aims to derive a control policy that navigates a safety-critical system while avoiding unsafe explorations and adhering to safety constraints. While safe RL has been extensively studied, its vulnerabilities during the policy training have barely been explored in an adversarial setting. This article bridges this gap and investigates the training time vulnerability of formal language-guided safe RL. Such vulnerability allows a malicious adversary to inject backdoor behavior into the learned control policy. First, we formally define backdoor attacks for safe RL and divide them into active and passive ones depending on whether to manipulate the observation. Second, we propose two novel algorithms to synthesize the two kinds of attacks, respectively. Both algorithms generate backdoor behaviors that may go unnoticed after deployment but can be triggered when specific states are reached, leading to safety violations. Finally, we conduct both theoretical analysis and extensive experiments to show the effectiveness and stealthiness of our methods.
引用
收藏
页码:4093 / 4104
页数:12
相关论文
共 50 条
  • [31] Reinforcement Learning for Cyber-Physical Security Assessment of Power Systems
    Liu, Xiaorui
    Konstantinou, Charalambos
    2019 IEEE MILAN POWERTECH, 2019,
  • [32] Adversarial Learning of Robust and Safe Controllers for Cyber-Physical Systems
    Bortolussi, Luca
    Cairoli, Francesca
    Carbone, Ginevra
    Franchina, Francesco
    Regolin, Enrico
    IFAC PAPERSONLINE, 2021, 54 (05): : 223 - 228
  • [33] Falsification of Cyber-Physical Systems Using Deep Reinforcement Learning
    Akazaki, Takumi
    Liu, Shuang
    Yamagata, Yoriyuki
    Duan, Yihai
    Hao, Jianye
    FORMAL METHODS, 2018, 10951 : 456 - 465
  • [34] Deep Reinforcement Learning for Penetration Testing of Cyber-Physical Attacks in the Smart Grid
    Li, Yuanliang
    Yan, Jun
    Naili, Mohamed
    2022 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2022,
  • [35] Falsification of Cyber-Physical Systems Using Deep Reinforcement Learning
    Yamagata, Yoriyuki
    Liu, Shuang
    Akazaki, Takumi
    Duan, Yihai
    Hao, Jianye
    IEEE TRANSACTIONS ON SOFTWARE ENGINEERING, 2021, 47 (12) : 2823 - 2840
  • [36] Safe and secure cyber-physical systems
    Biro, Miklos
    Mashkoor, Atif
    Sametinger, Johannes
    JOURNAL OF SOFTWARE-EVOLUTION AND PROCESS, 2021, 33 (09)
  • [37] Policy Selection and Scheduling of Cyber-Physical Systems with Denial-of-Service Attacks via Reinforcement Learning
    Jin, Zengwang
    Li, Qian
    Zhang, Huixiang
    Liu, Zhiqiang
    Wang, Zhen
    JOURNAL OF ADVANCED COMPUTATIONAL INTELLIGENCE AND INTELLIGENT INFORMATICS, 2024, 28 (04) : 962 - 973
  • [38] Improved control of cyber-physical systems subject to cyber and physical attacks
    Mahmoud M.S.
    Hamdan M.M.
    Cyber-Physical Systems, 2019, 5 (03) : 173 - 190
  • [39] Detecting cyber-physical attacks in CyberManufacturing systems with machine learning methods
    Mingtao Wu
    Zhengyi Song
    Young B. Moon
    Journal of Intelligent Manufacturing, 2019, 30 : 1111 - 1123
  • [40] Detecting cyber-physical attacks in CyberManufacturing systems with machine learning methods
    Wu, Mingtao
    Song, Zhengyi
    Moon, Young B.
    JOURNAL OF INTELLIGENT MANUFACTURING, 2019, 30 (03) : 1111 - 1123