Backdoor Attacks on Safe Reinforcement Learning-Enabled Cyber-Physical Systems

被引：0

作者：

Jiang, Shixiong ^{[1
]}

Liu, Mengyu ^{[1
]}

Kong, Fanxin ^{[1
]}

机构：

[1] Univ Notre Dame, Dept Comp Sci & Engn, Notre Dame, IN 46556 USA

来源：

IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS | 2024年 / 43卷 / 11期

关键词：

Training; Integrated circuits; Design automation; Navigation; Bridge circuits; Reinforcement learning; Safety; Backdoor attack; cyber-physical systems; safe reinforcement learning;

D O I：

10.1109/TCAD.2024.3447468

中图分类号：

TP3 [计算技术、计算机技术];

学科分类号：

0812 ;

摘要：

Safe reinforcement learning (RL) aims to derive a control policy that navigates a safety-critical system while avoiding unsafe explorations and adhering to safety constraints. While safe RL has been extensively studied, its vulnerabilities during the policy training have barely been explored in an adversarial setting. This article bridges this gap and investigates the training time vulnerability of formal language-guided safe RL. Such vulnerability allows a malicious adversary to inject backdoor behavior into the learned control policy. First, we formally define backdoor attacks for safe RL and divide them into active and passive ones depending on whether to manipulate the observation. Second, we propose two novel algorithms to synthesize the two kinds of attacks, respectively. Both algorithms generate backdoor behaviors that may go unnoticed after deployment but can be triggered when specific states are reached, leading to safety violations. Finally, we conduct both theoretical analysis and extensive experiments to show the effectiveness and stealthiness of our methods.

引用

页码：4093 / 4104

页数：12

共 50 条

[1] Causal Repair of Learning-Enabled Cyber-Physical Systems
Lu, Pengyuan
Ruchkin, Ivan
Cleaveland, Matthew
Sokolsky, Oleg
Lee, Insup
2023 IEEE INTERNATIONAL CONFERENCE ON ASSURED AUTONOMY, ICAA, 2023, : 1 - 10
[2] Verification Approaches for Learning-Enabled Autonomous Cyber-Physical Systems
Tran, Hoang-Dung
Xiang, Weiming
Johnson, Taylor T.
IEEE DESIGN & TEST, 2022, 39 (01) : 24 - 34
[3] Vulnerability Analysis for Safe Reinforcement Learning in Cyber-Physical Systems
Jiang, Shixiong
Li, Mengyu
Kong, Fanxin
PROCEEDINGS 15TH ACM/IEEE INTERNATIONAL CONFERENCE ON CYBER-PHYSICAL SYSTEMS, ICCPS 2024, 2024, : 77 - 86
[4] Curating Naturally Adversarial Datasets for Learning-Enabled Medical Cyber-Physical Systems
Pugh, Sydney
Ruchkin, Ivan
Weimer, James
Lee, Insup
PROCEEDINGS 15TH ACM/IEEE INTERNATIONAL CONFERENCE ON CYBER-PHYSICAL SYSTEMS, ICCPS 2024, 2024, : 212 - 223
[5] INVITED: Reasoning about Safety of Learning-Enabled Components in Autonomous Cyber-physical Systems
Tuncali, Cumhur Erkan
Kapinski, James
Ito, Hisahiro
Deshmukh, Jyotirmoy V.
2018 55TH ACM/ESDA/IEEE DESIGN AUTOMATION CONFERENCE (DAC), 2018,
[6] Ensemble Learning-Enabled Security Anomaly Identification for IoT Cyber-Physical Power Systems
Zhao, Hongjun
Li, Changjun
Yin, Xin
Li, Xiujun
Zhou, Rui
Fu, Rong
ELECTRONICS, 2022, 11 (23)
[7] Double Deep Q-Network Next-Generation Cyber-Physical Systems: A Reinforcement Learning-Enabled Anomaly Detection Framework for Next-Generation Cyber-Physical Systems
Zhang, Yinjun
Jamjoom, Mona
Ullah, Zahid
ELECTRONICS, 2023, 12 (17)
[8] Reinforcement Learning Solution for Cyber-Physical Systems Security Against Replay Attacks
Yu, Yan
Yang, Wen
Ding, Wenjie
Zhou, Jiayu
IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2023, 18 : 2583 - 2595
[9] Falsification of Cyber-Physical Systems with Reinforcement Learning
Kato, Koki
Ishikawa, Fuyuki
Honiden, Shinichi
2018 IEEE 3RD WORKSHOP ON MONITORING AND TESTING OF CYBER-PHYSICAL SYSTEMS (MT-CPS 2018), 2018, : 5 - 6
[10] Testing Learning-Enabled Cyber-Physical Systems with Large-Language Models: A Formal Approach
Zheng, Xi
Mok, Aloysius K.
Piskac, Ruzica
Lee, Yong Jae
Krishnamachari, Bhaskar
Zhu, Dakai
Sokolsky, Oleg
Lee, Insup
COMPANION PROCEEDINGS OF THE 32ND ACM INTERNATIONAL CONFERENCE ON THE FOUNDATIONS OF SOFTWARE ENGINEERING, FSE COMPANION 2024, 2024, : 467 - 471

← 1 2 3 4 5 →