Backdoor Attacks on Safe Reinforcement Learning-Enabled Cyber-Physical Systems

被引：0

作者：

Jiang, Shixiong ^{[1
]}

Liu, Mengyu ^{[1
]}

Kong, Fanxin ^{[1
]}

机构：

[1] Univ Notre Dame, Dept Comp Sci & Engn, Notre Dame, IN 46556 USA

来源：

IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS | 2024年 / 43卷 / 11期

关键词：

Training; Integrated circuits; Design automation; Navigation; Bridge circuits; Reinforcement learning; Safety; Backdoor attack; cyber-physical systems; safe reinforcement learning;

D O I：

10.1109/TCAD.2024.3447468

中图分类号：

TP3 [计算技术、计算机技术];

学科分类号：

0812 ;

摘要：

Safe reinforcement learning (RL) aims to derive a control policy that navigates a safety-critical system while avoiding unsafe explorations and adhering to safety constraints. While safe RL has been extensively studied, its vulnerabilities during the policy training have barely been explored in an adversarial setting. This article bridges this gap and investigates the training time vulnerability of formal language-guided safe RL. Such vulnerability allows a malicious adversary to inject backdoor behavior into the learned control policy. First, we formally define backdoor attacks for safe RL and divide them into active and passive ones depending on whether to manipulate the observation. Second, we propose two novel algorithms to synthesize the two kinds of attacks, respectively. Both algorithms generate backdoor behaviors that may go unnoticed after deployment but can be triggered when specific states are reached, leading to safety violations. Finally, we conduct both theoretical analysis and extensive experiments to show the effectiveness and stealthiness of our methods.

引用

页码：4093 / 4104

页数：12

共 50 条

[31] Reinforcement Learning for Cyber-Physical Security Assessment of Power Systems
Liu, Xiaorui
Konstantinou, Charalambos
2019 IEEE MILAN POWERTECH, 2019,
[32] Adversarial Learning of Robust and Safe Controllers for Cyber-Physical Systems
Bortolussi, Luca
Cairoli, Francesca
Carbone, Ginevra
Franchina, Francesco
Regolin, Enrico
IFAC PAPERSONLINE, 2021, 54 (05): : 223 - 228
[33] Falsification of Cyber-Physical Systems Using Deep Reinforcement Learning
Akazaki, Takumi
Liu, Shuang
Yamagata, Yoriyuki
Duan, Yihai
Hao, Jianye
FORMAL METHODS, 2018, 10951 : 456 - 465
[34] Deep Reinforcement Learning for Penetration Testing of Cyber-Physical Attacks in the Smart Grid
Li, Yuanliang
Yan, Jun
Naili, Mohamed
2022 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2022,
[35] Falsification of Cyber-Physical Systems Using Deep Reinforcement Learning
Yamagata, Yoriyuki
Liu, Shuang
Akazaki, Takumi
Duan, Yihai
Hao, Jianye
IEEE TRANSACTIONS ON SOFTWARE ENGINEERING, 2021, 47 (12) : 2823 - 2840
[36] Safe and secure cyber-physical systems
Biro, Miklos
Mashkoor, Atif
Sametinger, Johannes
JOURNAL OF SOFTWARE-EVOLUTION AND PROCESS, 2021, 33 (09)
[37] Policy Selection and Scheduling of Cyber-Physical Systems with Denial-of-Service Attacks via Reinforcement Learning
Jin, Zengwang
Li, Qian
Zhang, Huixiang
Liu, Zhiqiang
Wang, Zhen
JOURNAL OF ADVANCED COMPUTATIONAL INTELLIGENCE AND INTELLIGENT INFORMATICS, 2024, 28 (04) : 962 - 973
[38] Improved control of cyber-physical systems subject to cyber and physical attacks
Mahmoud M.S.
Hamdan M.M.
Cyber-Physical Systems, 2019, 5 (03) : 173 - 190
[39] Detecting cyber-physical attacks in CyberManufacturing systems with machine learning methods
Mingtao Wu
Zhengyi Song
Young B. Moon
Journal of Intelligent Manufacturing, 2019, 30 : 1111 - 1123
[40] Detecting cyber-physical attacks in CyberManufacturing systems with machine learning methods
Wu, Mingtao
Song, Zhengyi
Moon, Young B.
JOURNAL OF INTELLIGENT MANUFACTURING, 2019, 30 (03) : 1111 - 1123

← 1 2 3 4 5 →