Privacy preservation in deep reinforcement learning: A training perspective

被引：0

作者：

Shen, Sheng ^{[1
]}

Ye, Dayong ^{[2
]}

Zhu, Tianqing ^{[3
]}

Zhou, Wanlei ^{[3
]}

机构：

[1] Torrens Univ Australia, Ctr Artificial Intelligence Res & Optimizat AIRO, Design & Creat Technol vert, Ultimo, NSW 2007, Australia

[2] Univ Technol Sydney, Sch Comp Sci, Ultimo, NSW 2007, Australia

[3] City Univ Macau, Fac Data Sci, Taipa 999078, Macao, Peoples R China

来源：

KNOWLEDGE-BASED SYSTEMS | 2024年 / 304卷

基金：

澳大利亚研究理事会;

关键词：

Reinforcement learning; Deep reinforcement learning; Privacy preservation; Differential privacy;

D O I：

10.1016/j.knosys.2024.112558

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Reinforcement learning (RL) is a principled AI framework for autonomous, experience-driven learning. Deep reinforcement learning (DRL) enhances this by incorporating deep learning models, promoting a higher-level understanding of the visual world. However, privacy concerns are emerging in RL applications that involve vast amounts of private information. Recent studies have demonstrated that DRL can leak private information and be vulnerable to attacks aiming to infer the training environment from an agent's behaviors without direct access to the environment. To address these privacy concerns, we propose a differentially private DRL approach that obfuscates the agent's observations from each visited state. This defends against privacy leakage attacks and prevents the inference of the agent's training environment from its optimized policy. We provide a theoretical analysis and design comprehensive experiments to thoroughly reproduce the privacy leakage attack. Both the theoretical analysis and experimental results demonstrate that our method effectively defends against privacy leakage attacks while maintaining the model utility of the RL agent.

引用

页数：12

共 50 条

[31] Collaborative Optimization Strategy of Distributed Generators Based on Federated Reinforcement Learning for Privacy Preservation
Pu T.
Du S.
Li Y.
Wang X.
Dianli Xitong Zidonghua/Automation of Electric Power Systems, 2023, 47 (08): : 62 - 70
[32] Privacy preservation for machine learning training and classification based on homomorphic encryption schemes
Li, Jing
Kuang, Xiaohui
Lin, Shujie
Ma, Xu
Tang, Yi
INFORMATION SCIENCES, 2020, 526 : 166 - 179
[33] An investigation of privacy preservation in deep learning-based eye-tracking
Seyedi, Salman
Jiang, Zifan
Levey, Allan
Clifford, Gari D.
BIOMEDICAL ENGINEERING ONLINE, 2022, 21 (01)
[34] Deep Reinforcement Learning Framework for COVID Therapy: A Research Perspective
Jacob, Shomona Gracia
Sulaiman, Majdi Mohammed Bait Ali
Bennet, Bensujin
CURRENT BIOINFORMATICS, 2022, 17 (05) : 393 - 395
[35] An Online Training Method for Augmenting MPC with Deep Reinforcement Learning
Bellegarda, Guillaume
Byl, Katie
2020 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2020, : 5453 - 5459
[36] An investigation of privacy preservation in deep learning-based eye-tracking
Salman Seyedi
Zifan Jiang
Allan Levey
Gari D. Clifford
BioMedical Engineering OnLine, 21
[37] A Data-Efficient Training Method for Deep Reinforcement Learning
Feng, Wenhui
Han, Chongzhao
Lian, Feng
Liu, Xia
ELECTRONICS, 2022, 11 (24)
[38] Stabilizing Deep Reinforcement Learning Model Training for Video Conferencing
Ryu, Sangwoo
Ko, Kyungchan
Hong, James Won-Ki
2022 23RD ASIA-PACIFIC NETWORK OPERATIONS AND MANAGEMENT SYMPOSIUM (APNOMS 2022), 2022, : 97 - 102
[39] Homomorphic Encryption-Based Federated Privacy Preservation for Deep Active Learning
Kurniawan, Hendra
Mambo, Masahiro
ENTROPY, 2022, 24 (11)
[40] A survey on deep learning and deep reinforcement learning in robotics with a tutorial on deep reinforcement learning
Morales, Eduardo F.
Murrieta-Cid, Rafael
Becerra, Israel
Esquivel-Basaldua, Marco A.
INTELLIGENT SERVICE ROBOTICS, 2021, 14 (05) : 773 - 805

← 1 2 3 4 5 →