Privacy preservation in deep reinforcement learning: A training perspective

被引:0
|
作者
Shen, Sheng [1 ]
Ye, Dayong [2 ]
Zhu, Tianqing [3 ]
Zhou, Wanlei [3 ]
机构
[1] Torrens Univ Australia, Ctr Artificial Intelligence Res & Optimizat AIRO, Design & Creat Technol vert, Ultimo, NSW 2007, Australia
[2] Univ Technol Sydney, Sch Comp Sci, Ultimo, NSW 2007, Australia
[3] City Univ Macau, Fac Data Sci, Taipa 999078, Macao, Peoples R China
基金
澳大利亚研究理事会;
关键词
Reinforcement learning; Deep reinforcement learning; Privacy preservation; Differential privacy;
D O I
10.1016/j.knosys.2024.112558
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Reinforcement learning (RL) is a principled AI framework for autonomous, experience-driven learning. Deep reinforcement learning (DRL) enhances this by incorporating deep learning models, promoting a higher-level understanding of the visual world. However, privacy concerns are emerging in RL applications that involve vast amounts of private information. Recent studies have demonstrated that DRL can leak private information and be vulnerable to attacks aiming to infer the training environment from an agent's behaviors without direct access to the environment. To address these privacy concerns, we propose a differentially private DRL approach that obfuscates the agent's observations from each visited state. This defends against privacy leakage attacks and prevents the inference of the agent's training environment from its optimized policy. We provide a theoretical analysis and design comprehensive experiments to thoroughly reproduce the privacy leakage attack. Both the theoretical analysis and experimental results demonstrate that our method effectively defends against privacy leakage attacks while maintaining the model utility of the RL agent.
引用
收藏
页数:12
相关论文
共 50 条
  • [31] Collaborative Optimization Strategy of Distributed Generators Based on Federated Reinforcement Learning for Privacy Preservation
    Pu T.
    Du S.
    Li Y.
    Wang X.
    Dianli Xitong Zidonghua/Automation of Electric Power Systems, 2023, 47 (08): : 62 - 70
  • [32] Privacy preservation for machine learning training and classification based on homomorphic encryption schemes
    Li, Jing
    Kuang, Xiaohui
    Lin, Shujie
    Ma, Xu
    Tang, Yi
    INFORMATION SCIENCES, 2020, 526 : 166 - 179
  • [33] An investigation of privacy preservation in deep learning-based eye-tracking
    Seyedi, Salman
    Jiang, Zifan
    Levey, Allan
    Clifford, Gari D.
    BIOMEDICAL ENGINEERING ONLINE, 2022, 21 (01)
  • [34] Deep Reinforcement Learning Framework for COVID Therapy: A Research Perspective
    Jacob, Shomona Gracia
    Sulaiman, Majdi Mohammed Bait Ali
    Bennet, Bensujin
    CURRENT BIOINFORMATICS, 2022, 17 (05) : 393 - 395
  • [35] An Online Training Method for Augmenting MPC with Deep Reinforcement Learning
    Bellegarda, Guillaume
    Byl, Katie
    2020 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2020, : 5453 - 5459
  • [36] An investigation of privacy preservation in deep learning-based eye-tracking
    Salman Seyedi
    Zifan Jiang
    Allan Levey
    Gari D. Clifford
    BioMedical Engineering OnLine, 21
  • [37] A Data-Efficient Training Method for Deep Reinforcement Learning
    Feng, Wenhui
    Han, Chongzhao
    Lian, Feng
    Liu, Xia
    ELECTRONICS, 2022, 11 (24)
  • [38] Stabilizing Deep Reinforcement Learning Model Training for Video Conferencing
    Ryu, Sangwoo
    Ko, Kyungchan
    Hong, James Won-Ki
    2022 23RD ASIA-PACIFIC NETWORK OPERATIONS AND MANAGEMENT SYMPOSIUM (APNOMS 2022), 2022, : 97 - 102
  • [39] Homomorphic Encryption-Based Federated Privacy Preservation for Deep Active Learning
    Kurniawan, Hendra
    Mambo, Masahiro
    ENTROPY, 2022, 24 (11)
  • [40] A survey on deep learning and deep reinforcement learning in robotics with a tutorial on deep reinforcement learning
    Morales, Eduardo F.
    Murrieta-Cid, Rafael
    Becerra, Israel
    Esquivel-Basaldua, Marco A.
    INTELLIGENT SERVICE ROBOTICS, 2021, 14 (05) : 773 - 805