Offline Reinforcement Learning with Differential Privacy

被引:0
|
作者
Qiao, Dan [1 ]
Wang, Yu-Xiang [1 ]
机构
[1] UC Santa Barbara, Dept Comp Sci, Santa Barbara, CA 93106 USA
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The offline reinforcement learning (RL) problem is often motivated by the need to learn data-driven decision policies in financial, legal and healthcare applications. However, the learned policy could retain sensitive information of individuals in the training data (e.g., treatment and outcome of patients), thus susceptible to various privacy risks. We design offline RL algorithms with differential privacy guarantees which provably prevent such risks. These algorithms also enjoy strong instance-dependent learning bounds under both tabular and linear Markov Decision Process (MDP) settings. Our theory and simulation suggest that the privacy guarantee comes at (almost) no drop in utility comparing to the non-private counterpart for a medium-size dataset.
引用
收藏
页数:42
相关论文
共 50 条
  • [41] Efficient Offline Reinforcement Learning With Relaxed Conservatism
    Huang, Longyang
    Dong, Botao
    Zhang, Weidong
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2024, 46 (08) : 5260 - 5272
  • [42] Implicit policy constraint for offline reinforcement learning
    Peng, Zhiyong
    Liu, Yadong
    Han, Changlin
    Zhou, Zongtan
    CAAI TRANSACTIONS ON INTELLIGENCE TECHNOLOGY, 2024, 9 (04) : 973 - 981
  • [43] False Correlation Reduction for Offline Reinforcement Learning
    Deng, Zhihong
    Fu, Zuyue
    Wang, Lingxiao
    Yang, Zhuoran
    Bai, Chenjia
    Zhou, Tianyi
    Wang, Zhaoran
    Jiang, Jing
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2024, 46 (02) : 1199 - 1211
  • [44] Efficient Diffusion Policies for Offline Reinforcement Learning
    Kang, Bingyi
    Ma, Xiao
    Du, Chao
    Pang, Tianyu
    Yan, Shuicheng
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [45] State Deviation Correction for Offline Reinforcement Learning
    Zhang, Hongchang
    Shao, Jianzhun
    Jiang, Yuhang
    He, Shuncheng
    Zhang, Guanwen
    Ji, Xiangyang
    THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / TWELVETH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, : 9022 - 9030
  • [46] Percentile Criterion Optimization in Offline Reinforcement Learning
    Lobo, Elita A.
    Cousins, Cyrus
    Zick, Yair
    Petrik, Marek
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [47] Efficient Online Reinforcement Learning with Offline Data
    Ball, Philip J.
    Smith, Laura
    Kostrikov, Ilya
    Levine, Sergey
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 202, 2023, 202
  • [48] Offline Reinforcement Learning With Behavior Value Regularization
    Huang, Longyang
    Dong, Botao
    Xie, Wei
    Zhang, Weidong
    IEEE TRANSACTIONS ON CYBERNETICS, 2024, 54 (06) : 3692 - 3704
  • [49] Corruption-Robust Offline Reinforcement Learning
    Zhang, Xuezhou
    Chen, Yiding
    Zhu, Jerry
    Sun, Wen
    INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 151, 2022, 151 : 5757 - 5773
  • [50] Offline Quantum Reinforcement Learning in a Conservative Manner
    Cheng, Zhihao
    Zhang, Kaining
    Shen, Li
    Tao, Dacheng
    THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 6, 2023, : 7148 - 7156