Offline Reinforcement Learning with Differential Privacy

Cited: 0
Authors
Qiao, Dan [1]
Wang, Yu-Xiang [1]
Affiliations
[1] UC Santa Barbara, Dept Comp Sci, Santa Barbara, CA 93106 USA
Keywords
DOI
None available
CLC number
TP18 [Artificial Intelligence Theory];
Subject classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
The offline reinforcement learning (RL) problem is often motivated by the need to learn data-driven decision policies in financial, legal, and healthcare applications. However, the learned policy could retain sensitive information about individuals in the training data (e.g., the treatment and outcome of patients), and is thus susceptible to various privacy risks. We design offline RL algorithms with differential privacy guarantees that provably prevent such risks. These algorithms also enjoy strong instance-dependent learning bounds under both tabular and linear Markov Decision Process (MDP) settings. Our theory and simulations suggest that the privacy guarantee comes at (almost) no drop in utility compared to the non-private counterpart for a medium-sized dataset.
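To make the privacy mechanism concrete, the sketch below shows one standard ingredient used in differentially private tabular RL: releasing visitation counts through the Gaussian mechanism before forming empirical transition estimates. This is a minimal illustration, not the authors' algorithm; the unit-sensitivity assumption, the even epsilon/delta split, and the function names are simplifications introduced here.

```python
import numpy as np


def privatize_counts(counts, epsilon, delta, sensitivity=1.0, rng=None):
    """Release counts via the Gaussian mechanism (Dwork & Roth calibration).

    Assumption (simplification): each count has L2 sensitivity `sensitivity`
    with respect to adding/removing one trajectory; a real analysis would
    account for a trajectory touching up to H state-action pairs.
    """
    rng = np.random.default_rng() if rng is None else rng
    sigma = sensitivity * np.sqrt(2.0 * np.log(1.25 / delta)) / epsilon
    noisy = counts + rng.normal(0.0, sigma, size=counts.shape)
    # Clip below at 1 so downstream ratios stay well-defined and positive.
    return np.maximum(noisy, 1.0)


def private_transition_estimate(sa_counts, sas_counts, epsilon, delta, rng=None):
    """Estimate P(s'|s,a) from privatized counts.

    Splits the budget evenly between numerator and denominator (basic
    composition); tighter accounting is possible but omitted here.
    """
    n_sa = privatize_counts(sa_counts, epsilon / 2, delta / 2, rng=rng)
    n_sas = privatize_counts(sas_counts, epsilon / 2, delta / 2, rng=rng)
    p_hat = n_sas / n_sa[..., None]
    # Renormalize each row into a valid probability distribution.
    return p_hat / p_hat.sum(axis=-1, keepdims=True)
```

With noise scale O(1/epsilon) added to counts of size n, the perturbation to the transition estimates shrinks as the dataset grows, which is the intuition behind the "(almost) no drop in utility" claim for medium-sized datasets.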
Pages: 42
Related papers
50 items in total
  • [31] Mutual Information Regularized Offline Reinforcement Learning
    Ma, Xiao
    Kang, Bingyi
    Xu, Zhongwen
    Lin, Min
    Yan, Shuicheng
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [32] Revisiting the Minimalist Approach to Offline Reinforcement Learning
    Tarasov, Denis
    Kurenkov, Vladislav
    Nikulin, Alexander
    Kolesnikov, Sergey
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [33] Bellman Residual Orthogonalization for Offline Reinforcement Learning
    Zanette, Andrea
    Wainwright, Martin J.
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35, NEURIPS 2022, 2022,
  • [34] Discrete Uncertainty Quantification For Offline Reinforcement Learning
    Perez, Jose Luis
    Corrochano, Javier
    Garcia, Javier
    Majadas, Ruben
    Ibanez-Llano, Cristina
    Perez, Sergio
    Fernandez, Fernando
    JOURNAL OF ARTIFICIAL INTELLIGENCE AND SOFT COMPUTING RESEARCH, 2023, 13 (04) : 273 - 287
  • [35] Supported Value Regularization for Offline Reinforcement Learning
    Mao, Yixiu
    Zhang, Hongchang
    Chen, Chen
    Xu, Yi
    Ji, Xiangyang
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [36] Supported Policy Optimization for Offline Reinforcement Learning
    Wu, Jialong
    Wu, Haixu
    Qiu, Zihan
    Wang, Jianmin
    Long, Mingsheng
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35, NEURIPS 2022, 2022,
  • [37] Offline Reinforcement Learning for Automated Stock Trading
    Lee, Namyeong
    Moon, Jun
    IEEE ACCESS, 2023, 11 : 112577 - 112589
  • [38] On the Role of Discount Factor in Offline Reinforcement Learning
    Hu, Hao
    Yang, Yiqing
    Zhao, Qianchuan
    Zhang, Chongjie
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 162, 2022,
  • [39] Offline Evaluation of Online Reinforcement Learning Algorithms
    Mandel, Travis
    Liu, Yun-En
    Brunskill, Emma
    Popovic, Zoran
    THIRTIETH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2016, : 1926 - 1933
  • [40] Federated Offline Reinforcement Learning With Multimodal Data
    Wen, Jiabao
    Dai, Huiao
    He, Jingyi
    Xi, Meng
    Xiao, Shuai
    Yang, Jiachen
    IEEE TRANSACTIONS ON CONSUMER ELECTRONICS, 2024, 70 (01) : 4266 - 4276