共 50 条
- [1] Regularized Anderson Acceleration for Off-Policy Deep Reinforcement Learning ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32
- [2] Safe Off-policy Reinforcement Learning Using Barrier Functions 2020 AMERICAN CONTROL CONFERENCE (ACC), 2020, : 2176 - 2181
- [3] Safe and efficient off-policy reinforcement learning ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 29 (NIPS 2016), 2016, 29
- [4] Bounds for Off-policy Prediction in Reinforcement Learning 2017 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2017, : 3991 - 3997
- [6] Off-Policy Reinforcement Learning with Delayed Rewards INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 162, 2022,
- [7] Representations for Stable Off-Policy Reinforcement Learning INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 119, 2020, 119
- [9] On the Reuse Bias in Off-Policy Reinforcement Learning PROCEEDINGS OF THE THIRTY-SECOND INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2023, 2023, : 4513 - 4521
- [10] A perspective on off-policy evaluation in reinforcement learning Frontiers of Computer Science, 2019, 13 : 911 - 912