50 items in total
- [1] Off-Policy Deep Reinforcement Learning without Exploration INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 97, 2019, 97
- [2] Off-Policy Deep Reinforcement Learning by Bootstrapping the Covariate Shift THIRTY-THIRD AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FIRST INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / NINTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2019: 3647-3655
- [3] Regularized Anderson Acceleration for Off-Policy Deep Reinforcement Learning ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32
- [4] Stabilizing Off-Policy Deep Reinforcement Learning from Pixels INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 162, 2022
- [5] Trajectory-Based Off-Policy Deep Reinforcement Learning INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 97, 2019, 97
- [6] Research on Experience Replay of Off-policy Deep Reinforcement Learning: A Review Zidonghua Xuebao/Acta Automatica Sinica, 2023, 49 (11): 2237-2256
- [8] Exploration with Multiple Random ε-Buffers in Off-Policy Deep Reinforcement Learning SYMMETRY-BASEL, 2019, 11 (11)
- [10] Safe and efficient off-policy reinforcement learning ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 29 (NIPS 2016), 2016, 29