共 50 条
- [1] Regret Minimization Experience Replay in Off-Policy Reinforcement Learning [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
- [3] Enhanced Off-Policy Reinforcement Learning With Focused Experience Replay [J]. IEEE ACCESS, 2021, 9 : 93152 - 93164
- [4] Research on Experience Replay of Off-policy Deep Reinforcement Learning: A Review [J]. Zidonghua Xuebao/Acta Automatica Sinica, 2023, 49 (11): : 2237 - 2256
- [5] Re-attentive experience replay in off-policy reinforcement learning [J]. Machine Learning, 2024, 113 : 2327 - 2349
- [6] Re-attentive experience replay in off-policy reinforcement learning [J]. MACHINE LEARNING, 2024, 113 (05) : 2327 - 2349
- [7] High-Value Prioritized Experience Replay for Off-policy Reinforcement Learning [J]. 2019 IEEE 31ST INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI 2019), 2019, : 1510 - 1514
- [9] Safe and efficient off-policy reinforcement learning [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 29 (NIPS 2016), 2016, 29
- [10] Bounds for Off-policy Prediction in Reinforcement Learning [J]. 2017 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2017, : 3991 - 3997