50 entries in total
- [21] A Multi-Agent Off-Policy Actor-Critic Algorithm for Distributed Reinforcement Learning. IFAC PapersOnLine, 2020, 53(2): 1549-1554.
- [23] Finite-Sample Analysis of Off-Policy Natural Actor-Critic With Linear Function Approximation. IEEE Control Systems Letters, 2022, 6: 2611-2616.
- [27] Episode-Experience Replay Based Tree-Backup Method for Off-Policy Actor-Critic Algorithm. Pattern Recognition and Computer Vision (PRCV 2018), Part I, 2018, 11256: 562-573.
- [28] Max-Min Off-Policy Actor-Critic Method Focusing on Worst-Case Robustness to Model Misspecification. Advances in Neural Information Processing Systems 35 (NeurIPS 2022), 2022.