共 50 条
- [1] Off-policy and on-policy reinforcement learning with the Tsetlin machine Applied Intelligence, 2023, 53 : 8596 - 8613
- [5] Two-player nonlinear Stackelberg differential game via off-policy integral reinforcement learning JOURNAL OF THE FRANKLIN INSTITUTE-ENGINEERING AND APPLIED MATHEMATICS, 2024, 361 (08):
- [6] Interpolated Policy Gradient: Merging On-Policy and Off-Policy Gradient Estimation for Deep Reinforcement Learning Shixiang ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 30 (NIPS 2017), 2017, 30
- [7] H∞ Control for Discrete-Time Multi-Player Systems via Off-Policy Q-Learning IEEE ACCESS, 2020, 8 (08): : 28831 - 28846
- [8] Off-Policy Q-Learning for Anti-Interference Control of Multi-Player Systems IFAC PAPERSONLINE, 2020, 53 (02): : 9189 - 9194
- [9] Discrete-Time Multi-Player Games Based on Off-Policy Q-Learning IEEE ACCESS, 2019, 7 : 134647 - 134659