共 50 条
- [21] Off-policy Reinforcement Learning for Robust Control of Discrete-time Uncertain Linear Systems PROCEEDINGS OF THE 36TH CHINESE CONTROL CONFERENCE (CCC 2017), 2017, : 2507 - 2512
- [27] Optimized control for human-multi-robot collaborative manipulation via multi-player Q-learning JOURNAL OF THE FRANKLIN INSTITUTE-ENGINEERING AND APPLIED MATHEMATICS, 2021, 358 (11): : 5639 - 5658
- [29] Stabilizing Off-Policy Q-Learning via Bootstrapping Error Reduction ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32