共 50 条
- [21] A Residual Gradient Fuzzy Reinforcement Learning Algorithm for Differential Games International Journal of Fuzzy Systems, 2017, 19 : 1058 - 1076
- [22] Robot reinforcement learning accuracy-based learning classifier systems with Fuzzy Policy Gradient descent(XCS-FPGRL) PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON ADVANCES IN MECHANICAL ENGINEERING AND INDUSTRIAL INFORMATICS, 2015, 15 : 1013 - 1018
- [24] Reinforcement Learning based on MPC and the Stochastic Policy Gradient Method 2021 AMERICAN CONTROL CONFERENCE (ACC), 2021, : 1947 - 1952
- [27] Using policy gradient reinforcement learning on autonomous robot controllers IROS 2003: PROCEEDINGS OF THE 2003 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS, VOLS 1-4, 2003, : 406 - 411
- [29] KERNEL-BASED LIFELONG POLICY GRADIENT REINFORCEMENT LEARNING 2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 3500 - 3504
- [30] Cold-Start Reinforcement Learning with Softmax Policy Gradient ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 30 (NIPS 2017), 2017, 30