共 50 条
- [22] Bayesian Residual Policy Optimization: Scalable Bayesian Reinforcement Learning with Clairvoyant Experts 2021 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2021, : 5611 - 5618
- [23] Reinforcement learning with knowledge by using a stochastic gradient method on a Bayesian network IEEE WORLD CONGRESS ON COMPUTATIONAL INTELLIGENCE, 1998, : 2045 - 2050
- [24] AUTOMATIC AND PARALLEL GENERATION OF GRADIENT AND HESSIAN MATRIX LECTURE NOTES IN CONTROL AND INFORMATION SCIENCES, 1990, 143 : 104 - 114
- [25] Using policy gradient reinforcement learning on autonomous robot controllers IROS 2003: PROCEEDINGS OF THE 2003 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS, VOLS 1-4, 2003, : 406 - 411
- [26] Reinforcement Learning based on MPC and the Stochastic Policy Gradient Method 2021 AMERICAN CONTROL CONFERENCE (ACC), 2021, : 1947 - 1952
- [29] KERNEL-BASED LIFELONG POLICY GRADIENT REINFORCEMENT LEARNING 2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 3500 - 3504
- [30] Cold-Start Reinforcement Learning with Softmax Policy Gradient ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 30 (NIPS 2017), 2017, 30