50 items in total
- [21] A Policy Gradient Algorithm for Learning to Learn in Multiagent Reinforcement Learning INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 139, 2021, 139
- [22] A Sensorimotor Reinforcement Learning Framework for Physical Human-Robot Interaction 2016 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS 2016), 2016, : 2682 - 2688
- [23] Policy gradient reinforcement learning for fast quadrupedal locomotion 2004 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION, VOLS 1-5, PROCEEDINGS, 2004, : 2619 - 2624
- [24] Fast Stochastic Kalman Gradient Descent for Reinforcement Learning LEARNING FOR DYNAMICS AND CONTROL, VOL 144, 2021, 144
- [25] Policy Gradient using Weak Derivatives for Reinforcement Learning 2019 53RD ANNUAL CONFERENCE ON INFORMATION SCIENCES AND SYSTEMS (CISS), 2019
- [27] Direct gradient-based reinforcement learning for robot behavior learning INFORMATICS IN CONTROL, AUTOMATION AND ROBOTICS II, 2007, : 175 - +
- [29] A Hybrid Stochastic Policy Gradient Algorithm for Reinforcement Learning INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 108, 2020, 108 : 374 - 384
- [30] Independent Policy Gradient Methods for Competitive Reinforcement Learning ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 33, NEURIPS 2020, 2020, 33