共 50 条
- [21] Policy Gradient using Weak Derivatives for Reinforcement Learning 2019 53RD ANNUAL CONFERENCE ON INFORMATION SCIENCES AND SYSTEMS (CISS), 2019,
- [23] Direct gradient-based reinforcement learning for robot behavior learning INFORMATICS IN CONTROL, AUTOMATION AND ROBOTICS II, 2007, : 175 - +
- [25] A Hybrid Stochastic Policy Gradient Algorithm for Reinforcement Learning INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 108, 2020, 108 : 374 - 384
- [26] Independent Policy Gradient Methods for Competitive Reinforcement Learning ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 33, NEURIPS 2020, 2020, 33
- [27] Evolution-Guided Policy Gradient in Reinforcement Learning ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 31 (NIPS 2018), 2018, 31
- [28] Policy Gradient using Weak Derivatives for Reinforcement Learning 2019 IEEE 58TH CONFERENCE ON DECISION AND CONTROL (CDC), 2019, : 5531 - 5537
- [29] Total stochastic gradient algorithms and applications in reinforcement learning ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 31 (NIPS 2018), 2018, 31