共 50 条
- [41] Policy Gradient using Weak Derivatives for Reinforcement Learning [J]. 2019 IEEE 58TH CONFERENCE ON DECISION AND CONTROL (CDC), 2019, : 5531 - 5537
- [42] Evolution-Guided Policy Gradient in Reinforcement Learning [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 31 (NIPS 2018), 2018, 31
- [43] Fuzzy Baselines to Stabilize Policy Gradient Reinforcement Learning [J]. EXPLAINABLE AI AND OTHER APPLICATIONS OF FUZZY TECHNIQUES, NAFIPS 2021, 2022, 258 : 436 - 446
- [44] Policy gradient methods for reinforcement learning with function approximation [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 12, 2000, 12 : 1057 - 1063
- [45] Inverse Reinforcement Learning through Policy Gradient Minimization [J]. THIRTIETH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2016, : 1993 - 1999
- [48] Optimal quadrupedal locomotion [J]. INTEGRATIVE AND COMPARATIVE BIOLOGY, 2016, 56 : E209 - E209
- [49] THE DYNAMICS OF QUADRUPEDAL LOCOMOTION [J]. JOURNAL OF BIOMECHANICAL ENGINEERING-TRANSACTIONS OF THE ASME, 1988, 110 (03): : 230 - 237
- [50] Understanding quadrupedal locomotion [J]. EUROPEAN JOURNAL OF MORPHOLOGY, 1998, 36 (4-5): : 270 - 271