50 entries in total
- [41] Curvilinear Bipedal Walk Learning in Nao Humanoid Robot using a CPG Based Policy Gradient Method. MECHANICAL AND AEROSPACE ENGINEERING, PTS 1-7, 2012, 110-116: 5161-5166
- [42] A Stochastic Policy Gradient Based Adaptive Control for Biped Walking. 2015 34TH CHINESE CONTROL CONFERENCE (CCC), 2015: 3224-3229
- [43] Online Control for Biped Robot with Incremental Learning Mechanism. APPLIED SCIENCES-BASEL, 2021, 11 (18)
- [44] A modification of gradient policy in reinforcement learning procedure. 2012 15TH INTERNATIONAL CONFERENCE ON INTERACTIVE COLLABORATIVE LEARNING (ICL), 2012
- [45] Policy Gradient Method For Robust Reinforcement Learning. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 162, 2022
- [46] Reinforcement Learning to Rank with Pairwise Policy Gradient. PROCEEDINGS OF THE 43RD INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL (SIGIR '20), 2020: 509-518
- [47] Scalable Multitask Policy Gradient Reinforcement Learning. THIRTY-FIRST AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2017: 1847-1853
- [49] Continuous Parameter Control in Genetic Algorithms using Policy Gradient Reinforcement Learning. PROCEEDINGS OF THE 13TH INTERNATIONAL JOINT CONFERENCE ON COMPUTATIONAL INTELLIGENCE (IJCCI), 2021: 115-122
- [50] Control Randomisation Approach for Policy Gradient and Application to Reinforcement Learning in Optimal Switching. Applied Mathematics and Optimization, 2025, 91 (01)