共 50 条
- [1] Deterministic Policy Gradient Algorithms [J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 32 (CYCLE 1), 2014, 32
- [3] Online Gradient Descent Learning Algorithms [J]. Foundations of Computational Mathematics, 2008, 8 : 561 - 596
- [4] An improvement of policy gradient estimation algorithms [J]. WODES' 08: PROCEEDINGS OF THE 9TH INTERNATIONAL WORKSHOP ON DISCRETE EVENT SYSTEMS, 2008, : 168 - 172
- [5] APPROXIMATE NEWTON POLICY GRADIENT ALGORITHMS [J]. SIAM Journal on Scientific Computing, 2023, 45 (05):
- [6] Successful Ingredients of Policy Gradient Algorithms [J]. PROCEEDINGS OF THE THIRTIETH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2021, 2021, : 2455 - 2461