50 items in total
- [43] On the Convergence of Natural Policy Gradient and Mirror Descent-Like Policy Methods for Average-Reward MDPs. 2023 62ND IEEE CONFERENCE ON DECISION AND CONTROL, CDC, 2023: 1979-1984
- [44] A New Framework for Matrix Discrepancy: Partial Coloring Bounds via Mirror Descent. PROCEEDINGS OF THE 54TH ANNUAL ACM SIGACT SYMPOSIUM ON THEORY OF COMPUTING (STOC '22), 2022: 649-658
- [46] Approximate Steepest Coordinate Descent. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 70, 2017, 70
- [47] Policy search via density estimation. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 12, 2000, 12: 1022-1028
- [48] The Information Geometry of Mirror Descent. GEOMETRIC SCIENCE OF INFORMATION, GSI 2015, 2015, 9389: 359-368
- [50] Stunt Driving via Policy Search. 2012 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2012: 4699-4704