41 items in total
- [3] On Finite-Time Convergence of Actor-Critic Algorithm [J]. IEEE Journal on Selected Areas in Information Theory, 2021, 2 (02) : 652 - 664
- [5] An Online Actor–Critic Algorithm with Function Approximation for Constrained Markov Decision Processes [J]. Journal of Optimization Theory and Applications, 2012, 153 : 688 - 708
- [6] Natural Policy Gradient Primal-Dual Method for Constrained Markov Decision Processes [J]. Advances in Neural Information Processing Systems 33, NeurIPS 2020, 2020, 33
- [7] An Online Primal-Dual Method for Discounted Markov Decision Processes [J]. 2016 IEEE 55th Conference on Decision and Control (CDC), 2016 : 4516 - 4521
- [8] Actor-critic algorithms for hierarchical Markov decision processes [J]. Automatica, 2006, 42 (04) : 637 - 644
- [10] Provably Efficient Convergence of Primal-Dual Actor-Critic with Nonlinear Function Approximation [J]. Proceedings of the International Joint Conference on Autonomous Agents and Multiagent Systems, AAMAS, 2023, 2023-May : 2640 - 2642