50 items in total
- [41] Linear convergence of inexact descent method and inexact proximal gradient algorithms for lower-order regularization problems Journal of Global Optimization, 2021, 79 : 853 - 883
- [42] On the Convergence of Natural Policy Gradient and Mirror Descent-Like Policy Methods for Average-Reward MDPs 2023 62ND IEEE CONFERENCE ON DECISION AND CONTROL, CDC, 2023 : 1979 - 1984
- [43] Convergence of Policy Gradient for Entropy Regularized MDPs with Neural Network Approximation in the Mean-Field Regime INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 162, 2022
- [44] Policy gradient method for team Markov games INTELLIGENT DATA ENGINEERING AND AUTOMATED LEARNING IDEAL 2004, PROCEEDINGS, 2004, 3177 : 733 - 739
- [46] Reinforcement Learning in Linear Quadratic Deep Structured Teams: Global Convergence of Policy Gradient Methods 2020 59TH IEEE CONFERENCE ON DECISION AND CONTROL (CDC), 2020 : 4927 - 4932
- [47] Optimistic Policy Gradient in Multi-Player Markov Games with a Single Controller: Convergence beyond the Minty Property THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 9, 2024 : 9451 - 9459
- [48] Policy gradient algorithm and its convergence analysis for two-player zero-sum Markov games Zhejiang Daxue Xuebao (Gongxue Ban)/Journal of Zhejiang University (Engineering Science), 2024, 58 (03) : 480 - 491
- [50] Policy Optimization for H2 Linear Control with H∞ Robustness Guarantee: Implicit Regularization and Global Convergence LEARNING FOR DYNAMICS AND CONTROL, VOL 120, 2020, 120 : 179 - 190