共 42 条
- [31] Fast Convergence for Time-Varying Semi-Anonymous Potential Games 2014 AMERICAN CONTROL CONFERENCE (ACC), 2014, : 5384 - 5389
- [32] Independent Deep Deterministic Policy Gradient Reinforcement Learning in Cooperative Multiagent Pursuit Games ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2021, PT IV, 2021, 12894 : 625 - 637
- [33] Fast Convergence of Optimistic Gradient Ascent in Network Zero-Sum Extensive Form Games ALGORITHMIC GAME THEORY, SAGT 2022, 2022, 13584 : 383 - 399
- [35] Convergence and optimality of policy gradient primal-dual method for constrained Markov decision processes 2022 AMERICAN CONTROL CONFERENCE, ACC, 2022, : 2851 - 2856
- [36] Natural Policy Gradient Primal-Dual Method for Constrained Markov Decision Processes ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 33, NEURIPS 2020, 2020, 33
- [37] Symmetric (Optimistic) Natural Policy Gradient for Multi-agent Learning with Parameter Convergence INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 206, 2023, 206
- [39] On the Convergence of Natural Policy Gradient and Mirror Descent-Like Policy Methods for Average-Reward MDPs 2023 62ND IEEE CONFERENCE ON DECISION AND CONTROL, CDC, 2023, : 1979 - 1984
- [40] Understanding approximate Fisher information for fast convergence of natural gradient descent in wide neural networks* JOURNAL OF STATISTICAL MECHANICS-THEORY AND EXPERIMENT, 2021, 2021 (12):