共 42 条
- [22] On the convergence of policy gradient methods to Nash equilibria in general stochastic games ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35, NEURIPS 2022, 2022,
- [23] Provably Efficient Fictitious Play Policy Optimization for Zero-Sum Markov Games with Structured Transitions INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 139, 2021, 139
- [24] Convergence to Nash Equilibrium and No-regret Guarantee in (Markov) Potential Games INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 238, 2024, 238
- [25] Fast Convergence of Natural Gradient Descent for Overparameterized Neural Networks ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32
- [26] Convergence Rates for Localized Actor-Critic in Networked Markov Potential Games UNCERTAINTY IN ARTIFICIAL INTELLIGENCE, 2023, 216 : 2563 - 2573
- [27] Policy Gradient Algorithm in Two-Player Zero-Sum Markov Games Moshi Shibie yu Rengong Zhineng/Pattern Recognition and Artificial Intelligence, 2023, 36 (01): : 81 - 91
- [29] Convergence of Policy Gradient Methods for Nash Equilibria in General-sum Stochastic Games IFAC PAPERSONLINE, 2023, 56 (02): : 3435 - 3440
- [30] An off-policy natural policy gradient method for a partial observable Markov decision process ARTIFICIAL NEURAL NETWORKS: FORMAL MODELS AND THEIR APPLICATIONS - ICANN 2005, PT 2, PROCEEDINGS, 2005, 3697 : 431 - 436