共 50 条
- [41] Online Solution of Nonlinear Two-Player Zero-Sum Games Using Synchronous Policy Iteration 49TH IEEE CONFERENCE ON DECISION AND CONTROL (CDC), 2010, : 3040 - 3047
- [43] The Lagging Anchor Algorithm: Reinforcement Learning in Two-Player Zero-Sum Games with Imperfect Information Machine Learning, 2002, 49 : 5 - 37
- [44] GPI-Based design for partially unknown nonlinear two-player zero-sum games JOURNAL OF THE FRANKLIN INSTITUTE-ENGINEERING AND APPLIED MATHEMATICS, 2023, 360 (03): : 2068 - 2088
- [45] Online solution of nonlinear two-player zero-sum games using synchronous policy iteration International Journal of Robust and Nonlinear Control, 2012, 22 (13): : 1460 - 1483
- [46] Decentralized Single-Timescale Actor Critic on Zero-Sum Two-Player Stochastic Games INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 139, 2021, 139
- [48] Sufficient Conditions for Optimality in Finite-Horizon Two-Player Zero-Sum Hybrid Games 2022 IEEE 61ST CONFERENCE ON DECISION AND CONTROL (CDC), 2022, : 3268 - 3273
- [49] Policy gradient algorithm and its convergence analysis for two-player zero-sum Markov games Zhejiang Daxue Xuebao (Gongxue Ban)/Journal of Zhejiang University (Engineering Science), 2024, 58 (03): : 480 - 491