50 entries in total
- [42] The Lagging Anchor Algorithm: Reinforcement Learning in Two-Player Zero-Sum Games with Imperfect Information. Machine Learning, 2002, 49: 5-37
- [43] GPI-Based design for partially unknown nonlinear two-player zero-sum games. Journal of the Franklin Institute-Engineering and Applied Mathematics, 2023, 360 (3): 2068-2088
- [44] Online solution of nonlinear two-player zero-sum games using synchronous policy iteration. International Journal of Robust and Nonlinear Control, 2012, 22 (13): 1460-1483
- [45] Decentralized Single-Timescale Actor Critic on Zero-Sum Two-Player Stochastic Games. International Conference on Machine Learning, Vol 139, 2021, 139
- [47] Sufficient Conditions for Optimality in Finite-Horizon Two-Player Zero-Sum Hybrid Games. 2022 IEEE 61st Conference on Decision and Control (CDC), 2022: 3268-3273
- [48] Policy gradient algorithm and its convergence analysis for two-player zero-sum Markov games. Zhejiang Daxue Xuebao (Gongxue Ban)/Journal of Zhejiang University (Engineering Science), 2024, 58 (3): 480-491
- [50] A Meta-evolutionary Learning Algorithm for Opponent Adaptation in Two-player Zero-sum Games. Zidonghua Xuebao/Acta Automatica Sinica, 2022, 48 (10): 2462-2473