Computing Team-Maxmin Equilibria in Zero-Sum Multiplayer Extensive-Form Games

被引:0
|
作者
Zhang, Youzhi [1 ]
An, Bo [1 ]
机构
[1] Nanyang Technol Univ, Sch Comp Sci & Engn, Singapore, Singapore
关键词
NORMALIZED MULTIPARAMETRIC DISAGGREGATION; BOUNDS;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The study of finding the equilibrium for multiplayer games is challenging. This paper focuses on computing Team-Maxmin Equilibria (TMEs) in zero-sum multiplayer Extensive-Form Games (EFGs), which describes the optimal strategies for a team of players who share the same goal but they take actions independently against an adversary. TMEs can capture many realistic scenarios, including: 1) a team of players play against a target player in poker games; and 2) defense resources schedule and patrol independently in security games. However, the study of efficiently finding TMEs within any given accuracy in EFGs is almost completely unexplored. To fill this gap, we first study the inefficiency caused by computing the equilibrium where team players correlate their strategies and then transforming it into the mixed strategy profile of the team and show that this inefficiency can be arbitrarily large. Second, to efficiently solve the non-convex program for finding TMEs directly, we develop the Associated Recursive Asynchronous Multiparametric Disaggregation Technique (ARAMDT) to approximate multilinear terms in the program with two novel techniques: 1) an asynchronous precision method to reduce the number of constraints and variables for approximation by using different precision levels to approximate these terms; and 2) an associated constraint method to reduce the feasible solution space of the mixed-integer linear program resulting from ARAMDT by exploiting the relation between these terms. Third, we develop a novel iterative algorithm to efficiently compute TMEs within any given accuracy based on ARAMDT. Our algorithm is orders of magnitude faster than baselines in the experimental evaluation.
引用
收藏
页码:2318 / 2325
页数:8
相关论文
共 50 条
  • [41] Computing Nash Equilibria for Multiplayer Symmetric Games Based on Tensor Form
    Liu, Qilong
    Liao, Qingshui
    MATHEMATICS, 2023, 11 (10)
  • [42] Localization for a class of two-team zero-sum Markov games
    Chang, HS
    Fu, MC
    2004 43RD IEEE CONFERENCE ON DECISION AND CONTROL (CDC), VOLS 1-5, 2004, : 4844 - 4849
  • [43] Low-Variance and Zero-Variance Baselines for Extensive-Form Games
    Davis, Trevor
    Schmid, Martin
    Bowling, Michael
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 119, 2020, 119
  • [44] Extensive-form games with heterogeneous populations: solution concepts, equilibria characterization, learning dynamics
    Gatti, Nicola
    Panozzo, Fabio
    Restelli, Marcello
    INTELLIGENZA ARTIFICIALE, 2016, 10 (01) : 19 - 31
  • [45] Attack-Defense Trees and Two-Player Binary Zero-Sum Extensive Form Games Are Equivalent
    Kordy, Barbara
    Mauw, Sjouke
    Melissen, Matthijs
    Schweitzer, Patrick
    DECISION AND GAME THEORY FOR SECURITY, 2010, 6442 : 245 - 256
  • [46] A NEO2 BAYESIAN FOUNDATION OF THE MAXMIN VALUE FOR 2-PERSON ZERO-SUM GAMES
    HART, S
    MODICA, S
    SCHMEIDLER, D
    INTERNATIONAL JOURNAL OF GAME THEORY, 1994, 23 (04) : 347 - 358
  • [47] Multiplayer zero-sum games optimal control for modular robot manipulators with interconnected dynamic couplings
    Zhu, Xinye
    An, Tianjiao
    Dong, Bo
    INTERNATIONAL JOURNAL OF ADAPTIVE CONTROL AND SIGNAL PROCESSING, 2022, 36 (12) : 3254 - 3270
  • [48] Nash equilibria of Cauchy-random zero-sum and coordination matrix games
    Roberts, David P.
    INTERNATIONAL JOURNAL OF GAME THEORY, 2006, 34 (02) : 167 - 184
  • [49] Bias and overtaking equilibria for zero-sum continuous-time Markov games
    Tomás Prieto-Rumeau
    Onésimo Hernández-Lerma
    Mathematical Methods of Operations Research, 2005, 61 : 437 - 454
  • [50] Asymmetric Constrained Optimal Tracking Control With Critic Learning of Nonlinear Multiplayer Zero-Sum Games
    Qiao, Junfei
    Li, Menghua
    Wang, Ding
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 35 (04) : 5671 - 5683