Computing Team-Maxmin Equilibria in Zero-Sum Multiplayer Extensive-Form Games

被引：0

作者：

Zhang, Youzhi ^{[1
]}

An, Bo ^{[1
]}

机构：

[1] Nanyang Technol Univ, Sch Comp Sci & Engn, Singapore, Singapore

来源：

THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE | 2020年 / 34卷

关键词：

NORMALIZED MULTIPARAMETRIC DISAGGREGATION; BOUNDS;

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

The study of finding the equilibrium for multiplayer games is challenging. This paper focuses on computing Team-Maxmin Equilibria (TMEs) in zero-sum multiplayer Extensive-Form Games (EFGs), which describes the optimal strategies for a team of players who share the same goal but they take actions independently against an adversary. TMEs can capture many realistic scenarios, including: 1) a team of players play against a target player in poker games; and 2) defense resources schedule and patrol independently in security games. However, the study of efficiently finding TMEs within any given accuracy in EFGs is almost completely unexplored. To fill this gap, we first study the inefficiency caused by computing the equilibrium where team players correlate their strategies and then transforming it into the mixed strategy profile of the team and show that this inefficiency can be arbitrarily large. Second, to efficiently solve the non-convex program for finding TMEs directly, we develop the Associated Recursive Asynchronous Multiparametric Disaggregation Technique (ARAMDT) to approximate multilinear terms in the program with two novel techniques: 1) an asynchronous precision method to reduce the number of constraints and variables for approximation by using different precision levels to approximate these terms; and 2) an associated constraint method to reduce the feasible solution space of the mixed-integer linear program resulting from ARAMDT by exploiting the relation between these terms. Third, we develop a novel iterative algorithm to efficiently compute TMEs within any given accuracy based on ARAMDT. Our algorithm is orders of magnitude faster than baselines in the experimental evaluation.

引用

页码：2318 / 2325

页数：8

共 50 条

[41] Computing Nash Equilibria for Multiplayer Symmetric Games Based on Tensor Form
Liu, Qilong
Liao, Qingshui
MATHEMATICS, 2023, 11 (10)
[42] Localization for a class of two-team zero-sum Markov games
Chang, HS
Fu, MC
2004 43RD IEEE CONFERENCE ON DECISION AND CONTROL (CDC), VOLS 1-5, 2004, : 4844 - 4849
[43] Low-Variance and Zero-Variance Baselines for Extensive-Form Games
Davis, Trevor
Schmid, Martin
Bowling, Michael
INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 119, 2020, 119
[44] Extensive-form games with heterogeneous populations: solution concepts, equilibria characterization, learning dynamics
Gatti, Nicola
Panozzo, Fabio
Restelli, Marcello
INTELLIGENZA ARTIFICIALE, 2016, 10 (01) : 19 - 31
[45] Attack-Defense Trees and Two-Player Binary Zero-Sum Extensive Form Games Are Equivalent
Kordy, Barbara
Mauw, Sjouke
Melissen, Matthijs
Schweitzer, Patrick
DECISION AND GAME THEORY FOR SECURITY, 2010, 6442 : 245 - 256
[46] A NEO2 BAYESIAN FOUNDATION OF THE MAXMIN VALUE FOR 2-PERSON ZERO-SUM GAMES
HART, S
MODICA, S
SCHMEIDLER, D
INTERNATIONAL JOURNAL OF GAME THEORY, 1994, 23 (04) : 347 - 358
[47] Multiplayer zero-sum games optimal control for modular robot manipulators with interconnected dynamic couplings
Zhu, Xinye
An, Tianjiao
Dong, Bo
INTERNATIONAL JOURNAL OF ADAPTIVE CONTROL AND SIGNAL PROCESSING, 2022, 36 (12) : 3254 - 3270
[48] Nash equilibria of Cauchy-random zero-sum and coordination matrix games
Roberts, David P.
INTERNATIONAL JOURNAL OF GAME THEORY, 2006, 34 (02) : 167 - 184
[49] Bias and overtaking equilibria for zero-sum continuous-time Markov games
Tomás Prieto-Rumeau
Onésimo Hernández-Lerma
Mathematical Methods of Operations Research, 2005, 61 : 437 - 454
[50] Asymmetric Constrained Optimal Tracking Control With Critic Learning of Nonlinear Multiplayer Zero-Sum Games
Qiao, Junfei
Li, Menghua
Wang, Ding
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 35 (04) : 5671 - 5683

← 1 2 3 4 5 →