Team Correlated Equilibria in Zero-Sum Extensive-Form Games via Tree Decompositions

被引：0

作者：

Zhang, Brian Hu ^{[1
]}

Sandholm, Tuomas ^{[1
,2
,3
,4
]}

机构：

[1] Carnegie Mellon Univ, Comp Sci Dept, Pittsburgh, PA 15213 USA

[2] Strateg Machine Inc, Charlotte, NC USA

[3] Strategy Robot Inc, Pittsburgh, PA USA

[4] Optimized Markets Inc, Pittsburgh, PA USA

来源：

THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / THE TWELVETH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE | 2022年

基金：

美国国家科学基金会;

关键词：

POKER;

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Despite the many recent practical and theoretical breakthroughs in computational game theory, equilibrium finding in extensive-form team games remains a significant challenge. While NP-hard in the worst case, there are provably efficient algorithms for certain families of team game. In particular, if the game has common external information, also known as A-loss recall-informally, actions played by non-team members (i.e., the opposing team or nature) are either unknown to the entire team, or common knowledge within the team-then polynomial-time algorithms exist. In this paper, we devise a completely new algorithm for solving team games. It uses a tree decomposition of the constraint system representing each team's strategy to reduce the number and degree of constraints required for correctness (tightness of the mathematical program). Our approach has the bags of the tree decomposition correspond to team-public states-that is, minimal sets of nodes (that is, states of the team) such that, upon reaching the set, it is common knowledge among the players on the team that the set has been reached. Our algorithm reduces the problem of solving team games to a linear program with at most O(NWw+1) nonzero entries in the constraint matrix, where N is the size of the game tree, w is a parameter that depends on the amount of uncommon external information, and W is the treewidth of the tree decomposition. In public-action games, our program size is bounded by the tighter 2(O(nt)) N for teams of n players with t types each. Our algorithm is based on a new way to write a custom, concise tree decomposition, and its fast run time does not assume that the decomposition has small treewidth. Since our algorithm describes the polytope of correlated strategies directly, we get equilibrium finding in correlated strategies for free-instead of, say, having to run a double oracle algorithm. We show via experiments on a standard suite of games that our algorithm achieves state-of-the-art performance on all benchmark game classes except one. We also present, to our knowledge, the first experiments for this setting where both teams have more than one member.

引用

页码：5252 / 5259

页数：8

共 50 条

[1] Computing Team-Maxmin Equilibria in Zero-Sum Multiplayer Extensive-Form Games
Zhang, Youzhi
An, Bo
THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 2318 - 2325
[2] Computing Optimal Equilibria and Mechanisms via Learning in Zero-Sum Extensive-Form Games
Zhang, Brian Hu
Farina, Gabriele
Anagnostides, Ioannis
Cacciamani, Federico
McAleer, Stephen
Haupt, Andreas
Celli, Andrea
Gatti, Nicola
Conitzer, Vincent
Sandholm, Tuomas
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
[3] Practical Performance of Refinements of Nash Equilibria in Extensive-Form Zero-Sum Games
Cermak, Jiri
Bosansky, Branislav
Lisy, Viliam
21ST EUROPEAN CONFERENCE ON ARTIFICIAL INTELLIGENCE (ECAI 2014), 2014, 263 : 201 - +
[4] Computing Ex Ante Coordinated Team-Maxmin Equilibria in Zero-Sum Multiplayer Extensive-Form Games
Zhang, Youzhi
An, Bo
Cerny, Jakub
THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 5813 - 5821
[5] Learning Extensive-Form Perfect Equilibria in Two-Player Zero-Sum Sequential Games
Bernasconi, Martino
Marchesi, Alberto
Trovo, Francesco
INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 238, 2024, 238
[6] Computing Maxmin Strategies in Extensive-form Zero-sum Games with Imperfect Recall
Bosansky, Branislav
Cermak, Jiri
Horak, Karel
Pechoucek, Michal
ICAART: PROCEEDINGS OF THE 9TH INTERNATIONAL CONFERENCE ON AGENTS AND ARTIFICIAL INTELLIGENCE, VOL 2, 2017, : 63 - 74
[7] An Exact Double-Oracle Algorithm for Zero-Sum Extensive-Form Games with Imperfect Information
Bosansky, Branislav
Kiekintveld, Christopher
Lisy, Viliam
Pechoucek, Michal
JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 2014, 51 : 829 - 866
[8] Ex ante coordination and collusion in zero-sum multi-player extensive-form games
Farina, Gabriele
Celli, Andrea
Gatti, Nicola
Sandholm, Tuomas
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 31 (NIPS 2018), 2018, 31
[9] Variance Decompositions for Extensive-Form Games
Cloud, Alex
Laber, Eric B.
2021 IEEE CONFERENCE ON GAMES (COG), 2021, : 380 - 387
[10] Strategic decompositions of normal form games: Zero-sum games and potential games
Hwang, Sung-Ha
Rey-Bellet, Luc
GAMES AND ECONOMIC BEHAVIOR, 2020, 122 : 370 - 390

← 1 2 3 4 5 →