Efficient Regret Minimization Algorithm for Extensive-Form Correlated Equilibrium

被引：0

作者：

Farina, Gabriele ^{[1
]}

Ling, Chun Kai ^{[1
]}

Fang, Fei ^{[2
]}

Sandholm, Tuomas ^{[1
,3
,4
,5
]}

机构：

[1] Carnegie Mellon Univ, Dept Comp Sci, Pittsburgh, PA 15213 USA

[2] Carnegie Mellon Univ, Inst Software Res, Pittsburgh, PA 15213 USA

[3] Strateg Machine Inc, Morristown, NJ USA

[4] Strategy Robot Inc, Pittsburgh, PA USA

[5] Optimized Markets Inc, Pittsburgh, PA USA

来源：

ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019) | 2019年 / 32卷

基金：

美国国家科学基金会;

关键词：

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Self-play methods based on regret minimization have become the state of the art for computing Nash equilibria in large two-players zero-sum extensive-form games. These methods fundamentally rely on the hierarchical structure of the players' sequential strategy spaces to construct a regret minimizer that recursively minimizes regret at each decision point in the game tree. In this paper, we introduce the first efficient regret minimization algorithm for computing extensive-form correlated equilibria in large two-player general-sum games with no chance moves. Designing such an algorithm is significantly more challenging than designing one for the Nash equilibrium counterpart, as the constraints that define the space of correlation plans lack the hierarchical structure and might even form cycles. We show that some of the constraints are redundant and can be excluded from consideration, and present an efficient algorithm that generates the space of extensive-form correlation plans incrementally from the remaining constraints. This structural decomposition is achieved via a special convexity-preserving operation that we coin scaled extension. We show that a regret minimizer can be designed for a scaled extension of any two convex sets, and that from the decomposition we then obtain a global regret minimizer. Our algorithm produces feasible iterates. Experiments show that it significantly outperforms prior approaches and for larger problems it is the only viable option.

引用

页数：11

共 50 条

[41] ON SELF-ENFORCEMENT IN EXTENSIVE-FORM GAMES
WEIBULL, JW
GAMES AND ECONOMIC BEHAVIOR, 1992, 4 (03) : 450 - 462
[42] Team Correlated Equilibria in Zero-Sum Extensive-Form Games via Tree Decompositions
Zhang, Brian Hu
Sandholm, Tuomas
THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / THE TWELVETH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, : 5252 - 5259
[43] Discretization of Continuous Action Spaces in Extensive-Form Games
Kroer, Christian
Sandholm, Tuomas
PROCEEDINGS OF THE 2015 INTERNATIONAL CONFERENCE ON AUTONOMOUS AGENTS & MULTIAGENT SYSTEMS (AAMAS'15), 2015, : 47 - 56
[44] Prudent Rationalizability in Generalized Extensive-form Games with Unawareness
Heifetz, Aviad
Meier, Martin
Schipper, Burkhard C.
B E JOURNAL OF THEORETICAL ECONOMICS, 2021, 21 (02): : 525 - 556
[45] SET-THEORETIC EQUIVALENCE OF EXTENSIVE-FORM GAMES
BONANNO, G
INTERNATIONAL JOURNAL OF GAME THEORY, 1992, 20 (04) : 429 - 447
[46] Iterated Admissibility does not Refine Extensive-form Rationalizability
Catonini, Emiliano
ECONOMIC JOURNAL, 2024, 134 (663): : 3017 - 3026
[47] Solving Large Extensive-Form Games with Strategy Constraints
Davis, Trevor
Waugh, Kevin
Bowling, Michael
THIRTY-THIRD AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FIRST INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / NINTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2019, : 1861 - 1868
[48] Last-iterate Convergence in Extensive-Form Games
Lee, Chung-Wei
Kroer, Christian
Luo, Haipeng
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
[49] A Unified Framework for Extensive-Form Game Abstraction with Bounds
Kroer, Christian
Sandholm, Tuomas
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 31 (NIPS 2018), 2018, 31
[50] Designing Learning Algorithms over the Sequence Form of an Extensive-Form Game
Manino, Edoardo
Gatti, Nicola
Restelli, Marcello
AAMAS'17: PROCEEDINGS OF THE 16TH INTERNATIONAL CONFERENCE ON AUTONOMOUS AGENTS AND MULTIAGENT SYSTEMS, 2017, : 1622 - 1624

← 1 2 3 4 5 →