Efficient Regret Minimization Algorithm for Extensive-Form Correlated Equilibrium

Citations: 0

Authors:

Farina, Gabriele [1]

Ling, Chun Kai [1]

Fang, Fei [2]

Sandholm, Tuomas [1, 3, 4, 5]

Affiliations:

[1] Carnegie Mellon Univ, Dept Comp Sci, Pittsburgh, PA 15213 USA

[2] Carnegie Mellon Univ, Inst Software Res, Pittsburgh, PA 15213 USA

[3] Strateg Machine Inc, Morristown, NJ USA

[4] Strategy Robot Inc, Pittsburgh, PA USA

[5] Optimized Markets Inc, Pittsburgh, PA USA

Source:

ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019) | 2019 / Vol. 32

Funding:

US National Science Foundation

Keywords:

DOI:

Not available

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory]

Discipline classification codes:

081104; 0812; 0835; 1405

Abstract:

Self-play methods based on regret minimization have become the state of the art for computing Nash equilibria in large two-player zero-sum extensive-form games. These methods fundamentally rely on the hierarchical structure of the players' sequential strategy spaces to construct a regret minimizer that recursively minimizes regret at each decision point in the game tree. In this paper, we introduce the first efficient regret minimization algorithm for computing extensive-form correlated equilibria in large two-player general-sum games with no chance moves. Designing such an algorithm is significantly more challenging than designing one for the Nash equilibrium counterpart, as the constraints that define the space of correlation plans lack the hierarchical structure and might even form cycles. We show that some of the constraints are redundant and can be excluded from consideration, and present an efficient algorithm that generates the space of extensive-form correlation plans incrementally from the remaining constraints. This structural decomposition is achieved via a special convexity-preserving operation that we coin scaled extension. We show that a regret minimizer can be designed for a scaled extension of any two convex sets, and that from the decomposition we then obtain a global regret minimizer. Our algorithm produces feasible iterates. Experiments show that it significantly outperforms prior approaches and that for larger problems it is the only viable option.

Pages: 11

50 items in total

[31] Efficient Deviation Types and Learning for Hindsight Rationality in Extensive-Form Games
Morrill, Dustin
D'Orazio, Ryan
Lanctot, Marc
Wright, James R.
Bowling, Michael
Greenwald, Amy R.
INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 139, 2021, 139
[32] Extensive-form games and strategic complementarities
Echenique, F
GAMES AND ECONOMIC BEHAVIOR, 2004, 46 (02) : 348 - 364
[33] Variance Decompositions for Extensive-Form Games
Cloud, Alex
Laber, Eric B.
2021 IEEE CONFERENCE ON GAMES (COG), 2021 : 380 - 387
[34] RATIONAL BEHAVIOR IN EXTENSIVE-FORM GAMES
RENY, PJ
CANADIAN JOURNAL OF ECONOMICS-REVUE CANADIENNE D ECONOMIQUE, 1995, 28 (01) : 1 - 16
[35] Stability and trembles in extensive-form games
Heller, Yuval
GAMES AND ECONOMIC BEHAVIOR, 2014, 84 : 132 - 136
[36] Rational Play in Extensive-Form Games
Bonanno, Giacomo
GAMES, 2022, 13 (06)
[37] Safe Subgame Resolving for Extensive Form Correlated Equilibrium
Ling, Chun Kai
Fang, Fei
THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / THE TWELFTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022 : 5116 - 5123
[38] Equilibrium in justifiable strategies: A model of reason-based choice in extensive-form games
Spiegler, R
REVIEW OF ECONOMIC STUDIES, 2002, 69 (03) : 691 - 706
[39] An Algorithm for Constructing and Solving Imperfect Recall Abstractions of Large Extensive-Form Games
Cermak, Jiri
Bosansky, Branislav
Lisy, Viliam
PROCEEDINGS OF THE TWENTY-SIXTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2017 : 936 - 942
[40] Equilibrium Finding in Normal-Form Games via Greedy Regret Minimization
Zhang, Hugh
Lerer, Adam
Brown, Noam
THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / TWELFTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022 : 9484 - 9492
