Efficient Regret Minimization Algorithm for Extensive-Form Correlated Equilibrium

被引：0

作者：

Farina, Gabriele ^{[1
]}

Ling, Chun Kai ^{[1
]}

Fang, Fei ^{[2
]}

Sandholm, Tuomas ^{[1
,3
,4
,5
]}

机构：

[1] Carnegie Mellon Univ, Dept Comp Sci, Pittsburgh, PA 15213 USA

[2] Carnegie Mellon Univ, Inst Software Res, Pittsburgh, PA 15213 USA

[3] Strateg Machine Inc, Morristown, NJ USA

[4] Strategy Robot Inc, Pittsburgh, PA USA

[5] Optimized Markets Inc, Pittsburgh, PA USA

来源：

ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019) | 2019年 / 32卷

基金：

美国国家科学基金会;

关键词：

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Self-play methods based on regret minimization have become the state of the art for computing Nash equilibria in large two-players zero-sum extensive-form games. These methods fundamentally rely on the hierarchical structure of the players' sequential strategy spaces to construct a regret minimizer that recursively minimizes regret at each decision point in the game tree. In this paper, we introduce the first efficient regret minimization algorithm for computing extensive-form correlated equilibria in large two-player general-sum games with no chance moves. Designing such an algorithm is significantly more challenging than designing one for the Nash equilibrium counterpart, as the constraints that define the space of correlation plans lack the hierarchical structure and might even form cycles. We show that some of the constraints are redundant and can be excluded from consideration, and present an efficient algorithm that generates the space of extensive-form correlation plans incrementally from the remaining constraints. This structural decomposition is achieved via a special convexity-preserving operation that we coin scaled extension. We show that a regret minimizer can be designed for a scaled extension of any two convex sets, and that from the decomposition we then obtain a global regret minimizer. Our algorithm produces feasible iterates. Experiments show that it significantly outperforms prior approaches and for larger problems it is the only viable option.

引用

页数：11

共 50 条

[21] RATIONALITY IN EXTENSIVE-FORM GAMES
RENY, PJ
JOURNAL OF ECONOMIC PERSPECTIVES, 1992, 6 (04): : 103 - 118
[22] Timeability of Extensive-Form Games
Jakobsen, Sune K.
Sorensen, Troels B.
Conitzer, Vincent
ITCS'16: PROCEEDINGS OF THE 2016 ACM CONFERENCE ON INNOVATIONS IN THEORETICAL COMPUTER SCIENCE, 2016, : 191 - 199
[23] Computational Extensive-Form Games
Halpern, Joseph Y.
Pass, Rafael
Seeman, Lior
EC'16: PROCEEDINGS OF THE 2016 ACM CONFERENCE ON ECONOMICS AND COMPUTATION, 2016, : 681 - 698
[24] Quantum extensive-form games
Kazuki Ikeda
Quantum Information Processing, 22
[25] Sequence-Form Algorithm for Computing Stackelberg Equilibria in Extensive-Form Games
Bosansky, Branislav
Cermak, Jiri
PROCEEDINGS OF THE TWENTY-NINTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2015, : 805 - 811
[26] Quantum extensive-form games
Ikeda, Kazuki
QUANTUM INFORMATION PROCESSING, 2023, 22 (01)
[27] Extensive-Form Perfect Equilibrium Computation in Two-Player Games
Farina, Gabriele
Gatti, Nicola
THIRTY-FIRST AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2017, : 502 - 508
[28] Strategic negotiations for extensive-form games
Dave de Jonge
Dongmo Zhang
Autonomous Agents and Multi-Agent Systems, 2020, 34
[29] Coarse Correlation in Extensive-Form Games
Farina, Gabriele
Bianchi, Tommaso
Sandholm, Tuomas
THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 1934 - 1941
[30] Strategic negotiations for extensive-form games
de Jonge, Dave
Zhang, Dongmo
AUTONOMOUS AGENTS AND MULTI-AGENT SYSTEMS, 2020, 34 (01)

← 1 2 3 4 5 →