Efficient Regret Minimization Algorithm for Extensive-Form Correlated Equilibrium

Cited by: 0

Authors:

Farina, Gabriele [1]

Ling, Chun Kai [1]

Fang, Fei [2]

Sandholm, Tuomas [1,3,4,5]

Affiliations:

[1] Carnegie Mellon Univ, Dept Comp Sci, Pittsburgh, PA 15213 USA

[2] Carnegie Mellon Univ, Inst Software Res, Pittsburgh, PA 15213 USA

[3] Strateg Machine Inc, Morristown, NJ USA

[4] Strategy Robot Inc, Pittsburgh, PA USA

[5] Optimized Markets Inc, Pittsburgh, PA USA

Source:

ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019) | 2019 / Vol. 32

Funding:

National Science Foundation (USA)

Keywords:

DOI:

Not available

Chinese Library Classification number:

TP18 [Artificial intelligence theory]

Discipline classification codes:

081104; 0812; 0835; 1405

Abstract:

Self-play methods based on regret minimization have become the state of the art for computing Nash equilibria in large two-player zero-sum extensive-form games. These methods fundamentally rely on the hierarchical structure of the players' sequential strategy spaces to construct a regret minimizer that recursively minimizes regret at each decision point in the game tree. In this paper, we introduce the first efficient regret minimization algorithm for computing extensive-form correlated equilibria in large two-player general-sum games with no chance moves. Designing such an algorithm is significantly more challenging than designing one for the Nash equilibrium counterpart, as the constraints that define the space of correlation plans lack this hierarchical structure and might even form cycles. We show that some of the constraints are redundant and can be excluded from consideration, and present an efficient algorithm that generates the space of extensive-form correlation plans incrementally from the remaining constraints. This structural decomposition is achieved via a special convexity-preserving operation that we coin scaled extension. We show that a regret minimizer can be designed for a scaled extension of any two convex sets, and that from the decomposition we then obtain a global regret minimizer. Our algorithm produces feasible iterates. Experiments show that it significantly outperforms prior approaches and that, for larger problems, it is the only viable option.
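
To make the notion of a regret minimizer at a single decision point more concrete, the following is a minimal sketch of regret matching over a probability simplex. It is an illustration only and is not the scaled-extension algorithm of the paper; the class name `RegretMatching` and the methods `recommend` and `observe` are hypothetical names chosen for this example.

```python
from typing import List


class RegretMatching:
    """Illustrative regret minimizer over a probability simplex.

    Hypothetical helper for exposition; not taken from the paper.
    """

    def __init__(self, n_actions: int) -> None:
        self.n = n_actions
        # Cumulative regret of each action against the strategies played so far.
        self.cum_regret = [0.0] * n_actions

    def recommend(self) -> List[float]:
        """Return the next strategy: proportional to positive cumulative regret."""
        positive = [max(r, 0.0) for r in self.cum_regret]
        total = sum(positive)
        if total <= 0.0:
            # No action has positive regret yet: play uniformly.
            return [1.0 / self.n] * self.n
        return [p / total for p in positive]

    def observe(self, strategy: List[float], utilities: List[float]) -> None:
        """Update regrets given the utility of each action at this iteration."""
        expected = sum(s * u for s, u in zip(strategy, utilities))
        for a in range(self.n):
            self.cum_regret[a] += utilities[a] - expected


# Usage: two actions with fixed (assumed) utilities.
minimizer = RegretMatching(2)
first = minimizer.recommend()
minimizer.observe(first, [1.0, 0.0])
second = minimizer.recommend()
```

In the hierarchical setting described in the abstract, one such local minimizer would typically sit at each decision point; the difficulty the paper addresses is that the space of correlation plans does not decompose this simply.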

Pages: 11
