Efficient Regret Minimization Algorithm for Extensive-Form Correlated Equilibrium

被引:0
|
作者
Farina, Gabriele [1 ]
Ling, Chun Kai [1 ]
Fang, Fei [2 ]
Sandholm, Tuomas [1 ,3 ,4 ,5 ]
机构
[1] Carnegie Mellon Univ, Dept Comp Sci, Pittsburgh, PA 15213 USA
[2] Carnegie Mellon Univ, Inst Software Res, Pittsburgh, PA 15213 USA
[3] Strateg Machine Inc, Morristown, NJ USA
[4] Strategy Robot Inc, Pittsburgh, PA USA
[5] Optimized Markets Inc, Pittsburgh, PA USA
基金
美国国家科学基金会;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Self-play methods based on regret minimization have become the state of the art for computing Nash equilibria in large two-players zero-sum extensive-form games. These methods fundamentally rely on the hierarchical structure of the players' sequential strategy spaces to construct a regret minimizer that recursively minimizes regret at each decision point in the game tree. In this paper, we introduce the first efficient regret minimization algorithm for computing extensive-form correlated equilibria in large two-player general-sum games with no chance moves. Designing such an algorithm is significantly more challenging than designing one for the Nash equilibrium counterpart, as the constraints that define the space of correlation plans lack the hierarchical structure and might even form cycles. We show that some of the constraints are redundant and can be excluded from consideration, and present an efficient algorithm that generates the space of extensive-form correlation plans incrementally from the remaining constraints. This structural decomposition is achieved via a special convexity-preserving operation that we coin scaled extension. We show that a regret minimizer can be designed for a scaled extension of any two convex sets, and that from the decomposition we then obtain a global regret minimizer. Our algorithm produces feasible iterates. Experiments show that it significantly outperforms prior approaches and for larger problems it is the only viable option.
引用
收藏
页数:11
相关论文
共 50 条
  • [1] No-Regret Learning Dynamics for Extensive-Form Correlated Equilibrium
    Celli, Andrea
    Marchesi, Alberto
    Farina, Gabriele
    Gatti, Nicola
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 33, NEURIPS 2020, 2020, 33
  • [2] Stochastic Regret Minimization in Extensive-Form Games
    Farina, Gabriele
    Kroer, Christian
    Sandholm, Tuomas
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 119, 2020, 119
  • [3] Stochastic Regret Minimization in Extensive-Form Games
    Farina, Gabriele
    Kroer, Christian
    Sandholm, Tuomas
    25TH AMERICAS CONFERENCE ON INFORMATION SYSTEMS (AMCIS 2019), 2019,
  • [4] Simple Uncoupled No-regret Learning Dynamics for Extensive-form Correlated Equilibrium
    Farina, Gabriele
    Celli, Andrea
    Marchesi, Alberto
    Gatti, Nicola
    JOURNAL OF THE ACM, 2022, 69 (06)
  • [5] Efficient Phi-Regret Minimization in Extensive-Form Games via Online Mirror Descent
    Bai, Yu
    Jin, Chi
    Mei, Song
    Song, Ziang
    Yu, Tiancheng
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35, NEURIPS 2022, 2022,
  • [6] Extensive-Form Correlated Equilibrium: Definition and Computational Complexity
    von Stengel, Bernhard
    Forges, Francoise
    MATHEMATICS OF OPERATIONS RESEARCH, 2008, 33 (04) : 1002 - 1022
  • [7] Computing an Extensive-Form Correlated Equilibrium in Polynomial Time
    Huang, Wan
    von Stengel, Bernhard
    INTERNET AND NETWORK ECONOMICS, PROCEEDINGS, 2008, 5385 : 506 - 513
  • [8] Decentralized No-Regret Learning Algorithms for Extensive-Form Correlated Equilibria (Extended Abstract)
    Celli, Andrea
    Marchesi, Alberto
    Farina, Gabriele
    Gatti, Nicola
    PROCEEDINGS OF THE THIRTIETH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2021, 2021, : 4755 - 4759
  • [9] Sample-Efficient Learning of Correlated Equilibria in Extensive-Form Games
    Song, Ziang
    Mei, Song
    Bai, Yu
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35, NEURIPS 2022, 2022,
  • [10] Optimistic Regret Minimization for Extensive-Form Games via Dilated Distance-Generating Functions
    Farina, Gabriele
    Kroer, Christian
    Sandholm, Tuomas
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32