Efficient Regret Minimization Algorithm for Extensive-Form Correlated Equilibrium

被引:0
|
作者
Farina, Gabriele [1 ]
Ling, Chun Kai [1 ]
Fang, Fei [2 ]
Sandholm, Tuomas [1 ,3 ,4 ,5 ]
机构
[1] Carnegie Mellon Univ, Dept Comp Sci, Pittsburgh, PA 15213 USA
[2] Carnegie Mellon Univ, Inst Software Res, Pittsburgh, PA 15213 USA
[3] Strateg Machine Inc, Morristown, NJ USA
[4] Strategy Robot Inc, Pittsburgh, PA USA
[5] Optimized Markets Inc, Pittsburgh, PA USA
基金
美国国家科学基金会;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Self-play methods based on regret minimization have become the state of the art for computing Nash equilibria in large two-players zero-sum extensive-form games. These methods fundamentally rely on the hierarchical structure of the players' sequential strategy spaces to construct a regret minimizer that recursively minimizes regret at each decision point in the game tree. In this paper, we introduce the first efficient regret minimization algorithm for computing extensive-form correlated equilibria in large two-player general-sum games with no chance moves. Designing such an algorithm is significantly more challenging than designing one for the Nash equilibrium counterpart, as the constraints that define the space of correlation plans lack the hierarchical structure and might even form cycles. We show that some of the constraints are redundant and can be excluded from consideration, and present an efficient algorithm that generates the space of extensive-form correlation plans incrementally from the remaining constraints. This structural decomposition is achieved via a special convexity-preserving operation that we coin scaled extension. We show that a regret minimizer can be designed for a scaled extension of any two convex sets, and that from the decomposition we then obtain a global regret minimizer. Our algorithm produces feasible iterates. Experiments show that it significantly outperforms prior approaches and for larger problems it is the only viable option.
引用
收藏
页数:11
相关论文
共 50 条
  • [41] ON SELF-ENFORCEMENT IN EXTENSIVE-FORM GAMES
    WEIBULL, JW
    GAMES AND ECONOMIC BEHAVIOR, 1992, 4 (03) : 450 - 462
  • [42] Team Correlated Equilibria in Zero-Sum Extensive-Form Games via Tree Decompositions
    Zhang, Brian Hu
    Sandholm, Tuomas
    THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / THE TWELVETH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, : 5252 - 5259
  • [43] Discretization of Continuous Action Spaces in Extensive-Form Games
    Kroer, Christian
    Sandholm, Tuomas
    PROCEEDINGS OF THE 2015 INTERNATIONAL CONFERENCE ON AUTONOMOUS AGENTS & MULTIAGENT SYSTEMS (AAMAS'15), 2015, : 47 - 56
  • [44] Prudent Rationalizability in Generalized Extensive-form Games with Unawareness
    Heifetz, Aviad
    Meier, Martin
    Schipper, Burkhard C.
    B E JOURNAL OF THEORETICAL ECONOMICS, 2021, 21 (02): : 525 - 556
  • [45] SET-THEORETIC EQUIVALENCE OF EXTENSIVE-FORM GAMES
    BONANNO, G
    INTERNATIONAL JOURNAL OF GAME THEORY, 1992, 20 (04) : 429 - 447
  • [46] Iterated Admissibility does not Refine Extensive-form Rationalizability
    Catonini, Emiliano
    ECONOMIC JOURNAL, 2024, 134 (663): : 3017 - 3026
  • [47] Solving Large Extensive-Form Games with Strategy Constraints
    Davis, Trevor
    Waugh, Kevin
    Bowling, Michael
    THIRTY-THIRD AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FIRST INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / NINTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2019, : 1861 - 1868
  • [48] Last-iterate Convergence in Extensive-Form Games
    Lee, Chung-Wei
    Kroer, Christian
    Luo, Haipeng
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
  • [49] A Unified Framework for Extensive-Form Game Abstraction with Bounds
    Kroer, Christian
    Sandholm, Tuomas
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 31 (NIPS 2018), 2018, 31
  • [50] Designing Learning Algorithms over the Sequence Form of an Extensive-Form Game
    Manino, Edoardo
    Gatti, Nicola
    Restelli, Marcello
    AAMAS'17: PROCEEDINGS OF THE 16TH INTERNATIONAL CONFERENCE ON AUTONOMOUS AGENTS AND MULTIAGENT SYSTEMS, 2017, : 1622 - 1624