Efficient Regret Minimization Algorithm for Extensive-Form Correlated Equilibrium

被引:0
|
作者
Farina, Gabriele [1 ]
Ling, Chun Kai [1 ]
Fang, Fei [2 ]
Sandholm, Tuomas [1 ,3 ,4 ,5 ]
机构
[1] Carnegie Mellon Univ, Dept Comp Sci, Pittsburgh, PA 15213 USA
[2] Carnegie Mellon Univ, Inst Software Res, Pittsburgh, PA 15213 USA
[3] Strateg Machine Inc, Morristown, NJ USA
[4] Strategy Robot Inc, Pittsburgh, PA USA
[5] Optimized Markets Inc, Pittsburgh, PA USA
基金
美国国家科学基金会;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Self-play methods based on regret minimization have become the state of the art for computing Nash equilibria in large two-players zero-sum extensive-form games. These methods fundamentally rely on the hierarchical structure of the players' sequential strategy spaces to construct a regret minimizer that recursively minimizes regret at each decision point in the game tree. In this paper, we introduce the first efficient regret minimization algorithm for computing extensive-form correlated equilibria in large two-player general-sum games with no chance moves. Designing such an algorithm is significantly more challenging than designing one for the Nash equilibrium counterpart, as the constraints that define the space of correlation plans lack the hierarchical structure and might even form cycles. We show that some of the constraints are redundant and can be excluded from consideration, and present an efficient algorithm that generates the space of extensive-form correlation plans incrementally from the remaining constraints. This structural decomposition is achieved via a special convexity-preserving operation that we coin scaled extension. We show that a regret minimizer can be designed for a scaled extension of any two convex sets, and that from the decomposition we then obtain a global regret minimizer. Our algorithm produces feasible iterates. Experiments show that it significantly outperforms prior approaches and for larger problems it is the only viable option.
引用
收藏
页数:11
相关论文
共 50 条
  • [31] Efficient Deviation Types and Learning for Hindsight Rationality in Extensive-Form Games
    Morrill, Dustin
    D'Orazio, Ryan
    Lanctot, Marc
    Wright, James R.
    Bowling, Michael
    Greenwald, Amy R.
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 139, 2021, 139
  • [32] Extensive-form games and strategic complementarities
    Echenique, F
    GAMES AND ECONOMIC BEHAVIOR, 2004, 46 (02) : 348 - 364
  • [33] Variance Decompositions for Extensive-Form Games
    Cloud, Alex
    Laber, Eric B.
    2021 IEEE CONFERENCE ON GAMES (COG), 2021, : 380 - 387
  • [34] RATIONAL BEHAVIOR IN EXTENSIVE-FORM GAMES
    RENY, PJ
    CANADIAN JOURNAL OF ECONOMICS-REVUE CANADIENNE D ECONOMIQUE, 1995, 28 (01): : 1 - 16
  • [35] Stability and trembles in extensive-form games
    Heller, Yuval
    GAMES AND ECONOMIC BEHAVIOR, 2014, 84 : 132 - 136
  • [36] Rational Play in Extensive-Form Games
    Bonanno, Giacomo
    GAMES, 2022, 13 (06):
  • [37] Safe Subgame Resolving for Extensive Form Correlated Equilibrium
    Ling, Chun Kai
    Fang, Fei
    THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / THE TWELVETH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, : 5116 - 5123
  • [38] Equilibrium in justifiable strategies: A model of reason-based choice in extensive-form games
    Spiegler, R
    REVIEW OF ECONOMIC STUDIES, 2002, 69 (03): : 691 - 706
  • [39] An Algorithm for Constructing and Solving Imperfect Recall Abstractions of Large Extensive-Form Games
    Cermak, Jiri
    Bosansky, Branislav
    Lisy, Viliam
    PROCEEDINGS OF THE TWENTY-SIXTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2017, : 936 - 942
  • [40] Equilibrium Finding in Normal-Form Games via Greedy Regret Minimization
    Zhang, Hugh
    Lerer, Adam
    Brown, Noam
    THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / TWELVETH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, : 9484 - 9492