Efficient Regret Minimization Algorithm for Extensive-Form Correlated Equilibrium

被引:0
|
作者
Farina, Gabriele [1 ]
Ling, Chun Kai [1 ]
Fang, Fei [2 ]
Sandholm, Tuomas [1 ,3 ,4 ,5 ]
机构
[1] Carnegie Mellon Univ, Dept Comp Sci, Pittsburgh, PA 15213 USA
[2] Carnegie Mellon Univ, Inst Software Res, Pittsburgh, PA 15213 USA
[3] Strateg Machine Inc, Morristown, NJ USA
[4] Strategy Robot Inc, Pittsburgh, PA USA
[5] Optimized Markets Inc, Pittsburgh, PA USA
基金
美国国家科学基金会;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Self-play methods based on regret minimization have become the state of the art for computing Nash equilibria in large two-players zero-sum extensive-form games. These methods fundamentally rely on the hierarchical structure of the players' sequential strategy spaces to construct a regret minimizer that recursively minimizes regret at each decision point in the game tree. In this paper, we introduce the first efficient regret minimization algorithm for computing extensive-form correlated equilibria in large two-player general-sum games with no chance moves. Designing such an algorithm is significantly more challenging than designing one for the Nash equilibrium counterpart, as the constraints that define the space of correlation plans lack the hierarchical structure and might even form cycles. We show that some of the constraints are redundant and can be excluded from consideration, and present an efficient algorithm that generates the space of extensive-form correlation plans incrementally from the remaining constraints. This structural decomposition is achieved via a special convexity-preserving operation that we coin scaled extension. We show that a regret minimizer can be designed for a scaled extension of any two convex sets, and that from the decomposition we then obtain a global regret minimizer. Our algorithm produces feasible iterates. Experiments show that it significantly outperforms prior approaches and for larger problems it is the only viable option.
引用
收藏
页数:11
相关论文
共 50 条
  • [21] RATIONALITY IN EXTENSIVE-FORM GAMES
    RENY, PJ
    JOURNAL OF ECONOMIC PERSPECTIVES, 1992, 6 (04): : 103 - 118
  • [22] Timeability of Extensive-Form Games
    Jakobsen, Sune K.
    Sorensen, Troels B.
    Conitzer, Vincent
    ITCS'16: PROCEEDINGS OF THE 2016 ACM CONFERENCE ON INNOVATIONS IN THEORETICAL COMPUTER SCIENCE, 2016, : 191 - 199
  • [23] Computational Extensive-Form Games
    Halpern, Joseph Y.
    Pass, Rafael
    Seeman, Lior
    EC'16: PROCEEDINGS OF THE 2016 ACM CONFERENCE ON ECONOMICS AND COMPUTATION, 2016, : 681 - 698
  • [24] Quantum extensive-form games
    Kazuki Ikeda
    Quantum Information Processing, 22
  • [25] Sequence-Form Algorithm for Computing Stackelberg Equilibria in Extensive-Form Games
    Bosansky, Branislav
    Cermak, Jiri
    PROCEEDINGS OF THE TWENTY-NINTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2015, : 805 - 811
  • [26] Quantum extensive-form games
    Ikeda, Kazuki
    QUANTUM INFORMATION PROCESSING, 2023, 22 (01)
  • [27] Extensive-Form Perfect Equilibrium Computation in Two-Player Games
    Farina, Gabriele
    Gatti, Nicola
    THIRTY-FIRST AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2017, : 502 - 508
  • [28] Strategic negotiations for extensive-form games
    Dave de Jonge
    Dongmo Zhang
    Autonomous Agents and Multi-Agent Systems, 2020, 34
  • [29] Coarse Correlation in Extensive-Form Games
    Farina, Gabriele
    Bianchi, Tommaso
    Sandholm, Tuomas
    THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 1934 - 1941
  • [30] Strategic negotiations for extensive-form games
    de Jonge, Dave
    Zhang, Dongmo
    AUTONOMOUS AGENTS AND MULTI-AGENT SYSTEMS, 2020, 34 (01)