No-Regret Learning Dynamics for Extensive-Form Correlated Equilibrium

被引:0
|
作者
Celli, Andrea [1 ]
Marchesi, Alberto [1 ]
Farina, Gabriele [2 ]
Gatti, Nicola [1 ]
机构
[1] Politecn Milan, Milan, Italy
[2] Carnegie Mellon Univ, Pittsburgh, PA USA
基金
美国国家科学基金会;
关键词
COMPLEXITY;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The existence of simple, uncoupled no-regret dynamics that converge to correlated equilibria in normal-form games is a celebrated result in the theory of multi-agent systems. Specifically, it has been known for more than 20 years that when all players seek to minimize their internal regret in a repeated normal-form game, the empirical frequency of play converges to a normal-form correlated equilibrium. Extensive-form (that is, tree-form) games generalize normal-form games by modeling both sequential and simultaneous moves, as well as private information. Because of the sequential nature and presence of partial information in the game, extensive-form correlation has significantly different properties than the normal-form counterpart, many of which are still open research directions. Extensive-form correlated equilibrium (EFCE) has been proposed as the natural extensive-form counterpart to normal-form correlated equilibrium. However, it was currently unknown whether EFCE emerges as the result of uncoupled agent dynamics. In this paper, we give the first uncoupled no-regret dynamics that converge to the set of EFCEs in n-player general-sum extensive-form games with perfect recall. First, we introduce a notion of trigger regret in extensive-form games, which extends that of internal regret in normal-form games. When each player has low trigger regret, the empirical frequency of play is close to an EFCE. Then, we give an efficient no-trigger-regret algorithm. Our algorithm decomposes trigger regret into local subproblems at each decision point for the player, and constructs a global strategy of the player from the local solutions at each decision point.
引用
收藏
页数:11
相关论文
共 50 条
  • [21] Using Correlated Strategies for Computing Stackelberg Equilibria in Extensive-Form Games
    Cermak, Jiri
    Bosansky, Branislav
    Durkota, Karel
    Lisy, Viliam
    Kiekintveld, Christopher
    THIRTIETH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2016, : 439 - 445
  • [22] No-regret dynamics and fictitious play
    Viossat, Yannick
    Zapechelnyuk, Andriy
    JOURNAL OF ECONOMIC THEORY, 2013, 148 (02) : 825 - 842
  • [23] Designing Learning Algorithms over the Sequence Form of an Extensive-Form Game
    Manino, Edoardo
    Gatti, Nicola
    Restelli, Marcello
    AAMAS'17: PROCEEDINGS OF THE 16TH INTERNATIONAL CONFERENCE ON AUTONOMOUS AGENTS AND MULTIAGENT SYSTEMS, 2017, : 1622 - 1624
  • [24] RATIONALITY IN EXTENSIVE-FORM GAMES
    RENY, PJ
    JOURNAL OF ECONOMIC PERSPECTIVES, 1992, 6 (04): : 103 - 118
  • [25] Timeability of Extensive-Form Games
    Jakobsen, Sune K.
    Sorensen, Troels B.
    Conitzer, Vincent
    ITCS'16: PROCEEDINGS OF THE 2016 ACM CONFERENCE ON INNOVATIONS IN THEORETICAL COMPUTER SCIENCE, 2016, : 191 - 199
  • [26] Computational Extensive-Form Games
    Halpern, Joseph Y.
    Pass, Rafael
    Seeman, Lior
    EC'16: PROCEEDINGS OF THE 2016 ACM CONFERENCE ON ECONOMICS AND COMPUTATION, 2016, : 681 - 698
  • [27] Quantum extensive-form games
    Kazuki Ikeda
    Quantum Information Processing, 22
  • [28] No-Regret Learning in Bayesian Games
    Hartline, Jason
    Syrgkanis, Vasilis
    Tardos, Eva
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 28 (NIPS 2015), 2015, 28
  • [29] Convergence of best-response dynamics in extensive-form games
    Xu, Zibo
    JOURNAL OF ECONOMIC THEORY, 2016, 162 : 21 - 54
  • [30] On the Convergence of No-Regret Learning Dynamics in Time-Varying Games
    Anagnostides, Ioannis
    Panageas, Ioannis
    Farina, Gabriele
    Sandholm, Tuomas
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,