No-Regret Learning Dynamics for Extensive-Form Correlated Equilibrium

被引:0
|
作者
Celli, Andrea [1 ]
Marchesi, Alberto [1 ]
Farina, Gabriele [2 ]
Gatti, Nicola [1 ]
机构
[1] Politecn Milan, Milan, Italy
[2] Carnegie Mellon Univ, Pittsburgh, PA USA
基金
美国国家科学基金会;
关键词
COMPLEXITY;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The existence of simple, uncoupled no-regret dynamics that converge to correlated equilibria in normal-form games is a celebrated result in the theory of multi-agent systems. Specifically, it has been known for more than 20 years that when all players seek to minimize their internal regret in a repeated normal-form game, the empirical frequency of play converges to a normal-form correlated equilibrium. Extensive-form (that is, tree-form) games generalize normal-form games by modeling both sequential and simultaneous moves, as well as private information. Because of the sequential nature and presence of partial information in the game, extensive-form correlation has significantly different properties than the normal-form counterpart, many of which are still open research directions. Extensive-form correlated equilibrium (EFCE) has been proposed as the natural extensive-form counterpart to normal-form correlated equilibrium. However, it was currently unknown whether EFCE emerges as the result of uncoupled agent dynamics. In this paper, we give the first uncoupled no-regret dynamics that converge to the set of EFCEs in n-player general-sum extensive-form games with perfect recall. First, we introduce a notion of trigger regret in extensive-form games, which extends that of internal regret in normal-form games. When each player has low trigger regret, the empirical frequency of play is close to an EFCE. Then, we give an efficient no-trigger-regret algorithm. Our algorithm decomposes trigger regret into local subproblems at each decision point for the player, and constructs a global strategy of the player from the local solutions at each decision point.
引用
收藏
页数:11
相关论文
共 50 条
  • [41] Strategic negotiations for extensive-form games
    de Jonge, Dave
    Zhang, Dongmo
    AUTONOMOUS AGENTS AND MULTI-AGENT SYSTEMS, 2020, 34 (01)
  • [42] On the convergence of no-regret learning in selfish routing
    Krichene, Walid
    Drighes, Benjamin
    Bayen, Alexandre
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 32 (CYCLE 2), 2014, 32 : 163 - 171
  • [43] No-Regret Learning in Dynamic Stackelberg Games
    Lauffer, Niklas
    Ghasemi, Mahsa
    Hashemi, Abolfazl
    Savas, Yagiz
    Topcu, Ufuk
    IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2024, 69 (03) : 1418 - 1431
  • [44] Unifying convergence and no-regret in multiagent learning
    Banerjee, Bikramjit
    Peng, Jing
    LEARNING AND ADAPTION IN MULTI-AGENT SYSTEMS, 2006, 3898 : 100 - 114
  • [45] Further Developments of Extensive-Form Replicator Dynamics using the Sequence-Form Representation
    Lanctot, Marc
    AAMAS'14: PROCEEDINGS OF THE 2014 INTERNATIONAL CONFERENCE ON AUTONOMOUS AGENTS & MULTIAGENT SYSTEMS, 2014, : 1257 - 1264
  • [46] Near-Optimal No-Regret Learning Dynamics for General Convex Games
    Farina, Gabriele
    Anagnostides, Ioannis
    Luo, Haipeng
    Lee, Chung-Wei
    Kroer, Christian
    Sandholm, Tuomas
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35, NEURIPS 2022, 2022,
  • [47] Extensive-form games and strategic complementarities
    Echenique, F
    GAMES AND ECONOMIC BEHAVIOR, 2004, 46 (02) : 348 - 364
  • [48] Variance Decompositions for Extensive-Form Games
    Cloud, Alex
    Laber, Eric B.
    2021 IEEE CONFERENCE ON GAMES (COG), 2021, : 380 - 387
  • [49] RATIONAL BEHAVIOR IN EXTENSIVE-FORM GAMES
    RENY, PJ
    CANADIAN JOURNAL OF ECONOMICS-REVUE CANADIENNE D ECONOMIQUE, 1995, 28 (01): : 1 - 16
  • [50] Stability and trembles in extensive-form games
    Heller, Yuval
    GAMES AND ECONOMIC BEHAVIOR, 2014, 84 : 132 - 136