No-Regret Learning Dynamics for Extensive-Form Correlated Equilibrium

被引:0
|
作者
Celli, Andrea [1 ]
Marchesi, Alberto [1 ]
Farina, Gabriele [2 ]
Gatti, Nicola [1 ]
机构
[1] Politecn Milan, Milan, Italy
[2] Carnegie Mellon Univ, Pittsburgh, PA USA
基金
美国国家科学基金会;
关键词
COMPLEXITY;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The existence of simple, uncoupled no-regret dynamics that converge to correlated equilibria in normal-form games is a celebrated result in the theory of multi-agent systems. Specifically, it has been known for more than 20 years that when all players seek to minimize their internal regret in a repeated normal-form game, the empirical frequency of play converges to a normal-form correlated equilibrium. Extensive-form (that is, tree-form) games generalize normal-form games by modeling both sequential and simultaneous moves, as well as private information. Because of the sequential nature and presence of partial information in the game, extensive-form correlation has significantly different properties than the normal-form counterpart, many of which are still open research directions. Extensive-form correlated equilibrium (EFCE) has been proposed as the natural extensive-form counterpart to normal-form correlated equilibrium. However, it was currently unknown whether EFCE emerges as the result of uncoupled agent dynamics. In this paper, we give the first uncoupled no-regret dynamics that converge to the set of EFCEs in n-player general-sum extensive-form games with perfect recall. First, we introduce a notion of trigger regret in extensive-form games, which extends that of internal regret in normal-form games. When each player has low trigger regret, the empirical frequency of play is close to an EFCE. Then, we give an efficient no-trigger-regret algorithm. Our algorithm decomposes trigger regret into local subproblems at each decision point for the player, and constructs a global strategy of the player from the local solutions at each decision point.
引用
收藏
页数:11
相关论文
共 50 条
  • [31] Quantum extensive-form games
    Ikeda, Kazuki
    QUANTUM INFORMATION PROCESSING, 2023, 22 (01)
  • [32] Extensive-Form Perfect Equilibrium Computation in Two-Player Games
    Farina, Gabriele
    Gatti, Nicola
    THIRTY-FIRST AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2017, : 502 - 508
  • [33] Efficient Phi-Regret Minimization in Extensive-Form Games via Online Mirror Descent
    Bai, Yu
    Jin, Chi
    Mei, Song
    Song, Ziang
    Yu, Tiancheng
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35, NEURIPS 2022, 2022,
  • [34] Logit Learning by Valuation in Extensive-Form Games with Simultaneous Moves
    Castiglione, Jason
    Arslan, Gurdal
    2023 62ND IEEE CONFERENCE ON DECISION AND CONTROL, CDC, 2023, : 1213 - 1218
  • [35] Optimistic Regret Minimization for Extensive-Form Games via Dilated Distance-Generating Functions
    Farina, Gabriele
    Kroer, Christian
    Sandholm, Tuomas
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32
  • [36] Acceleration through Optimistic No-Regret Dynamics
    Wang, Jun-Kun
    Abernethy, Jacob
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 31 (NIPS 2018), 2018, 31
  • [37] Strategic negotiations for extensive-form games
    Dave de Jonge
    Dongmo Zhang
    Autonomous Agents and Multi-Agent Systems, 2020, 34
  • [38] Coarse Correlation in Extensive-Form Games
    Farina, Gabriele
    Bianchi, Tommaso
    Sandholm, Tuomas
    THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 1934 - 1941
  • [39] No-Regret Learning Supports Voters' Competence
    Spelda, Petr
    Stritecky, Vit
    Symons, John
    SOCIAL EPISTEMOLOGY, 2024, 38 (05) : 543 - 559
  • [40] Limits and limitations of no-regret learning in games
    Monnot, Barnabe
    Piliouras, Georgios
    KNOWLEDGE ENGINEERING REVIEW, 2017, 32