No-Regret Learning Dynamics for Extensive-Form Correlated Equilibrium

被引:0
|
作者
Celli, Andrea [1 ]
Marchesi, Alberto [1 ]
Farina, Gabriele [2 ]
Gatti, Nicola [1 ]
机构
[1] Politecn Milan, Milan, Italy
[2] Carnegie Mellon Univ, Pittsburgh, PA USA
基金
美国国家科学基金会;
关键词
COMPLEXITY;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The existence of simple, uncoupled no-regret dynamics that converge to correlated equilibria in normal-form games is a celebrated result in the theory of multi-agent systems. Specifically, it has been known for more than 20 years that when all players seek to minimize their internal regret in a repeated normal-form game, the empirical frequency of play converges to a normal-form correlated equilibrium. Extensive-form (that is, tree-form) games generalize normal-form games by modeling both sequential and simultaneous moves, as well as private information. Because of the sequential nature and presence of partial information in the game, extensive-form correlation has significantly different properties than the normal-form counterpart, many of which are still open research directions. Extensive-form correlated equilibrium (EFCE) has been proposed as the natural extensive-form counterpart to normal-form correlated equilibrium. However, it was currently unknown whether EFCE emerges as the result of uncoupled agent dynamics. In this paper, we give the first uncoupled no-regret dynamics that converge to the set of EFCEs in n-player general-sum extensive-form games with perfect recall. First, we introduce a notion of trigger regret in extensive-form games, which extends that of internal regret in normal-form games. When each player has low trigger regret, the empirical frequency of play is close to an EFCE. Then, we give an efficient no-trigger-regret algorithm. Our algorithm decomposes trigger regret into local subproblems at each decision point for the player, and constructs a global strategy of the player from the local solutions at each decision point.
引用
收藏
页数:11
相关论文
共 50 条
  • [1] Simple Uncoupled No-regret Learning Dynamics for Extensive-form Correlated Equilibrium
    Farina, Gabriele
    Celli, Andrea
    Marchesi, Alberto
    Gatti, Nicola
    JOURNAL OF THE ACM, 2022, 69 (06)
  • [2] Decentralized No-Regret Learning Algorithms for Extensive-Form Correlated Equilibria (Extended Abstract)
    Celli, Andrea
    Marchesi, Alberto
    Farina, Gabriele
    Gatti, Nicola
    PROCEEDINGS OF THE THIRTIETH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2021, 2021, : 4755 - 4759
  • [3] Efficient Regret Minimization Algorithm for Extensive-Form Correlated Equilibrium
    Farina, Gabriele
    Ling, Chun Kai
    Fang, Fei
    Sandholm, Tuomas
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32
  • [4] Extensive-Form Correlated Equilibrium: Definition and Computational Complexity
    von Stengel, Bernhard
    Forges, Francoise
    MATHEMATICS OF OPERATIONS RESEARCH, 2008, 33 (04) : 1002 - 1022
  • [5] Computing an Extensive-Form Correlated Equilibrium in Polynomial Time
    Huang, Wan
    von Stengel, Bernhard
    INTERNET AND NETWORK ECONOMICS, PROCEEDINGS, 2008, 5385 : 506 - 513
  • [6] Near-Optimal Φ-Regret Learning in Extensive-Form Games
    Anagnostides, Ioannis
    Farina, Gabriele
    Sandholm, Tuomas
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 202, 2023, 202 : 814 - 839
  • [7] Stochastic Regret Minimization in Extensive-Form Games
    Farina, Gabriele
    Kroer, Christian
    Sandholm, Tuomas
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 119, 2020, 119
  • [8] Stochastic Regret Minimization in Extensive-Form Games
    Farina, Gabriele
    Kroer, Christian
    Sandholm, Tuomas
    25TH AMERICAS CONFERENCE ON INFORMATION SYSTEMS (AMCIS 2019), 2019,
  • [9] Regret-Based Pruning in Extensive-Form Games
    Brown, Noam
    Sandholm, Tuomas
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 28 (NIPS 2015), 2015, 28
  • [10] Sample-Efficient Learning of Correlated Equilibria in Extensive-Form Games
    Song, Ziang
    Mei, Song
    Bai, Yu
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35, NEURIPS 2022, 2022,