Further Developments of Extensive-Form Replicator Dynamics using the Sequence-Form Representation

被引:0
|
作者
Lanctot, Marc [1 ]
机构
[1] Maastricht Univ, Dept Knowledge Engn, POB 616, NL-6200 MD Maastricht, Netherlands
关键词
Replicator dynamics; game theory; extensive-form games; multiplayer; Nash equilibrium; sequence form;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Replicator dynamics model the interactions and change in large populations of agents. Standard replicator dynamics, however, only allow single action interactions between agents. Complex interactions, as modeled by general extensive games, have received comparatively little attention in this setting. Recently, replicator dynamics have been adapted to extensive-form games represented in sequence form, leading to a large reduction in computational resource requirements. In this paper, we first show that sequenceform constraints and realization equivalence to standard replicator dynamics are maintained in general n-player games. We show that sequence-form replicator dynamics can minimize regret, leading to equilibrium convergence guarantees in two-player zero-sum games. We provide the first empirical evaluation of sequence-form replicator dynamics, applied to n-player Kuhn poker with two, three, and four players. Our results show that the average strategies generated by sequence-form replicator dynamics produce approximate equilibrium strategies with increasing accuracy over time.
引用
收藏
页码:1257 / 1264
页数:8
相关论文
共 50 条
  • [41] Converting MA-PDDL to Extensive-Form Games
    Kovacs, Daniel L.
    Dobrowiecki, Tadeusz P.
    ACTA POLYTECHNICA HUNGARICA, 2013, 10 (08) : 27 - 47
  • [42] Regret-Based Pruning in Extensive-Form Games
    Brown, Noam
    Sandholm, Tuomas
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 28 (NIPS 2015), 2015, 28
  • [43] Theoretical and Practical Advances on Smoothing for Extensive-Form Games
    Kroer, Christian
    Waugh, Kevin
    Kilinc-Karzan, Fatma
    Sandholm, Tuomas
    EC'17: PROCEEDINGS OF THE 2017 ACM CONFERENCE ON ECONOMICS AND COMPUTATION, 2017, : 693 - 693
  • [44] Extensive-form games with heterogeneous populations: solution concepts, equilibria characterization, learning dynamics
    Gatti, Nicola
    Panozzo, Fabio
    Restelli, Marcello
    INTELLIGENZA ARTIFICIALE, 2016, 10 (01) : 19 - 31
  • [45] Safe Search for Stackelberg Equilibria in Extensive-Form Games
    Ling, Chun Kai
    Brown, Noam
    THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 5541 - 5548
  • [46] On the Outcome Equivalence of Extensive-Form and Behavioral Correlated Equilibria
    Zhang, Brian Hu
    Sandholm, Tuomas
    THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 9, 2024, : 9969 - 9976
  • [47] Incremental Strategy Generation for Stackelberg Equilibria in Extensive-Form Games
    Cerny, Jakub
    Bosansky, Branislav
    Kiekintveld, Christopher
    ACM EC'18: PROCEEDINGS OF THE 2018 ACM CONFERENCE ON ECONOMICS AND COMPUTATION, 2018, : 151 - 168
  • [48] Near-Optimal Φ-Regret Learning in Extensive-Form Games
    Anagnostides, Ioannis
    Farina, Gabriele
    Sandholm, Tuomas
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 202, 2023, 202 : 814 - 839
  • [49] Faster Optimistic Online Mirror Descent for Extensive-Form Games
    Jiang, Huacong
    Liu, Weiming
    Li, Bin
    PRICAI 2022: TRENDS IN ARTIFICIAL INTELLIGENCE, PT I, 2022, 13629 : 90 - 103
  • [50] Efficient Regret Minimization Algorithm for Extensive-Form Correlated Equilibrium
    Farina, Gabriele
    Ling, Chun Kai
    Fang, Fei
    Sandholm, Tuomas
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32