Faster Optimistic Online Mirror Descent for Extensive-Form Games

被引:0
|
作者
Jiang, Huacong [1 ]
Liu, Weiming [1 ]
Li, Bin [1 ]
机构
[1] Univ Sci & Technol China, Hefei, Peoples R China
基金
中国国家自然科学基金;
关键词
Adaptive optimistic online mirror descent; Extensive-form games; Nash equilibrium; Counterfactual regret minimization; POKER;
D O I
10.1007/978-3-031-20862-1_7
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Online Mirror Descent (OMD) is a kind of regret minimization algorithms for Online Convex Optimization (OCO). Recently, they are applied to solve Extensive-Form Games (EFGs) for approximating Nash equilibrium. Especially, optimistic variants of OMD are developed, which have a better theoretical convergence rate compared to common regret minimization algorithms, e.g., Counterfactual Regret Minimization (CFR), for EFGs. However, despite the theoretical advantage, existing OMD and their optimistic variants have been shown to converge to a Nash equilibrium slower than the state-of-the-art (SOTA) CFR variants in practice. The reason for the inferior performance may be that they usually use constant regularizers whose parameters have to be chosen at the beginning. Inspired by the adaptive nature of CFRs, in this paper, an adaptive method is presented to speed up the optimistic variants of OMD. Based on this method, Adaptive Optimistic OMD (Ada-OOMD) for EFGs is proposed. In this algorithm, the regularizers can adapt to real-time regrets, thus the algorithm may converge faster in practice. Experimental results show that Ada-OOMD is at least two orders of magnitude faster than existing optimistic OMD algorithms. In some extensive-form games, such as Kuhn poker and Goofspiel, the convergence speed of Ada-OOMD even exceeds the SOTA CFRs. https://github.com/github-jhc/ada-oomd
引用
收藏
页码:90 / 103
页数:14
相关论文
共 50 条
  • [21] Polytope-form games and index/degree theories for extensive-form games
    Pahl, Lucas
    GAMES AND ECONOMIC BEHAVIOR, 2023, 141 : 444 - 471
  • [22] Discretization of Continuous Action Spaces in Extensive-Form Games
    Kroer, Christian
    Sandholm, Tuomas
    PROCEEDINGS OF THE 2015 INTERNATIONAL CONFERENCE ON AUTONOMOUS AGENTS & MULTIAGENT SYSTEMS (AAMAS'15), 2015, : 47 - 56
  • [23] XDO: A Double Oracle Algorithm for Extensive-Form Games
    McAleer, Stephen
    Lanier, John
    Wang, Kevin A.
    Baldi, Pierre
    Fox, Roy
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021,
  • [24] Prudent Rationalizability in Generalized Extensive-form Games with Unawareness
    Heifetz, Aviad
    Meier, Martin
    Schipper, Burkhard C.
    B E JOURNAL OF THEORETICAL ECONOMICS, 2021, 21 (02): : 525 - 556
  • [25] SET-THEORETIC EQUIVALENCE OF EXTENSIVE-FORM GAMES
    BONANNO, G
    INTERNATIONAL JOURNAL OF GAME THEORY, 1992, 20 (04) : 429 - 447
  • [26] Computing Quantal Stackelberg Equilibrium in Extensive-Form Games
    Cerny, Jakub
    Lisy, Viliam
    Bosansky, Branislav
    An, Bo
    THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 5260 - 5268
  • [27] Solving Large Extensive-Form Games with Strategy Constraints
    Davis, Trevor
    Waugh, Kevin
    Bowling, Michael
    THIRTY-THIRD AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FIRST INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / NINTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2019, : 1861 - 1868
  • [28] Last-iterate Convergence in Extensive-Form Games
    Lee, Chung-Wei
    Kroer, Christian
    Luo, Haipeng
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
  • [29] Computational Results for Extensive-Form Adversarial Team Games
    Celli, Andrea
    Gatti, Nicola
    THIRTY-SECOND AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTIETH INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / EIGHTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2018, : 965 - 972
  • [30] Fictitious Self-Play in Extensive-Form Games
    Heinrich, Johannes
    Lanctot, Marc
    Silver, David
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 37, 2015, 37 : 805 - 813