Faster algorithms for extensive-form game solving via improved smoothing functions

Authors
Christian Kroer
Kevin Waugh
Fatma Kılınç-Karzan
Tuomas Sandholm
Affiliations
[1] Carnegie Mellon University,Computer Science Department
[2] University of Alberta,Department of Computing Science
[3] Carnegie Mellon University,Tepper School of Business
Source
Mathematical Programming | 2020 / Volume 179
Keywords
Extensive-form game; Bilinear saddle-point problem; First-order method; Nash equilibrium; Zero-sum game; 91A05; 91A18; 90C06; 90C25; 90C47; 65K05; 52A41
DOI
Not available
Abstract
Sparse iterative methods, in particular first-order methods, are known to be among the most effective in solving large-scale two-player zero-sum extensive-form games. The convergence rates of these methods depend heavily on the properties of the distance-generating function that they are based on. We investigate both the theoretical and practical performance improvement of first-order methods (FOMs) for solving extensive-form games through better design of the dilated entropy function, a class of distance-generating functions related to the domains associated with extensive-form games. By introducing a new weighting scheme for the dilated entropy function, we develop the first distance-generating function for the strategy spaces of sequential games that has only a logarithmic dependence on the branching factor of the player. This result improves the overall convergence rate of several FOMs working with the dilated entropy function by a factor of $\Omega(b^d d)$, where $b$ is the branching factor of the player and $d$ is the depth of the game tree. Thus far, counterfactual regret minimization methods have been faster in practice, and more popular, than FOMs despite their theoretically inferior convergence rates. Using our new weighting scheme and a practical parameter tuning procedure, we show that, for the first time, the excessive gap technique, a classical FOM, can be made faster than the counterfactual regret minimization algorithm in practice for large games, and that the aggressive stepsize scheme of CFR+ is the only reason that the algorithm is faster in practice.
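For readers unfamiliar with the dilated entropy function named in the abstract, the following is a minimal sketch of the general construction: a weighted sum, over a player's information sets, of the local negative entropy scaled by the probability mass of the parent sequence. The data layout (`infosets` as `(parent, children)` index pairs into a sequence-form vector `x`, with index 0 as the empty sequence) and the function name `dilated_entropy` are assumptions made for illustration. The weights passed in are placeholders, not the weighting scheme developed in the paper, which this record does not state.

```python
import math

def dilated_entropy(x, infosets, weights):
    """Weighted dilated (negative) entropy of a sequence-form strategy.

    x        : sequence-form vector; x[0] is the empty sequence and equals 1.
    infosets : list of (parent, children) pairs, where `parent` is the index
               of the sequence leading to the information set and `children`
               are the indices of the sequences for its actions.
    weights  : one positive weight per information set (placeholder values;
               NOT the weighting scheme from the paper).
    """
    total = 0.0
    for (parent, children), beta in zip(infosets, weights):
        mass = x[parent]
        if mass <= 0.0:
            continue  # unreachable information set contributes nothing
        for i in children:
            if x[i] > 0.0:  # convention: 0 * log(0) = 0
                # x[i] * log(x[i] / mass) is the local negative entropy
                # term scaled by the parent's probability mass.
                total += beta * x[i] * math.log(x[i] / mass)
    return total

# Toy treeplex: a root information set with two actions (sequences 1, 2)
# and a second information set, reached after sequence 1, with two
# actions (sequences 3, 4).
infosets = [(0, [1, 2]), (1, [3, 4])]

x_uniform = [1.0, 0.5, 0.5, 0.25, 0.25]  # uniform at both information sets
x_pure = [1.0, 1.0, 0.0, 1.0, 0.0]       # deterministic strategy

value_uniform = dilated_entropy(x_uniform, infosets, weights=[1.0, 1.0])
value_pure = dilated_entropy(x_pure, infosets, weights=[1.0, 1.0])
```

With unit weights, the uniform strategy gives the most negative value on this toy tree and a deterministic strategy gives zero. Changing `weights` rescales each information set's contribution, which is the design knob the abstract refers to.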
Pages: 385-417
Page count: 32