Faster algorithms for extensive-form game solving via improved smoothing functions

Cited by: 0

Authors:

Christian Kroer

Kevin Waugh

Fatma Kılınç-Karzan

Tuomas Sandholm

Affiliations:

[1] Carnegie Mellon University, Computer Science Department

[2] University of Alberta, Department of Computing Science

[3] Carnegie Mellon University, Tepper School of Business

Source:

Mathematical Programming | 2020 / Vol. 179

Keywords:

Extensive-form game; Bilinear saddle-point problem; First-order method; Nash equilibrium; Zero-sum game; 91A05; 91A18; 90C06; 90C25; 90C47; 65K05; 52A41;

DOI:

Not available

Chinese Library Classification number:

Subject classification number:

Abstract:

Sparse iterative methods, in particular first-order methods, are known to be among the most effective in solving large-scale two-player zero-sum extensive-form games. The convergence rates of these methods depend heavily on the properties of the distance-generating function that they are based on. We investigate both the theoretical and practical performance improvement of first-order methods (FOMs) for solving extensive-form games through better design of the dilated entropy function, a class of distance-generating functions related to the domains associated with extensive-form games. By introducing a new weighting scheme for the dilated entropy function, we develop the first distance-generating function for the strategy spaces of sequential games that has only a logarithmic dependence on the branching factor of the player. This result improves the overall convergence rate of several FOMs working with the dilated entropy function by a factor of $\Omega(b^d d)$, where $b$ is the branching factor of the player and $d$ is the depth of the game tree. Thus far, counterfactual regret minimization methods have been faster in practice, and more popular, than FOMs despite their theoretically inferior convergence rates. Using our new weighting scheme and a practical parameter tuning procedure, we show that, for the first time, the excessive gap technique, a classical FOM, can be made faster than the counterfactual regret minimization algorithm in practice for large games, and that the aggressive stepsize scheme of CFR+ is the only reason that CFR+ is faster in practice.

Pages: 385-417

Page count: 32
