Safe and Nested Subgame Solving for Imperfect-Information Games

Cited by: 0

Authors:

Brown, Noam [1]

Sandholm, Tuomas [1]

Affiliation:

[1] Carnegie Mellon Univ, Comp Sci Dept, Pittsburgh, PA 15217 USA

Source:

ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 30 (NIPS 2017) | 2017, Vol. 30

Funding:

Andrew W. Mellon Foundation (USA); National Science Foundation (USA)

Keywords:

DOI:

Not available

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory]

Discipline classification codes:

081104; 0812; 0835; 1405

Abstract:

In imperfect-information games, the optimal strategy in a subgame may depend on the strategy in other, unreached subgames. Thus a subgame cannot be solved in isolation and must instead consider the strategy for the entire game as a whole, unlike perfect-information games. Nevertheless, it is possible to first approximate a solution for the whole game and then improve it in individual subgames. This is referred to as subgame solving. We introduce subgame-solving techniques that outperform prior methods both in theory and practice. We also show how to adapt them, and past subgame-solving techniques, to respond to opponent actions that are outside the original action abstraction; this significantly outperforms the prior state-of-the-art approach, action translation. Finally, we show that subgame solving can be repeated as the game progresses down the game tree, leading to far lower exploitability. These techniques were a key component of Libratus, the first AI to defeat top humans in heads-up no-limit Texas hold'em poker.

Pages: 11