Safe and Nested Subgame Solving for Imperfect-Information Games

Cited by: 0

Authors:

Brown, Noam [1]

Sandholm, Tuomas [1]

Affiliations:

[1] Carnegie Mellon Univ, Comp Sci Dept, Pittsburgh, PA 15217 USA

Source:

ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 30 (NIPS 2017) | 2017 / Vol. 30

Funding:

Andrew Mellon Foundation (USA); National Science Foundation (USA)

Keywords:

DOI:

Not available

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory]

Discipline classification codes:

081104; 0812; 0835; 1405

Abstract:

In imperfect-information games, the optimal strategy in a subgame may depend on the strategy in other, unreached subgames. Thus a subgame cannot be solved in isolation and must instead consider the strategy for the entire game as a whole, unlike perfect-information games. Nevertheless, it is possible to first approximate a solution for the whole game and then improve it in individual subgames. This is referred to as subgame solving. We introduce subgame-solving techniques that outperform prior methods both in theory and practice. We also show how to adapt them, and past subgame-solving techniques, to respond to opponent actions that are outside the original action abstraction; this significantly outperforms the prior state-of-the-art approach, action translation. Finally, we show that subgame solving can be repeated as the game progresses down the game tree, leading to far lower exploitability. These techniques were a key component of Libratus, the first AI to defeat top humans in heads-up no-limit Texas hold'em poker.

Pages: 11
