Safe and Nested Subgame Solving for Imperfect-Information Games

被引:0
|
作者
Brown, Noam [1 ]
Sandholm, Tuomas [1 ]
机构
[1] Carnegie Mellon Univ, Comp Sci Dept, Pittsburgh, PA 15217 USA
基金
美国安德鲁·梅隆基金会; 美国国家科学基金会;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In imperfect-information games, the optimal strategy in a subgame may depend on the strategy in other, unreached subgames. Thus a subgame cannot be solved in isolation and must instead consider the strategy for the entire game as a whole, unlike perfect-information games. Nevertheless, it is possible to first approximate a solution for the whole game and then improve it in individual subgames. This is referred to as subgame solving. We introduce subgame-solving techniques that outperform prior methods both in theory and practice. We also show how to adapt them, and past subgame-solving techniques, to respond to opponent actions that are outside the original action abstraction; this significantly outperforms the prior state-of-the-art approach, action translation. Finally, we show that subgame solving can be repeated as the game progresses down the game tree, leading to far lower exploitability. These techniques were a key component of Libratus, the first AI to defeat top humans in heads-up no-limit Texas hold' em poker.
引用
收藏
页数:11
相关论文
共 50 条
  • [21] Potential-Aware Imperfect-Recall Abstraction with Earth Mover's Distance in Imperfect-Information Games
    Ganzfried, Sam
    Sandholm, Tuomas
    PROCEEDINGS OF THE TWENTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2014, : 682 - 690
  • [22] Alpaga: A Tool for Solving Parity Games with Imperfect Information
    Berwanger, Dietmar
    Chatterjee, Krishnendu
    De Wulf, Martin
    Doyen, Laurent
    Henzinger, Thomas A.
    TOOLS AND ALGORITHMS FOR THE CONSTRUCTION AND ANALYSIS OF SYSTEMS, PROCEEDINGS, 2009, 5505 : 58 - +
  • [23] Improved learning efficiency of deep Monte-Carlo for complex imperfect-information card games
    Luo, Qian
    Tan, Tien -Ping
    APPLIED SOFT COMPUTING, 2024, 158
  • [24] Polynomial-Time Linear-Swap Regret Minimization in Imperfect-Information Sequential Games
    Farina, Gabriele
    Pipis, Charilaos
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [25] Using structural information to construct ensemble representations in imperfect-information scenes
    Zhu, Jingyin
    Lu, Yilong
    Zhou, Jifan
    Shen, Mowei
    INTERNATIONAL JOURNAL OF PSYCHOLOGY, 2023, 58 : 654 - 654
  • [26] Subgame perfection in recursive perfect information games
    Kuipers, Jeroen
    Flesch, Janos
    Schoenmakers, Gijs
    Vrieze, Koos
    ECONOMIC THEORY, 2021, 71 (02) : 603 - 662
  • [27] Subgame perfection in recursive perfect information games
    Jeroen Kuipers
    János Flesch
    Gijs Schoenmakers
    Koos Vrieze
    Economic Theory, 2021, 71 : 603 - 662
  • [28] Solving Imperfect Recall Games
    Cermak, Jiri
    AAMAS'17: PROCEEDINGS OF THE 16TH INTERNATIONAL CONFERENCE ON AUTONOMOUS AGENTS AND MULTIAGENT SYSTEMS, 2017, : 1820 - 1821
  • [29] A Review of Imperfect Information Games: Adversarial Solving Methods and Comparative Analysis
    Yu, Chao
    Liu, Zong-Kai
    Hu, Chao-Haon
    Huang, Kai-Qi
    Zhang, Jun-Ge
    Jisuanji Xuebao/Chinese Journal of Computers, 2024, 47 (09): : 2211 - 2246
  • [30] Research progress on sequential imperfect information game solving in computer games
    Luo J.-R.
    Zhang W.-P.
    Su J.-M.
    Wei T.-T.
    Chen J.
    Kongzhi yu Juece/Control and Decision, 2023, 38 (10): : 2721 - 2748