State Aggregation in Monte Carlo Tree Search

Cited by: 0
Authors
Hostetler, Jesse [1]
Fern, Alan [1]
Dietterich, Tom [1]
Affiliations
[1] Oregon State Univ, Dept Elect Engn & Comp Sci, Corvallis, OR 97331 USA
Funding
National Science Foundation (USA)
Keywords
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Monte Carlo tree search (MCTS) algorithms are a popular approach to online decision-making in Markov decision processes (MDPs). These algorithms can, however, perform poorly in MDPs with high stochastic branching factors. In this paper, we study state aggregation as a way of reducing stochastic branching in tree search. Prior work has studied formal properties of MDP state aggregation in the context of dynamic programming and reinforcement learning, but little attention has been paid to state aggregation in MCTS. Our main result is a performance loss bound for a class of value function-based state aggregation criteria in expectimax search trees. We also consider how to construct MCTS algorithms that operate in the abstract state space but require a simulator of the ground dynamics only. We find that trajectory sampling algorithms like UCT can be adapted easily, but that sparse sampling algorithms present difficulties. As a proof of concept, we experimentally confirm that state aggregation can improve the finite-sample performance of UCT.
Pages: 2446-2452
Number of pages: 7
Related papers
50 in total
  • [1] Accelerating Monte Carlo Tree Search with Probability Tree State Abstraction
    Fu, Yangqing
    Sun, Ming
    Nie, Buqing
    Gao, Yue
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36, NEURIPS 2023, 2023
  • [2] Monte Carlo Tree Search With Iteratively Refining State Abstractions
    Sokota, Samuel
    Ho, Caleb
    Ahmad, Zaheen
    Kolter, J. Zico
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
  • [3] Multiagent Monte Carlo Tree Search
    Zerbel, Nicholas
    Yliniemi, Logan
    [J]. AAMAS '19: PROCEEDINGS OF THE 18TH INTERNATIONAL CONFERENCE ON AUTONOMOUS AGENTS AND MULTIAGENT SYSTEMS, 2019, : 2309 - 2311
  • [4] Monte Carlo Tree Search with Metaheuristics
    Mandziuk, Jacek
    Walczak, Patryk
    [J]. ARTIFICIAL INTELLIGENCE AND SOFT COMPUTING, ICAISC 2023, PT II, 2023, 14126 : 134 - 144
  • [5] Elastic Monte Carlo Tree Search
    Xu, Linjie
    Dockhorn, Alexander
    Perez-Liebana, Diego
    [J]. IEEE TRANSACTIONS ON GAMES, 2023, 15 (04) : 527 - 537
  • [6] Monte Carlo Tree Search in Hex
    Arneson, Broderick
    Hayward, Ryan B.
    Henderson, Philip
    [J]. IEEE TRANSACTIONS ON COMPUTATIONAL INTELLIGENCE AND AI IN GAMES, 2010, 2 (04) : 251 - 258
  • [7] Monte Carlo tree search in Kriegspiel
    Ciancarini, Paolo
    Favini, Gian Piero
    [J]. ARTIFICIAL INTELLIGENCE, 2010, 174 (11) : 670 - 684
  • [8] Monte Carlo Tree Search for Quoridor
    Respall, Victor Massague
    Brown, Joseph Alexander
    Aslam, Hamna
    [J]. 19TH INTERNATIONAL CONFERENCE ON INTELLIGENT GAMES AND SIMULATION (GAME-ON(R) 2018), 2018, : 5 - 9
  • [9] An Analysis of Monte Carlo Tree Search
    James, Steven
    Konidaris, George
    Rosman, Benjamin
    [J]. THIRTY-FIRST AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2017, : 3576 - 3582
  • [10] MONTE CARLO TREE SEARCH: A TUTORIAL
    Fu, Michael C.
    [J]. 2018 WINTER SIMULATION CONFERENCE (WSC), 2018, : 222 - 236