State Aggregation in Monte Carlo Tree Search

被引：0

作者：

Hostetler, Jesse ^{[1
]}

Fern, Alan ^{[1
]}

Dietterich, Tom ^{[1
]}

机构：

[1] Oregon State Univ, Dept Elect Engn & Comp Sci, Corvallis, OR 97331 USA

来源：

PROCEEDINGS OF THE TWENTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE | 2014年

基金：

美国国家科学基金会;

关键词：

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Monte Carlo tree search (MCTS) algorithms are a popular approach to online decision-making in Markov decision processes (MDPs). These algorithms can, however, perform poorly in MDPs with high stochastic branching factors. In this paper, we study state aggregation as a way of reducing stochastic branching in tree search. Prior work has studied formal properties of MDP state aggregation in the context of dynamic programming and reinforcement learning, but little attention has been paid to state aggregation in MCTS. Our main result is a performance loss bound for a class of value function-based state aggregation criteria in expectimax search trees. We also consider how to construct MCTS algorithms that operate in the abstract state space but require a simulator of the ground dynamics only. We find that trajectory sampling algorithms like UCT can be adapted easily, but that sparse sampling algorithms present difficulties. As a proof of concept, we experimentally confirm that state aggregation can improve the finite-sample performance of UCT.

引用

页码：2446 / 2452

页数：7

共 50 条

[1] Accelerating Monte Carlo Tree Search with Probability Tree State Abstraction
Fu, Yangqing
Sun, Ming
Nie, Buqing
Gao, Yue
[J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36, NEURIPS 2023, 2023,
[2] Monte Carlo Tree Search With Iteratively Refining State Abstractions
Sokota, Samuel
Ho, Caleb
Ahmad, Zaheen
Kolter, J. Zico
[J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
[3] Multiagent Monte Carlo Tree Search
Zerbel, Nicholas
Yliniemi, Logan
[J]. AAMAS '19: PROCEEDINGS OF THE 18TH INTERNATIONAL CONFERENCE ON AUTONOMOUS AGENTS AND MULTIAGENT SYSTEMS, 2019, : 2309 - 2311
[4] Monte Carlo Tree Search with Metaheuristics
Mandziuk, Jacek
Walczak, Patryk
[J]. ARTIFICIAL INTELLIGENCE AND SOFT COMPUTING, ICAISC 2023, PT II, 2023, 14126 : 134 - 144
[5] Elastic Monte Carlo Tree Search
Xu, Linjie
Dockhorn, Alexander
Perez-Liebana, Diego
[J]. IEEE TRANSACTIONS ON GAMES, 2023, 15 (04) : 527 - 537
[6] Monte Carlo Tree Search in Hex
Arneson, Broderick
Hayward, Ryan B.
Henderson, Philip
[J]. IEEE TRANSACTIONS ON COMPUTATIONAL INTELLIGENCE AND AI IN GAMES, 2010, 2 (04) : 251 - 258
[7] Monte Carlo tree search in Kriegspiel
Ciancarini, Paolo
Favini, Gian Piero
[J]. ARTIFICIAL INTELLIGENCE, 2010, 174 (11) : 670 - 684
[8] Monte Carlo Tree Search for Quoridor
Respall, Victor Massague
Brown, Joseph Alexander
Aslam, Hamna
[J]. 19TH INTERNATIONAL CONFERENCE ON INTELLIGENT GAMES AND SIMULATION (GAME-ON(R) 2018), 2018, : 5 - 9
[9] An Analysis of Monte Carlo Tree Search
James, Steven
Konidaris, George
Rosman, Benjamin
[J]. THIRTY-FIRST AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2017, : 3576 - 3582
[10] MONTE CARLO TREE SEARCH: A TUTORIAL
Fu, Michael C.
[J]. 2018 WINTER SIMULATION CONFERENCE (WSC), 2018, : 222 - 236

← 1 2 3 4 5 →