Point-based Incremental Pruning for Monte-Carlo Tree Search

被引:0
|
作者
Wu, Bo [1 ]
Feng, Yanpeng [1 ]
机构
[1] Shenzhen Polytech, Educ Technol & Informat Ctr, Shenzhen, Peoples R China
关键词
Partially observable Markov decision processes (POMPDs); Monte-Carlo tree search (MCTS); incremental pruning; VALUE-ITERATION;
D O I
10.1109/ICISCE.2017.119
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Monte-Carlo tree search (MCTS) combines the generality of stochastic simulation and the accuracy of tree search, which has attracted the great attention of scholars. However, the MCTS search requires a sufficient number of iterations to converge to a good solution, which is more difficult to optimize. In order to solve this problem, this paper presents a point-based incremental pruning (PIP) for Monte-Carlo tree search. Instead of reasoning about the whole policy trees space when pruning the cross-sums of the value functions during policy construction, our algorithm uses boundary belief points to perform exact pruning, and exploits intermediate points to perform approximate pruning by generating policy trees, then uses real-time belief states to get the optimal policy in policy execution. The theoretical analysis and empirical results indicate that PIP can speed up the tree search efficiently.
引用
收藏
页码:545 / 548
页数:4
相关论文
共 50 条
  • [1] Pruning Playouts in Monte-Carlo Tree Search for the Game of Havannah
    Dugueperoux, Joris
    Mazyad, Ahmad
    Teytaud, Fabien
    Dehos, Julien
    [J]. COMPUTERS AND GAMES, CG 2016, 2016, 10068 : 47 - 57
  • [2] A SHOGI PROGRAM BASED ON MONTE-CARLO TREE SEARCH
    Sato, Yoshikuni
    Takahashi, Daisuke
    Grimbergen, Reijer
    [J]. ICGA JOURNAL, 2010, 33 (02) : 80 - 92
  • [3] Monte-Carlo Tree Search for Logistics
    Edelkamp, Stefan
    Gath, Max
    Greulich, Christoph
    Humann, Malte
    Herzog, Otthein
    Lawo, Michael
    [J]. COMMERCIAL TRANSPORT, 2016, : 427 - 440
  • [4] Parallel Monte-Carlo Tree Search
    Chaslot, Guillaume M. J. -B.
    Winands, Mark H. M.
    van den Herik, H. Jaap
    [J]. COMPUTERS AND GAMES, 2008, 5131 : 60 - +
  • [5] Monte-Carlo Tree Search Solver
    Winands, Mark H. M.
    Bjornsson, Yngvi
    Saito, Jahn-Takeshi
    [J]. COMPUTERS AND GAMES, 2008, 5131 : 25 - +
  • [6] Comparative Study of Monte-Carlo Tree Search and Alpha-Beta Pruning in Amazons
    Kato, Hikari
    Fazekas, Szilard Zsolt
    Takaya, Mayumi
    Yamamura, Akihiro
    [J]. INFORMATION AND COMMUNICATION TECHNOLOGY, 2015, 9357 : 139 - 148
  • [7] Monte-Carlo Tree Search with Tree Shape Control
    Marchenko, Oleksandr I.
    Marchenko, Oleksii O.
    [J]. 2017 IEEE FIRST UKRAINE CONFERENCE ON ELECTRICAL AND COMPUTER ENGINEERING (UKRCON), 2017, : 812 - 817
  • [8] αβ-based Play-outs in Monte-Carlo Tree Search
    Winands, Mark H. M.
    Bjornsson, Yngvi
    [J]. 2011 IEEE CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND GAMES (CIG), 2011, : 110 - 117
  • [9] A Novel Point-based Incremental Pruning Algorithm for POMDP
    Wu, Bo
    Zheng, Hongyan
    Feng, Yanpeng
    [J]. APPLIED SCIENCE, MATERIALS SCIENCE AND INFORMATION TECHNOLOGIES IN INDUSTRY, 2014, 513-517 : 1088 - 1091
  • [10] Monte-Carlo Tree Search: To MC or to DP?
    Feldman, Zohar
    Domshlak, Carmel
    [J]. 21ST EUROPEAN CONFERENCE ON ARTIFICIAL INTELLIGENCE (ECAI 2014), 2014, 263 : 321 - 326