Converging to a Player Model In Monte-Carlo Tree Search

被引：0

作者：

Sarratt, Trevor ^{[1
]}

Pynadath, David V. ^{[2
]}

Jhala, Arnav ^{[1
]}

机构：

[1] Univ Calif Santa Cruz, Santa Cruz, CA 95064 USA

[2] USC Inst Creat Technol, Los Angeles, CA 90094 USA

来源：

2014 IEEE CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND GAMES (CIG) | 2014年

关键词：

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Player models allow search algorithms to account for differences in agent behavior according to player's preferences and goals. However, it is often not until the first actions are taken that an agent can begin assessing which models are relevant to its current opponent. This paper investigates the integration of belief distributions over player models in the Monte-Carlo Tree Search (MCTS) algorithm. We describe a method of updating belief distributions through leveraging information sampled during the MCTS. We then characterize the effect of tuning parameters of the MCTS to convergence of belief distributions. Evaluation of this approach is done in comparison with value iteration for an iterated version of the prisoner's dilemma problem. We show that for a sufficient quantity of iterations, our approach converges to the correct model faster than the same model under value iteration.

引用

页数：7

共 50 条

[1] Single-Player Monte-Carlo Tree Search
Schadd, Maarten P. D.
Winands, Mark H. M.
van den Herik, H. Jaap
Chaslot, Guillaume M. J. -B.
Uiterwijk, Jos W. H. M.
[J]. COMPUTERS AND GAMES, 2008, 5131 : 1 - +
[2] Single-player Monte-Carlo tree search for SameGame
Schadd, Maarten P. D.
Winands, Mark H. M.
Tak, Mandy J. W.
Uiterwijk, Jos W. H. M.
[J]. KNOWLEDGE-BASED SYSTEMS, 2012, 34 : 3 - 11
[3] Enhancements for Multi-Player Monte-Carlo Tree Search
Nijssen, J. A. M.
Winands, Mark H. M.
[J]. COMPUTERS AND GAMES, 2011, 6515 : 238 - +
[4] Single Player Monte-Carlo Tree Search Based on the Plackett-Luce Model
Mohr, Felix
Bengs, Viktor
Huellermeier, Eyke
[J]. THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 12373 - 12381
[5] Enhancement of CNN-based 2048 Player with Monte-Carlo Tree Search
Watanabe, Shota
Matsuzaki, Kiminori
[J]. 2022 INTERNATIONAL CONFERENCE ON TECHNOLOGIES AND APPLICATIONS OF ARTIFICIAL INTELLIGENCE, TAAI, 2022, : 48 - 53
[6] Monte-Carlo Tree Search for Logistics
Edelkamp, Stefan
Gath, Max
Greulich, Christoph
Humann, Malte
Herzog, Otthein
Lawo, Michael
[J]. COMMERCIAL TRANSPORT, 2016, : 427 - 440
[7] Monte-Carlo Tree Search Solver
Winands, Mark H. M.
Bjornsson, Yngvi
Saito, Jahn-Takeshi
[J]. COMPUTERS AND GAMES, 2008, 5131 : 25 - +
[8] Parallel Monte-Carlo Tree Search
Chaslot, Guillaume M. J. -B.
Winands, Mark H. M.
van den Herik, H. Jaap
[J]. COMPUTERS AND GAMES, 2008, 5131 : 60 - +
[9] Learning to Search Promising Regions by a Monte-Carlo Tree Model
Xia, Hai
Li, Changhe
Zeng, Sanyou
Tan, Qingshan
Wang, Junchen
Yang, Shengxiang
[J]. 2022 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION (CEC), 2022,
[10] Monte-Carlo Tree Search with Tree Shape Control
Marchenko, Oleksandr I.
Marchenko, Oleksii O.
[J]. 2017 IEEE FIRST UKRAINE CONFERENCE ON ELECTRICAL AND COMPUTER ENGINEERING (UKRCON), 2017, : 812 - 817

← 1 2 3 4 5 →