Converging to a Player Model In Monte-Carlo Tree Search

Authors
Sarratt, Trevor [1]
Pynadath, David V. [2]
Jhala, Arnav [1]
Affiliations
[1] Univ Calif Santa Cruz, Santa Cruz, CA 95064 USA
[2] USC Inst Creat Technol, Los Angeles, CA 90094 USA
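Keywords
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract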
Player models allow search algorithms to account for differences in agent behavior according to players' preferences and goals. However, it is often not until the first actions are taken that an agent can begin assessing which models are relevant to its current opponent. This paper investigates the integration of belief distributions over player models into the Monte-Carlo Tree Search (MCTS) algorithm. We describe a method of updating belief distributions by leveraging information sampled during MCTS. We then characterize the effect of tuning MCTS parameters on the convergence of belief distributions. The approach is evaluated against value iteration on an iterated version of the prisoner's dilemma. We show that, given a sufficient number of iterations, our approach converges to the correct model faster than the same model under value iteration.
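This record does not give the paper's update rule, so the following is only a generic illustration of what maintaining a belief distribution over player models can look like: a plain Bayes update after each observed opponent move in an iterated prisoner's dilemma. The two models (`tit_for_tat`, `always_defect`), their cooperation probabilities, and the function `update_belief` are hypothetical names and values chosen for the sketch. They are not taken from the paper, and the sketch does not include the MCTS sampling step the abstract refers to.

```python
from typing import Callable, Dict

# Hypothetical player models (assumptions, not from the paper).
# Each maps our previous move ("C" or "D") to the probability that
# the modeled opponent cooperates on its next move.
Model = Callable[[str], float]


def tit_for_tat(our_prev_move: str) -> float:
    # Mostly mirrors our last move, with a little noise.
    return 0.9 if our_prev_move == "C" else 0.1


def always_defect(our_prev_move: str) -> float:
    # Almost never cooperates, regardless of what we did.
    return 0.1


MODELS: Dict[str, Model] = {
    "tit_for_tat": tit_for_tat,
    "always_defect": always_defect,
}


def update_belief(
    belief: Dict[str, float],
    models: Dict[str, Model],
    our_prev_move: str,
    observed_move: str,
) -> Dict[str, float]:
    """Return the posterior over models after one observed opponent move."""
    posterior = {}
    for name, prior in belief.items():
        p_coop = models[name](our_prev_move)
        likelihood = p_coop if observed_move == "C" else 1.0 - p_coop
        posterior[name] = prior * likelihood
    total = sum(posterior.values())
    if total == 0.0:
        # No model explains the observation; keep the prior unchanged.
        return dict(belief)
    return {name: p / total for name, p in posterior.items()}


# Start from a uniform belief and observe cooperation after we cooperated.
belief = {name: 1.0 / len(MODELS) for name in MODELS}
belief = update_belief(belief, MODELS, our_prev_move="C", observed_move="C")
```

Under these assumed probabilities, a single observed cooperation after our own cooperation moves most of the probability mass onto `tit_for_tat`, and repeated consistent observations would concentrate the belief further.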
Pages: 7