Single Player Monte-Carlo Tree Search Based on the Plackett-Luce Model

被引:0
|
作者
Mohr, Felix [1 ]
Bengs, Viktor [2 ]
Huellermeier, Eyke [2 ]
机构
[1] Univ La Sabana, Campus Puente Comun,Km 7, Autopista Norte De Bogot, Chia, Colombia
[2] Paderborn Univ, Warburgerstr 100, Paderborn, Germany
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The problem of minimal cost path search is especially difficult when no useful heuristics are available. A common solution is roll-out-based search like Monte Carlo Tree Search (MCTS). However, MCTS is mostly used in stochastic or adversarial environments, with the goal to identify an agent's best next move. For this reason, even though single player versions of MCTS exist, most algorithms, including UCT, are not directly tailored to classical minimal cost path search. We present Plackett-Luce MCTS (PL-MCTS), a path search algorithm based on a probabilistic model over the qualities of successor nodes. We empirically show that PL-MCTS is competitive and often superior to the state of the art.
引用
收藏
页码:12373 / 12381
页数:9
相关论文
共 50 条
  • [1] Single-Player Monte-Carlo Tree Search
    Schadd, Maarten P. D.
    Winands, Mark H. M.
    van den Herik, H. Jaap
    Chaslot, Guillaume M. J. -B.
    Uiterwijk, Jos W. H. M.
    [J]. COMPUTERS AND GAMES, 2008, 5131 : 1 - +
  • [2] Converging to a Player Model In Monte-Carlo Tree Search
    Sarratt, Trevor
    Pynadath, David V.
    Jhala, Arnav
    [J]. 2014 IEEE CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND GAMES (CIG), 2014,
  • [3] Single-player Monte-Carlo tree search for SameGame
    Schadd, Maarten P. D.
    Winands, Mark H. M.
    Tak, Mandy J. W.
    Uiterwijk, Jos W. H. M.
    [J]. KNOWLEDGE-BASED SYSTEMS, 2012, 34 : 3 - 11
  • [4] A listwise collaborative filtering based on Plackett-Luce model
    Li, Lisha
    Guo, Fenzhuo
    Qin, Sujuan
    [J]. PROCEEDINGS OF 2017 3RD IEEE INTERNATIONAL CONFERENCE ON COMPUTER AND COMMUNICATIONS (ICCC), 2017, : 2159 - 2163
  • [5] PAC Battling Bandits in the Plackett-Luce Model
    Saha, Aadirupa
    Gopalan, Aditya
    [J]. ALGORITHMIC LEARNING THEORY, VOL 98, 2019, 98
  • [6] Enhancements for Multi-Player Monte-Carlo Tree Search
    Nijssen, J. A. M.
    Winands, Mark H. M.
    [J]. COMPUTERS AND GAMES, 2011, 6515 : 238 - +
  • [7] Enhancement of CNN-based 2048 Player with Monte-Carlo Tree Search
    Watanabe, Shota
    Matsuzaki, Kiminori
    [J]. 2022 INTERNATIONAL CONFERENCE ON TECHNOLOGIES AND APPLICATIONS OF ARTIFICIAL INTELLIGENCE, TAAI, 2022, : 48 - 53
  • [8] On Bayesian inference for the Extended Plackett-Luce model
    Johnson, Stephen R.
    Henderson, Daniel A.
    Boys, Richard J.
    [J]. BAYESIAN ANALYSIS, 2022, 17 (02): : 465 - 490
  • [9] Dyad Ranking Using A Bilinear Plackett-Luce Model
    Schaefer, Dirk
    Huellermeier, Eyke
    [J]. MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, ECML PKDD 2015, PT II, 2015, 9285 : 227 - 242
  • [10] A Note on Ranking in the Plackett-Luce Model for Multiple Comparisons
    Jing Luo
    Hong Qin
    [J]. Acta Mathematicae Applicatae Sinica, English Series, 2019, 35 : 885 - 892