Single Player Monte-Carlo Tree Search Based on the Plackett-Luce Model

Citations: 0

Authors:

Mohr, Felix [1]

Bengs, Viktor [2]

Huellermeier, Eyke [2]

Affiliations:

[1] Univ La Sabana, Campus Puente Comun, Km 7, Autopista Norte De Bogotá, Chia, Colombia

[2] Paderborn Univ, Warburgerstr 100, Paderborn, Germany

Source:

THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE | 2021, Vol. 35

Keywords:

DOI:

Not available

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

The problem of minimal cost path search is especially difficult when no useful heuristics are available. A common solution is roll-out-based search such as Monte Carlo Tree Search (MCTS). However, MCTS is mostly used in stochastic or adversarial environments, with the goal of identifying an agent's best next move. For this reason, even though single-player versions of MCTS exist, most algorithms, including UCT, are not directly tailored to classical minimal cost path search. We present Plackett-Luce MCTS (PL-MCTS), a path search algorithm based on a probabilistic model over the qualities of successor nodes. We empirically show that PL-MCTS is competitive with, and often superior to, the state of the art.

Pages: 12373-12381

Page count: 9

