Monte Carlo Search Algorithm Discovery for Single-Player Games

被引:7
|
作者
Maes, Francis [1 ]
St-Pierre, David Lupien [2 ]
Ernst, Damien [2 ]
机构
[1] Univ Liege, Syst & Modeling Res Unit, B-4000 Liege, Belgium
[2] Univ Liege, Dept Elect Engn & Comp Sci, B-4000 Liege, Belgium
关键词
Algorithm selection; grammar of algorithms; Monte Carlo search (MCS);
D O I
10.1109/TCIAIG.2013.2239295
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Much current research in AI and games is being devoted to Monte Carlo search (MCS) algorithms. While the quest for a single unified MCS algorithm that would perform well on all problems is of major interest for AI, practitioners often know in advance the problem they want to solve, and spend plenty of time exploiting this knowledge to customize their MCS algorithm in a problem-driven way. We propose an MCS algorithm discovery scheme to perform this in an automatic and reproducible way. First, we introduce a grammar over MCS algorithms that enables inducing a rich space of candidate algorithms. Afterwards, we search in this space for the algorithm that performs best on average for a given distribution of training problems. We rely on multiarmed bandits to approximately solve this optimization problem. The experiments, generated on three different domains, show that our approach enables discovering algorithms that outperform several well-known MCS algorithms such as upper confidence bounds applied to trees and nested Monte Carlo search. We also show that the discovered algorithms are generally quite robust with respect to changes in the distribution over the training problems.
引用
收藏
页码:201 / 213
页数:13
相关论文
共 50 条
  • [1] Single-Player Monte-Carlo Tree Search
    Schadd, Maarten P. D.
    Winands, Mark H. M.
    van den Herik, H. Jaap
    Chaslot, Guillaume M. J. -B.
    Uiterwijk, Jos W. H. M.
    [J]. COMPUTERS AND GAMES, 2008, 5131 : 1 - +
  • [2] Single-player Monte-Carlo tree search for SameGame
    Schadd, Maarten P. D.
    Winands, Mark H. M.
    Tak, Mandy J. W.
    Uiterwijk, Jos W. H. M.
    [J]. KNOWLEDGE-BASED SYSTEMS, 2012, 34 : 3 - 11
  • [3] An analysis of Single-Player Monte Carlo Tree Search performance in Sokoban
    Crippa, Mattia
    Lanzi, Pier Luca
    Marocchi, Fabio
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2022, 192
  • [4] Combining UCT and Nested Monte Carlo Search for Single-Player General Game Playing
    Mehat, Jean
    Cazenave, Tristan
    [J]. IEEE TRANSACTIONS ON COMPUTATIONAL INTELLIGENCE AND AI IN GAMES, 2010, 2 (04) : 271 - 277
  • [5] A single-player Monte Carlo tree search method combined with node importance for virtual network embedding
    Zheng, Guangcong
    Wang, Cong
    Shao, Weijie
    Yuan, Ying
    Tian, Zejie
    Peng, Sancheng
    Bashir, Ali Kashif
    Mumtaz, Shahid
    [J]. ANNALS OF TELECOMMUNICATIONS, 2021, 76 (5-6) : 297 - 312
  • [6] Optimal Sequencing in Single-Player Games
    Li, Yifu
    Ryan, Christopher Thomas
    Sheng, Lifei
    [J]. MANAGEMENT SCIENCE, 2023, 69 (10) : 6057 - 6075
  • [7] A single-player Monte Carlo tree search method combined with node importance for virtual network embedding
    Guangcong Zheng
    Cong Wang
    Weijie Shao
    Ying Yuan
    Zejie Tian
    Sancheng Peng
    Ali Kashif Bashir
    Shahid Mumtaz
    [J]. Annals of Telecommunications, 2021, 76 : 297 - 312
  • [8] Worker’s knowledge evaluation with single-player Monte Carlo tree search for a practical reentrant scheduling problem
    Furuoka R.
    Matsumoto S.
    [J]. Artificial Life and Robotics, 2017, 22 (1) : 130 - 138
  • [9] Nested Monte Carlo Search for Two-Player Games
    Cazenave, Tristan
    Saffidine, Abdallah
    Schofield, Michael
    Thielscher, Michael
    [J]. THIRTIETH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2016, : 687 - 693
  • [10] Evaluation of Simulation Strategy on Single-Player Monte-Carlo Tree Search and its Discussion for a Practical Scheduling Problem
    Matsumoto, Shimpei
    Hirosue, Noriaki
    Itonaga, Kyohei
    Yokoo, Kazuma
    Futahashi, Hisatomo
    [J]. INTERNATIONAL MULTICONFERENCE OF ENGINEERS AND COMPUTER SCIENTISTS (IMECS 2010), VOLS I-III, 2010, : 2086 - +