Can Monte-Carlo Tree Search learn to sacrifice?

被引:0
|
作者
Nathan Companez
Aldeida Aleti
机构
[1] Monash University,Faculty of Information Technology
来源
Journal of Heuristics | 2016年 / 22卷
关键词
Monte-Carlo Tree Search; Sacrifice moves; Artificial intelligence; Games;
D O I
暂无
中图分类号
学科分类号
摘要
One of the most basic activities performed by an intelligent agent is deciding what to do next. The decision is usually about selecting the move with the highest expectation, or exploring new scenarios. Monte-Carlo Tree Search (MCTS), which was developed as a game playing agent, deals with this exploration–exploitation ‘dilemma’ using a multi-armed bandits strategy. The success of MCTS in a wide range of problems, such as combinatorial optimisation, reinforcement learning, and games, is due to its ability to rapidly evaluate problem states without requiring domain-specific knowledge. However, it has been acknowledged that the trade-off between exploration and exploitation is crucial for the performance of the algorithm, and affects the efficiency of the agent in learning deceptive states. One type of deception is states that give immediate rewards, but lead to a suboptimal solution in the long run. These states are known as trap states, and have been thoroughly investigated in previous research. In this work, we study the opposite of trap states, known as sacrifice states, which are deceptive moves that result in a local loss but are globally optimal, and investigate the efficiency of MCTS enhancements in identifying this type of moves.
引用
下载
收藏
页码:783 / 813
页数:30
相关论文
共 50 条
  • [21] Monte-Carlo tree search as regularized policy optimization
    Grill, Jean-Bastien
    Altche, Florent
    Tang, Yunhao
    Hubert, Thomas
    Valko, Michal
    Antonoglou, Ioannis
    Munos, Remi
    25TH AMERICAS CONFERENCE ON INFORMATION SYSTEMS (AMCIS 2019), 2019,
  • [22] AIs for Dominion Using Monte-Carlo Tree Search
    Tollisen, Robin
    Jansen, Jon Vegard
    Goodwin, Morten
    Glimsdal, Sondre
    CURRENT APPROACHES IN APPLIED ARTIFICIAL INTELLIGENCE, 2015, 9101 : 43 - 52
  • [23] Parallel Monte-Carlo Tree Search with Simulation Servers
    Kato, Hideki
    Takeuchi, Ikuo
    INTERNATIONAL CONFERENCE ON TECHNOLOGIES AND APPLICATIONS OF ARTIFICIAL INTELLIGENCE (TAAI 2010), 2010, : 491 - 498
  • [24] A SHOGI PROGRAM BASED ON MONTE-CARLO TREE SEARCH
    Sato, Yoshikuni
    Takahashi, Daisuke
    Grimbergen, Reijer
    ICGA JOURNAL, 2010, 33 (02) : 80 - 92
  • [25] Generalized Mean Estimation in Monte-Carlo Tree Search
    Dam, Tuan
    Klink, Pascal
    D'Eramo, Carlo
    Peters, Jan
    Pajarinen, Joni
    PROCEEDINGS OF THE TWENTY-NINTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2020, : 2397 - 2404
  • [26] Automated Machine Learning with Monte-Carlo Tree Search
    Rakotoarison, Herilalaina
    Schoenauer, Marc
    Sebag, Michele
    PROCEEDINGS OF THE TWENTY-EIGHTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2019, : 3296 - 3303
  • [27] Monte-Carlo Tree Search Parallelisation for Computer Go
    van Niekerk, Francois
    Kroon, Steve
    van Rooyen, Gert-Jan
    Inggs, Cornelia P.
    PROCEEDINGS OF THE SOUTH AFRICAN INSTITUTE FOR COMPUTER SCIENTISTS AND INFORMATION TECHNOLOGISTS CONFERENCE, 2012, : 129 - 138
  • [28] CROSS-ENTROPY FOR MONTE-CARLO TREE SEARCH
    Chaslot, Guillaume M. J. B.
    Winands, Mark H. M.
    Szita, Istvan
    van den Herik, H. Jaap
    ICGA JOURNAL, 2008, 31 (03) : 145 - 156
  • [29] Monte-Carlo Tree Search for the Game of Scotland Yard
    Nijssen, J. A. M.
    Winands, Mark H. M.
    2011 IEEE CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND GAMES (CIG), 2011, : 158 - 165
  • [30] Bayesian Optimization for Backpropagation in Monte-Carlo Tree Search
    Lim, Nengli
    Li, Yueqin
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2021, PT II, 2021, 12892 : 209 - 221