Monte Carlo tree search with optimal computing budget allocation

被引:0
|
作者
Li, Yunchuan [1 ,2 ]
Fu, Michael [2 ,3 ]
Xu, Jie [4 ]
机构
[1] Univ Maryland, Dept Elect & Comp Engn, College Pk, MD 20742 USA
[2] Univ Maryland, Syst Res Inst, College Pk, MD 20742 USA
[3] Univ Maryland, Robert H Smith Sch Business, College Pk, MD 20742 USA
[4] George Mason Univ, Dept Syst Engn & Operat Res, Fairfax, VA 22030 USA
基金
美国国家科学基金会;
关键词
EFFICIENCY;
D O I
10.1109/cdc40024.2019.9030099
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
We analyze a tree search problem with an underlying Markov decision process, in which the goal is to identify the best action at the root that achieves the highest cumulative reward. We present a new tree policy that optimally allocates a limited computing budget to maximize a lower bound on the probability of correctly selecting the best action at each node. Compared to the widely used Upper Confidence Bound (UCB) type of tree policies, the new tree policy presents a more balanced approach to manage the exploration and exploitation trade-off when the sampling budget is limited. Furthermore, UCB assumes that the support of reward distribution is known, whereas our algorithm relaxes this assumption, and can be applied to game trees with mild modifications. A numerical experiment is conducted to demonstrate the efficiency of our algorithm in selecting the best action at the root.
引用
收藏
页码:6332 / 6337
页数:6
相关论文
共 50 条
  • [1] An Optimal Computing Budget Allocation Tree Policy for Monte Carlo Tree Search
    Li, Yunchuan
    Fu, Michael C.
    Xu, Jie
    IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2022, 67 (06) : 2685 - 2699
  • [2] Optimal computing budget allocation for Monte Carlo simulation with application to product design
    Chen, CH
    Donohue, K
    Yücesan, E
    Lin, JW
    SIMULATION MODELLING PRACTICE AND THEORY, 2003, 11 (01) : 57 - 74
  • [3] Monte Carlo Tree Search for Multi-Robot Task Allocation
    Kartal, Bilal
    Nunes, Ernesto
    Godoy, Julio
    Gini, Maria
    THIRTIETH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2016, : 4222 - 4223
  • [4] The Monte Carlo tree search based bandwidth slicing allocation algorithm
    Chang, Shun-Chieh
    WIRELESS NETWORKS, 2024, 30 (02) : 835 - 844
  • [5] The Monte Carlo tree search based bandwidth slicing allocation algorithm
    Shun-Chieh Chang
    Wireless Networks, 2024, 30 : 835 - 844
  • [6] AlphaTruss: Monte Carlo Tree Search for Optimal Truss Layout Design
    Luo, Ruifeng
    Wang, Yifan
    Xiao, Weifang
    Zhao, Xianzhong
    BUILDINGS, 2022, 12 (05)
  • [7] Toward Optimal FDM Toolpath Planning with Monte Carlo Tree Search
    Yoo, Chanyeol
    Lensgraf, Samuel
    Fitch, Robert
    Clemon, Lee M.
    Mettu, Ramgopal
    2020 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2020, : 4037 - 4043
  • [8] A Comprehensive Optimal Design of Inductors Using Monte Carlo Tree Search
    Yin, Shuli
    Sato, Hayaho
    Igarashi, Hajime
    IEEE TRANSACTIONS ON MAGNETICS, 2024, 60 (03)
  • [9] Multiagent Monte Carlo Tree Search
    Zerbel, Nicholas
    Yliniemi, Logan
    AAMAS '19: PROCEEDINGS OF THE 18TH INTERNATIONAL CONFERENCE ON AUTONOMOUS AGENTS AND MULTIAGENT SYSTEMS, 2019, : 2309 - 2311
  • [10] Monte Carlo Tree Search with Metaheuristics
    Mandziuk, Jacek
    Walczak, Patryk
    ARTIFICIAL INTELLIGENCE AND SOFT COMPUTING, ICAISC 2023, PT II, 2023, 14126 : 134 - 144