A comparison of Monte Carlo tree search and rolling horizon optimization for large-scale dynamic resource allocation problems

被引:30
|
作者
Bertsimas, Dimitris [1 ,2 ]
Griffith, J. Daniel [3 ]
Gupta, Vishal [4 ]
Kochenderfer, Mykel J. [5 ]
Misic, Velibor V. [6 ]
机构
[1] MIT, Sloan Sch Management, 77 Massachusetts Ave, Cambridge, MA 02139 USA
[2] MIT, Ctr Operat Res, 77 Massachusetts Ave, Cambridge, MA 02139 USA
[3] MIT, Lincoln Lab, 244 Wood St, Lexington, MA 02420 USA
[4] Univ Southern Calif, Marshall Sch Business, Dept Data Sci & Operat, 3670 Trousdale Pkwy, Los Angeles, CA 90089 USA
[5] Stanford Univ, Dept Aeronaut & Astronaut, 496 Lomita Mali, Stanford, CA 94305 USA
[6] Univ Calif Los Angeles, Anderson Sch Management, 110 Westwood Plaza, Los Angeles, CA 90024 USA
基金
加拿大自然科学与工程研究理事会;
关键词
Dynamic resource allocation; Monte Carlo tree search; Rolling horizon optimization; Wildfire management; Queueing control; INITIAL ATTACK; ALGORITHM; MODEL;
D O I
10.1016/j.ejor.2017.05.032
中图分类号
C93 [管理学];
学科分类号
12 ; 1201 ; 1202 ; 120202 ;
摘要
Dynamic resource allocation (DRA) problems constitute an important class of dynamic stochastic optimization problems that arise in many real-world applications. DRA problems are notoriously difficult to solve since they combine stochastic dynamics with intractably large state and action spaces. Although the artificial intelligence and operations research communities have independently proposed two successful frameworks for solving such problems Monte Carlo tree search (MCTS) and rolling horizon optirhization (RHO), respectively the relative merits of these two approaches are not well understood. In this paper, we adapt MCTS and RHO to two problems - a problem inspired by tactical wildfire management and a classical problem involving the control of queueing networks - and undertake an extensive computational study comparing the two methods on large scale instances of both problems in terms of both the state and the action spaces. Both methods are able to greatly improve on a baseline, problem-specific heuristic. On smaller instances, the MCTS and RHO approaches perform comparably, but RHO outperforms MCTS as the size of the problem increases for a, fixed computational budget. (C) 2017 Elsevier B.V. All rights reserved.
引用
收藏
页码:664 / 678
页数:15
相关论文
共 50 条
  • [1] Improving Performance Insensitivity of Large-Scale Multiobjective Optimization via Monte Carlo Tree Search
    Hong, Haokai
    Jiang, Min
    Yen, Gary G.
    [J]. IEEE TRANSACTIONS ON CYBERNETICS, 2024, 54 (03) : 1816 - 1827
  • [2] Multi-space evolutionary search with dynamic resource allocation strategy for large-scale optimization
    Shang, Qingxia
    Huang, Yuxiao
    Dong, Junwei
    Hou, Yaqing
    Wang, Yu
    Li, Min
    Feng, Liang
    [J]. NEURAL COMPUTING & APPLICATIONS, 2022, 34 (10): : 7673 - 7689
  • [3] Multi-space evolutionary search with dynamic resource allocation strategy for large-scale optimization
    Qingxia Shang
    Yuxiao Huang
    Junwei Dong
    Yaqing Hou
    Yu Wang
    Min Li
    Liang Feng
    [J]. Neural Computing and Applications, 2022, 34 : 7673 - 7689
  • [4] AlphaRoute: Large-Scale Coordinated Route Planning via Monte Carlo Tree Search
    Luo, Guiyang
    Wang, Yantao
    Zhang, Hui
    Yuan, Quan
    Li, Jinglin
    [J]. THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 10, 2023, : 12058 - 12067
  • [5] A Monte Carlo technique for large-scale dynamic tomography
    Butala, Mark D.
    Frazin, Richard A.
    Chen, Yuguo
    Kamalabadi, Farzad
    [J]. 2007 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL III, PTS 1-3, PROCEEDINGS, 2007, : 1217 - +
  • [6] Large Scale Hard Sample Mining with Monte Carlo Tree Search
    Canevet, Olivier
    Fleuret, Francois
    [J]. 2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 5128 - 5137
  • [7] Cooperative Particle Swarm Optimization With a Bilevel Resource Allocation Mechanism for Large-Scale Dynamic Optimization
    Liu, Xiao-Fang
    Zhang, Jun
    Wang, Jun
    [J]. IEEE TRANSACTIONS ON CYBERNETICS, 2023, 53 (02) : 1000 - 1011
  • [8] On the effectiveness of Monte Carlo simulation and heuristic search for solving large-scale block layout problems
    Chan, Wai Kin Victor
    Malmborg, Charles J.
    [J]. INTERNATIONAL JOURNAL OF PRODUCTION RESEARCH, 2013, 51 (14) : 4258 - 4272
  • [9] Rolling horizon procedure for large-scale job-shop scheduling problems
    Wang, Bing
    Li, Qiaoyun
    [J]. 2007 IEEE INTERNATIONAL CONFERENCE ON AUTOMATION AND LOGISTICS, VOLS 1-6, 2007, : 829 - 834
  • [10] Rolling Horizon Procedure for Large-scale Equipment Replacement Problems with Multiple Alternatives
    Wang, Bing
    Liu, Dongpo
    [J]. 2011 CHINESE CONTROL AND DECISION CONFERENCE, VOLS 1-6, 2011, : 2741 - +