Monte Carlo Tree Search for Priced Timed Automata

被引:0
|
作者
Jensen, Peter Gjol [1 ]
Kiviriga, Andrej [1 ]
Larsen, Kim Guldstrand [1 ]
Nyman, Ulrik [1 ]
Mijacika, Adriana [1 ]
Mortensen, Jeppe Hoiriis [1 ]
机构
[1] Aalborg Univ, Selma Lagerlofs Vej 300, DK-9220 Aalborg, Denmark
关键词
Priced Timed Automata (PTA); Model-checking; Monte Carlo Tree Search (MCTS); Planning; Upper confidence bounds for trees (UCT); OPTIMAL REACHABILITY;
D O I
10.1007/978-3-031-16336-4_19
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Priced timed automata (PTA) were introduced in the early 2000s to allow for generic modelling of resource-consumption problems for systems with real-time constraints. Optimal schedules for allocation of resources may here be recast as optimal reachability problems. In the setting of PTA this problem has been shown decidable and efficient symbolic reachability algorithms have been developed. Moreover, PTA has been successfully applied in a variety of applications. Still, we believe that using techniques from the planning community may provide further improvements. Thus, in this paper we consider exploiting Monte Carlo Tree Search (MCTS), adapting it to problems formulated as PTA reachability problems. We evaluate our approach on a large benchmark set of PTAs modelling either Task graph or Job-shop scheduling problems. We discuss and implement different complete and incomplete exploration policies and study their performance on the benchmark. In addition, we experiment with both wellestablished and our novel MTCS-based optimizations of PTA and study their impact. We compare our method to the existing symbolic optimal reachability engines for PTAs and demonstrate that our method (1) finds near-optimal plans, and (2) can construct plans for problems infeasible to solve with existing symbolic planners for PTA.
引用
收藏
页码:381 / 398
页数:18
相关论文
共 50 条
  • [21] Statistical Model Checking for Networks of Priced Timed Automata
    David, Alexandre
    Larsen, Kim G.
    Legay, Axel
    Mikucionis, Marius
    Poulsen, Danny Bogsted
    van Vliet, Jonas
    Wang, Zheng
    [J]. FORMAL MODELING AND ANALYSIS OF TIMED SYSTEMS, 2011, 6919 : 80 - +
  • [22] A TUTORIAL INTRODUCTION TO MONTE CARLO TREE SEARCH
    Fu, Michael C.
    [J]. 2020 WINTER SIMULATION CONFERENCE (WSC), 2020, : 1178 - 1193
  • [23] Approximation Methods for Monte Carlo Tree Search
    Aksenov, Kirill
    Panov, Aleksandr, I
    [J]. PROCEEDINGS OF THE FOURTH INTERNATIONAL SCIENTIFIC CONFERENCE INTELLIGENT INFORMATION TECHNOLOGIES FOR INDUSTRY (IITI'19), 2020, 1156 : 68 - 74
  • [24] Monte-Carlo Tree Search for Logistics
    Edelkamp, Stefan
    Gath, Max
    Greulich, Christoph
    Humann, Malte
    Herzog, Otthein
    Lawo, Michael
    [J]. COMMERCIAL TRANSPORT, 2016, : 427 - 440
  • [25] LinUCB applied to Monte Carlo tree search
    Mandai, Yusaku
    Kaneko, Tomoyuki
    [J]. THEORETICAL COMPUTER SCIENCE, 2016, 644 : 114 - 126
  • [26] Monte Carlo Tree Search for Trading and Hedging
    Vittori, Edoardo
    Likmeta, Amarildo
    Restelli, Marcello
    [J]. ICAIF 2021: THE SECOND ACM INTERNATIONAL CONFERENCE ON AI IN FINANCE, 2021,
  • [27] A Survey of Monte Carlo Tree Search Methods
    Browne, Cameron B.
    Powley, Edward
    Whitehouse, Daniel
    Lucas, Simon M.
    Cowling, Peter I.
    Rohlfshagen, Philipp
    Tavener, Stephen
    Perez, Diego
    Samothrakis, Spyridon
    Colton, Simon
    [J]. IEEE TRANSACTIONS ON COMPUTATIONAL INTELLIGENCE AND AI IN GAMES, 2012, 4 (01) : 1 - 43
  • [28] Nonasymptotic Analysis of Monte Carlo Tree Search
    Shah, Devavrat
    Xie, Qiaomin
    Xu, Zhi
    [J]. OPERATIONS RESEARCH, 2022, 70 (06) : 3234 - 3260
  • [29] Monte Carlo Tree Search with Robust Exploration
    Imagawa, Takahisa
    Kaneko, Tomoyuki
    [J]. COMPUTERS AND GAMES, CG 2016, 2016, 10068 : 34 - 46
  • [30] State Aggregation in Monte Carlo Tree Search
    Hostetler, Jesse
    Fern, Alan
    Dietterich, Tom
    [J]. PROCEEDINGS OF THE TWENTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2014, : 2446 - 2452