Monte Carlo Tree Search for Priced Timed Automata

Cited by: 0

Authors:

Jensen, Peter Gjol [1]

Kiviriga, Andrej [1]

Larsen, Kim Guldstrand [1]

Nyman, Ulrik [1]

Mijacika, Adriana [1]

Mortensen, Jeppe Hoiriis [1]

Affiliations:

[1] Aalborg Univ, Selma Lagerlofs Vej 300, DK-9220 Aalborg, Denmark

Source:

QUANTITATIVE EVALUATION OF SYSTEMS (QEST 2022) | 2022 / Vol. 13479

Keywords:

Priced Timed Automata (PTA); Model-checking; Monte Carlo Tree Search (MCTS); Planning; Upper confidence bounds for trees (UCT); Optimal reachability

DOI:

10.1007/978-3-031-16336-4_19

Chinese Library Classification (CLC) number:

TP301 [Theory, Methods];

Subject classification code:

081202;

Abstract:

Priced timed automata (PTA) were introduced in the early 2000s to allow generic modelling of resource-consumption problems for systems with real-time constraints. Optimal schedules for the allocation of resources can then be recast as optimal reachability problems. In the setting of PTA, this problem has been shown to be decidable, and efficient symbolic reachability algorithms have been developed. Moreover, PTAs have been successfully applied in a variety of applications. Still, we believe that techniques from the planning community may provide further improvements. Thus, in this paper we consider exploiting Monte Carlo Tree Search (MCTS), adapting it to problems formulated as PTA reachability problems. We evaluate our approach on a large benchmark set of PTAs modelling either task-graph or job-shop scheduling problems. We discuss and implement different complete and incomplete exploration policies and study their performance on the benchmark. In addition, we experiment with both well-established and our novel MCTS-based optimizations of PTA and study their impact. We compare our method to the existing symbolic optimal reachability engines for PTAs and demonstrate that our method (1) finds near-optimal plans, and (2) can construct plans for problems infeasible to solve with existing symbolic planners for PTA.
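As an illustration of the UCT rule named in the keywords, the following is a minimal Python sketch of the textbook UCB1 selection step used in MCTS. It is not taken from the paper: the function names, the `children` data layout, and the choice of negated cost as reward are all assumptions made here for illustration only.

```python
import math

def uct_score(total_reward, visits, parent_visits, c=math.sqrt(2)):
    """Textbook UCB1 score: mean reward plus an exploration bonus."""
    if visits == 0:
        return math.inf  # unvisited children are always tried first
    mean = total_reward / visits
    return mean + c * math.sqrt(math.log(parent_visits) / visits)

def select_child(children, c=math.sqrt(2)):
    """Return the action with the highest UCT score.

    `children` maps a hypothetical action name to (total_reward, visits).
    For a cost-minimising problem, reward is assumed to be negated cost.
    """
    parent_visits = sum(visits for _, visits in children.values())
    return max(children, key=lambda a: uct_score(*children[a], parent_visits, c))
```

With `c=0` the rule is purely greedy on the mean reward; larger values of `c` favour rarely visited actions.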


Pages: 381-398

Page count: 18
