Monte Carlo Tree Search for Priced Timed Automata

被引:0
|
作者
Jensen, Peter Gjol [1 ]
Kiviriga, Andrej [1 ]
Larsen, Kim Guldstrand [1 ]
Nyman, Ulrik [1 ]
Mijacika, Adriana [1 ]
Mortensen, Jeppe Hoiriis [1 ]
机构
[1] Aalborg Univ, Selma Lagerlofs Vej 300, DK-9220 Aalborg, Denmark
关键词
Priced Timed Automata (PTA); Model-checking; Monte Carlo Tree Search (MCTS); Planning; Upper confidence bounds for trees (UCT); OPTIMAL REACHABILITY;
D O I
10.1007/978-3-031-16336-4_19
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Priced timed automata (PTA) were introduced in the early 2000s to allow for generic modelling of resource-consumption problems for systems with real-time constraints. Optimal schedules for allocation of resources may here be recast as optimal reachability problems. In the setting of PTA this problem has been shown decidable and efficient symbolic reachability algorithms have been developed. Moreover, PTA has been successfully applied in a variety of applications. Still, we believe that using techniques from the planning community may provide further improvements. Thus, in this paper we consider exploiting Monte Carlo Tree Search (MCTS), adapting it to problems formulated as PTA reachability problems. We evaluate our approach on a large benchmark set of PTAs modelling either Task graph or Job-shop scheduling problems. We discuss and implement different complete and incomplete exploration policies and study their performance on the benchmark. In addition, we experiment with both wellestablished and our novel MTCS-based optimizations of PTA and study their impact. We compare our method to the existing symbolic optimal reachability engines for PTAs and demonstrate that our method (1) finds near-optimal plans, and (2) can construct plans for problems infeasible to solve with existing symbolic planners for PTA.
引用
收藏
页码:381 / 398
页数:18
相关论文
共 50 条
  • [1] Quantitative Attack Tree Analysis via Priced Timed Automata
    Kumar, Rajesh
    Ruijters, Enno
    Stoelinga, Marielle
    [J]. FORMAL MODELING AND ANALYSIS OF TIMED SYSTEMS (FORMATS 2015), 2015, 9268 : 156 - 171
  • [2] Costs and rewards in priced timed automata
    Fraenzle, Martin
    Shirmohammadi, Mahsa
    Swaminathan, Mani
    Worrell, James
    [J]. INFORMATION AND COMPUTATION, 2022, 282
  • [3] Priced timed automata: Algorithms and applications
    Behrmann, G
    Larsen, KG
    Rasmussen, JI
    [J]. FORMAL METHODS FOR COMPONENTS AND OBJECTS, 2005, 3657 : 162 - 182
  • [4] Concavely-Priced Probabilistic Timed Automata
    Jurdzinski, Marcin
    Kwiatkowska, Marta
    Norman, Gethin
    Trivedi, Ashutosh
    [J]. CONCUR 2009 - CONCURRENCY THEORY, PROCEEDINGS, 2009, 5710 : 415 - +
  • [5] Optimal strategies in priced timed game automata
    Bouyer, P
    Cassez, F
    Fleury, E
    Larsen, KG
    [J]. FSTTCS 2004: FOUNDATIONS OF SOFTWARE TECHNOLOGY AND THEORETICAL COMPUTER SCIENCE, 2004, 3328 : 148 - 160
  • [6] Statistical Model Checking for Priced Timed Automata
    Bulychev, Peter
    David, Alexandre
    Larsen, Kim Guldstrand
    Legay, Axel
    Mikucionis, Marius
    Poulsen, Danny Bogsted
    Wang, Zheng
    [J]. ELECTRONIC PROCEEDINGS IN THEORETICAL COMPUTER SCIENCE, 2012, (85): : 1 - 16
  • [7] A Monte-Carlo Tree Search based Tracking Control Approach for Timed Petri Nets
    Fritz, Raphael
    Krebs, Nico
    Zhang, Ping
    [J]. IFAC PAPERSONLINE, 2020, 53 (02): : 2095 - 2100
  • [8] Multiagent Monte Carlo Tree Search
    Zerbel, Nicholas
    Yliniemi, Logan
    [J]. AAMAS '19: PROCEEDINGS OF THE 18TH INTERNATIONAL CONFERENCE ON AUTONOMOUS AGENTS AND MULTIAGENT SYSTEMS, 2019, : 2309 - 2311
  • [9] Monte Carlo Tree Search with Metaheuristics
    Mandziuk, Jacek
    Walczak, Patryk
    [J]. ARTIFICIAL INTELLIGENCE AND SOFT COMPUTING, ICAISC 2023, PT II, 2023, 14126 : 134 - 144
  • [10] Elastic Monte Carlo Tree Search
    Xu, Linjie
    Dockhorn, Alexander
    Perez-Liebana, Diego
    [J]. IEEE TRANSACTIONS ON GAMES, 2023, 15 (04) : 527 - 537