Reinforcement Learning for Stochastic Max-Plus Linear Systems

被引：0

作者：

Subramanian, Vignesh ^{[1
]}

Farhadi, Farzaneh ^{[2
]}

Soudjani, Sadegh ^{[3
]}

机构：

[1] Georgia Inst Technol, Atlanta, GA 30332 USA

[2] Newcastle Univ, Sch Engn, Newcastle Upon Tyne, Tyne & Wear, England

[3] Newcastle Univ, Sch Comp, Newcastle Upon Tyne, Tyne & Wear, England

来源：

2023 62ND IEEE CONFERENCE ON DECISION AND CONTROL, CDC | 2023年

基金：

英国工程与自然科学研究理事会;

关键词：

DISCRETE-EVENT SYSTEMS; REACHABILITY ANALYSIS;

D O I：

10.1109/CDC49753.2023.10384207

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

This paper studies the design of control policies for Discrete Event Systems under uncertainties. We capture the timing of the events using the framework of max-plus-linear systems in which the time between consecutive events depends on random delays with unknown distributions. Our policy synthesis approach is with respect to a cost function, and it can be extended directly to satisfy safety specifications on the timing of events. The main novelty of our approach is to translate the system evolution to a Markov decision process (MDP) that has an uncountable state space and develop a stochastic optimisation problem under the evolution of the MDP. To tackle the unknown distribution of uncertainties (thus unknown transition probabilities in the MDP), we employ model-free reinforcement learning to perform optimisations and find control policies for the system. Our implementation results on the 9-dimensional model of a railway network show superiority of our learning approach in comparison with the stochastic model predictive control approach.

引用

页码：5631 / 5638

页数：8

共 50 条

[1] Cycle time of stochastic max-plus linear systems
Merlet, Glen
ELECTRONIC JOURNAL OF PROBABILITY, 2008, 13 : 322 - 340
[2] Max-plus approximation for reinforcement learning
Goncalves, Vinicius Mariano
AUTOMATICA, 2021, 129
[3] On the exponentiality of stochastic linear systems under the Max-Plus algebra
Chang, CS
IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 1996, 41 (08) : 1182 - 1188
[4] Stochastic Filtering of Max-Plus Linear Systems With Bounded Disturbances
Mendes, Rafael Santos
Hardouin, Laurent
Lhommeau, Mehdi
IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2019, 64 (09) : 3706 - 3715
[5] Max-plus algebra and max-plus linear discrete event systems: An introduction
De Schutter, Bart
van den Boom, Ton
WODES' 08: PROCEEDINGS OF THE 9TH INTERNATIONAL WORKSHOP ON DISCRETE EVENT SYSTEMS, 2008, : 36 - 42
[6] On the eigenstructure of a class of max-plus linear systems
Lopes, G. A. D.
Kersbergen, B.
van den Boom, T.
De Schutter, B.
Babuska, R.
2011 50TH IEEE CONFERENCE ON DECISION AND CONTROL AND EUROPEAN CONTROL CONFERENCE (CDC-ECC), 2011, : 1823 - 1828
[7] Comparison and aggregation of max-plus linear systems
Ledoux, J
Truffet, L
LINEAR ALGEBRA AND ITS APPLICATIONS, 2004, 378 : 245 - 272
[8] Interval max-plus systems of linear equations
Myskova, Helena
LINEAR ALGEBRA AND ITS APPLICATIONS, 2012, 437 (08) : 1992 - 2000
[9] Tropical Abstractions of Max-Plus Linear Systems
Mufid, Muhammad Syifa'ul
Adzkiya, Dieky
Abate, Alessandro
FORMAL MODELING AND ANALYSIS OF TIMED SYSTEMS, FORMATS 2018, 2018, 11022 : 271 - 287
[10] Reachability for Interval Max-Plus Linear Systems
Wang, Cailu
Tao, Yuegang
Yang, Peng
PROCEEDINGS OF THE 36TH CHINESE CONTROL CONFERENCE (CCC 2017), 2017, : 2392 - 2396

← 1 2 3 4 5 →