Reinforcement Learning for Stochastic Max-Plus Linear Systems

被引:0
|
作者
Subramanian, Vignesh [1 ]
Farhadi, Farzaneh [2 ]
Soudjani, Sadegh [3 ]
机构
[1] Georgia Inst Technol, Atlanta, GA 30332 USA
[2] Newcastle Univ, Sch Engn, Newcastle Upon Tyne, Tyne & Wear, England
[3] Newcastle Univ, Sch Comp, Newcastle Upon Tyne, Tyne & Wear, England
基金
英国工程与自然科学研究理事会;
关键词
DISCRETE-EVENT SYSTEMS; REACHABILITY ANALYSIS;
D O I
10.1109/CDC49753.2023.10384207
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper studies the design of control policies for Discrete Event Systems under uncertainties. We capture the timing of the events using the framework of max-plus-linear systems in which the time between consecutive events depends on random delays with unknown distributions. Our policy synthesis approach is with respect to a cost function, and it can be extended directly to satisfy safety specifications on the timing of events. The main novelty of our approach is to translate the system evolution to a Markov decision process (MDP) that has an uncountable state space and develop a stochastic optimisation problem under the evolution of the MDP. To tackle the unknown distribution of uncertainties (thus unknown transition probabilities in the MDP), we employ model-free reinforcement learning to perform optimisations and find control policies for the system. Our implementation results on the 9-dimensional model of a railway network show superiority of our learning approach in comparison with the stochastic model predictive control approach.
引用
收藏
页码:5631 / 5638
页数:8
相关论文
共 50 条
  • [1] Cycle time of stochastic max-plus linear systems
    Merlet, Glen
    ELECTRONIC JOURNAL OF PROBABILITY, 2008, 13 : 322 - 340
  • [2] Max-plus approximation for reinforcement learning
    Goncalves, Vinicius Mariano
    AUTOMATICA, 2021, 129
  • [3] On the exponentiality of stochastic linear systems under the Max-Plus algebra
    Chang, CS
    IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 1996, 41 (08) : 1182 - 1188
  • [4] Stochastic Filtering of Max-Plus Linear Systems With Bounded Disturbances
    Mendes, Rafael Santos
    Hardouin, Laurent
    Lhommeau, Mehdi
    IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2019, 64 (09) : 3706 - 3715
  • [5] Max-plus algebra and max-plus linear discrete event systems: An introduction
    De Schutter, Bart
    van den Boom, Ton
    WODES' 08: PROCEEDINGS OF THE 9TH INTERNATIONAL WORKSHOP ON DISCRETE EVENT SYSTEMS, 2008, : 36 - 42
  • [6] On the eigenstructure of a class of max-plus linear systems
    Lopes, G. A. D.
    Kersbergen, B.
    van den Boom, T.
    De Schutter, B.
    Babuska, R.
    2011 50TH IEEE CONFERENCE ON DECISION AND CONTROL AND EUROPEAN CONTROL CONFERENCE (CDC-ECC), 2011, : 1823 - 1828
  • [7] Comparison and aggregation of max-plus linear systems
    Ledoux, J
    Truffet, L
    LINEAR ALGEBRA AND ITS APPLICATIONS, 2004, 378 : 245 - 272
  • [8] Interval max-plus systems of linear equations
    Myskova, Helena
    LINEAR ALGEBRA AND ITS APPLICATIONS, 2012, 437 (08) : 1992 - 2000
  • [9] Tropical Abstractions of Max-Plus Linear Systems
    Mufid, Muhammad Syifa'ul
    Adzkiya, Dieky
    Abate, Alessandro
    FORMAL MODELING AND ANALYSIS OF TIMED SYSTEMS, FORMATS 2018, 2018, 11022 : 271 - 287
  • [10] Reachability for Interval Max-Plus Linear Systems
    Wang, Cailu
    Tao, Yuegang
    Yang, Peng
    PROCEEDINGS OF THE 36TH CHINESE CONTROL CONFERENCE (CCC 2017), 2017, : 2392 - 2396