Reinforcement Learning for Stochastic Max-Plus Linear Systems

被引:0
|
作者
Subramanian, Vignesh [1 ]
Farhadi, Farzaneh [2 ]
Soudjani, Sadegh [3 ]
机构
[1] Georgia Inst Technol, Atlanta, GA 30332 USA
[2] Newcastle Univ, Sch Engn, Newcastle Upon Tyne, Tyne & Wear, England
[3] Newcastle Univ, Sch Comp, Newcastle Upon Tyne, Tyne & Wear, England
基金
英国工程与自然科学研究理事会;
关键词
DISCRETE-EVENT SYSTEMS; REACHABILITY ANALYSIS;
D O I
10.1109/CDC49753.2023.10384207
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper studies the design of control policies for Discrete Event Systems under uncertainties. We capture the timing of the events using the framework of max-plus-linear systems in which the time between consecutive events depends on random delays with unknown distributions. Our policy synthesis approach is with respect to a cost function, and it can be extended directly to satisfy safety specifications on the timing of events. The main novelty of our approach is to translate the system evolution to a Markov decision process (MDP) that has an uncountable state space and develop a stochastic optimisation problem under the evolution of the MDP. To tackle the unknown distribution of uncertainties (thus unknown transition probabilities in the MDP), we employ model-free reinforcement learning to perform optimisations and find control policies for the system. Our implementation results on the 9-dimensional model of a railway network show superiority of our learning approach in comparison with the stochastic model predictive control approach.
引用
收藏
页码:5631 / 5638
页数:8
相关论文
共 50 条
  • [41] State geometric adjustability for interval max-plus linear systems
    Yin, Yingxuan
    Chen, Haiyong
    Tao, Yuegang
    IET Control Theory and Applications, 2024, 18 (17): : 2468 - 2481
  • [42] Observer-Based Controllers for Max-Plus Linear Systems
    Hardouin, Laurent
    Shang, Ying
    Maia, Carlos Andrey
    Cottenceau, Bertrand
    IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2017, 62 (05) : 2153 - 2165
  • [43] Max-plus linear inverse problems: 2-norm regression and system identification of max-plus linear dynamical systems with Gaussian noise
    Hook, James
    LINEAR ALGEBRA AND ITS APPLICATIONS, 2019, 579 : 1 - 31
  • [44] A fast approximation algorithm for the Lyapunov exponent of stochastic max-plus systems
    Goverde, Rob M. P.
    Heidergott, Bernd
    Merlet, Glenn
    WODES' 08: PROCEEDINGS OF THE 9TH INTERNATIONAL WORKSHOP ON DISCRETE EVENT SYSTEMS, 2008, : 49 - +
  • [45] Towards Geometric Control of Max-Plus Linear Systems with Applications to Manufacturing Systems
    Hardouin, Laurent
    Lhommeau, Mehdi
    Shang, Ying
    2011 50TH IEEE CONFERENCE ON DECISION AND CONTROL AND EUROPEAN CONTROL CONFERENCE (CDC-ECC), 2011, : 1149 - 1154
  • [46] Optimistic optimization for model predictive control of max-plus linear systems
    Xu, Jia
    van den Boom, Ton
    De Schutter, Bart
    AUTOMATICA, 2016, 74 : 16 - 22
  • [47] A Compositional Model for Multi-Rate Max-Plus Linear Systems
    Elahi, H.
    Geilen, M.
    Basten, T.
    IFAC PAPERSONLINE, 2020, 53 (04): : 54 - 61
  • [48] A tour of systems with the max-plus flavor
    Cohen, Guy
    POSITIVE SYSTEMS, PROCEEDINGS, 2006, 341 : 19 - 24
  • [49] MAX-PLUS AGEBRA IN QUEUING SYSTEMS
    Nemcova, Zuzana
    HRADECKE EKONOMICKE DNY 2011, DIL I: EKONOMICKY ROZVOJ A MANAGEMENT REGIONU. ECONOMIC DEVELOPMENT AND MANAGEMENT OF REGIONS, 2011, : 215 - 219
  • [50] Bi-Objective Optimization for Interval Max-Plus Linear Systems
    Wang, Cailu
    Zhang, Jiye
    Chen, Pengcheng
    Zhao, Haichao
    MATHEMATICS, 2024, 12 (05)