Optimal control of discrete event systems under uncertain environment based on supervisory control theory and reinforcement learning

被引:0
|
作者
Liu, Yingjun [1 ]
Liu, Fuchun [1 ]
机构
[1] Guangdong Univ Technol, Sch Comp Sci & Technol, Guangzhou 510006, Peoples R China
来源
SCIENTIFIC REPORTS | 2024年 / 14卷 / 01期
关键词
Discrete event systems; Optimal control; Supervisory control theory; Reinforcement learning;
D O I
10.1038/s41598-024-76371-4
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Discrete event systems (DESs) are powerful abstract representations for large human-made physical systems in a wide variety of industries. Safety control issues on DESs have been extensively studied based on the logical specifications of the systems in various literature. However, when facing the DESs under uncertain environment which brings into the implicit specifications, the classical supervisory control approach may not be capable of achieving the performance. So in this research, we propose a new approach for optimal control of DESs under uncertain environment based on supervisory control theory (SCT) and reinforcement learning (RL). Firstly, we use SCT to gather deliberative planning algorithms with the aim to safe control. Then we convert the supervised system to Markov Decision Process simulation environments that is suitable for optimal algorithm training. Furthermore, a SCT-based RL algorithm is designed to maximize performance of the system based on the probabilistic attributes of the state transitions. Finally, a case study on the autonomous navigation task of a delivery robot is provided to corroborate the proposed method by multiple simulation experiments. The result shows the proposed approach owning 8.27%\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\%$$\end{document} performance improvement compared with the non-intelligent methods. This research will contribute to further studying the optimal control of human-made physical systems in a wide variety of industries.
引用
收藏
页数:13
相关论文
共 50 条
  • [21] OPTIMAL SUPERVISORY CONTROL OF DISCRETE-EVENT DYNAMICAL-SYSTEMS
    KUMAR, R
    GARG, VK
    SIAM JOURNAL ON CONTROL AND OPTIMIZATION, 1995, 33 (02) : 419 - 439
  • [22] Metric Based Nonblocking Supervisory Control of Discrete Event Systems
    Park, Jun-Sang
    Jo, Hyun-Wook
    Oh, Jun-Han
    Lim, Jong-Tae
    INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION AND SYSTEMS (ICCAS 2010), 2010, : 627 - 630
  • [23] Supervisory Control of Fuzzy Discrete Event Systems Based on Agent
    张颖
    邵世煌
    Journal of Shanghai Jiaotong University(Science), 2006, (04) : 465 - 471
  • [24] Supervisory control of discrete-event systems
    Komenda, Jan
    Masopust, Tomáš
    Lecture Notes in Control and Information Sciences, 2015, 456 : 129 - 136
  • [25] MODULAR SUPERVISORY CONTROL OF DISCRETE EVENT SYSTEMS
    RAMADGE, PJ
    WONHAM, WM
    LECTURE NOTES IN CONTROL AND INFORMATION SCIENCES, 1986, 83 : 202 - 214
  • [26] Reinforcement learning event-triggered output feedback control for uncertain nonlinear discrete systems
    Ren, Jianwei
    Li, Ping
    Song, Zhibao
    TRANSACTIONS OF THE INSTITUTE OF MEASUREMENT AND CONTROL, 2024, 46 (08) : 1467 - 1488
  • [27] Supervisory control of Boolean Discrete event systems
    Lu Jianning
    Zhao Guangzhou
    Proceedings of the 24th Chinese Control Conference, Vols 1 and 2, 2005, : 950 - 953
  • [28] Supervisory control of discrete event systems with distinguishers
    Cury, Jose E. R.
    de Queiroz, Max Hering
    Bouzon, Gustavo
    Teixeira, Marcelo
    AUTOMATICA, 2015, 56 : 93 - 104
  • [29] Supervisory control of fuzzy discrete event systems
    Cao, YZ
    Ying, MS
    IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART B-CYBERNETICS, 2005, 35 (02): : 366 - 371
  • [30] Supervisory control of interacting discrete event systems
    Abdelwahed, S
    Wonham, WM
    PROCEEDINGS OF THE 41ST IEEE CONFERENCE ON DECISION AND CONTROL, VOLS 1-4, 2002, : 1175 - 1180