A model-based deep reinforcement learning approach to the nonblocking coordination of modular supervisors of discrete event systems

被引:4
|
作者
Yang, Junjun [1 ]
Tan, Kaige [2 ]
Feng, Lei [2 ]
Li, Zhiwu [1 ,3 ]
机构
[1] Xidian Univ, Sch Electromech Engn, Xian 710071, Peoples R China
[2] KTH Royal Inst Technol, Dept Machine Design, S-10044 Stockholm, Sweden
[3] Macau Univ Sci & Technol, Inst Syst Engn, Taipa, Macau, Peoples R China
基金
国家重点研发计划; 中国国家自然科学基金;
关键词
Deep reinforcement learning; Discrete event system; Local modular control; Supervisory control theory; COMPLEXITY; DESIGN;
D O I
10.1016/j.ins.2023.02.033
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Modular supervisory control may lead to conflicts among the modular supervisors for large-scale discrete event systems. The existing methods for ensuring nonblocking control of modular supervisors either exploit favorable structures in the system model to guarantee the nonblocking property of modular supervisors or employ hierarchical model abstraction methods for reducing the computational complexity of designing a nonblocking coordinator. The nonblocking modular control problem is, in general, NP-hard. This study integrates supervisory control theory and a model-based deep reinforcement learning method to synthesize a nonblocking coordinator for the modular supervisors. The deep reinforcement learning method significantly reduces the computational complexity by avoiding the computation of synchronization of multiple modular supervisors and the plant models. The supervisory control function is approximated by the deep neural network instead of a large-sized finite automaton. Furthermore, the proposed model-based deep reinforcement learning method is more efficient than the standard deep Q network algorithm.
引用
收藏
页码:305 / 321
页数:17
相关论文
共 50 条
  • [1] The bottleneck simulator: A model-based deep reinforcement learning approach
    Serban, Iulian Vlad
    Sankar, Chinnadhurai
    Pieper, Michael
    Pineau, Joelle
    Bengio, Yoshua
    Journal of Artificial Intelligence Research, 2020, 69 : 571 - 612
  • [2] The Bottleneck Simulator: A Model-Based Deep Reinforcement Learning Approach
    Serban, Iulian Vlad
    Sankar, Chinnadhurai
    Pieper, Michael
    Pineau, Joelle
    Bengio, Yoshua
    JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 2020, 69 : 571 - 612
  • [3] A Learning-Based Synthesis Approach to the Supremal Nonblocking Supervisor of Discrete-Event Systems
    Zhang, Huimin
    Feng, Lei
    Li, Zhiwu
    IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2018, 63 (10) : 3345 - 3360
  • [4] Nonblocking Supervisors for Similarity Control of Nondeterministic Discrete Event Systems under Event and State Observations
    Yamada, Hiroki
    Takai, Shigemasa
    2017 56TH ANNUAL CONFERENCE OF THE SOCIETY OF INSTRUMENT AND CONTROL ENGINEERS OF JAPAN (SICE), 2017, : 160 - 163
  • [5] Bounded Synthesis and Reinforcement Learning of Supervisors for Stochastic Discrete Event Systems With LTL Specifications
    Oura, Ryohei
    Ushio, Toshimitsu
    Sakakibara, Ami
    IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2024, 69 (10) : 6668 - 6683
  • [6] Language model approach to nonblocking supervisor synthesis for nondeterministic discrete event systems
    Park, SJ
    Lim, JT
    2001 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION, VOLS I-IV, PROCEEDINGS, 2001, : 650 - 655
  • [7] Maximally Permissive Supervisors for Nonblocking Similarity Control of Nondeterministic Discrete-Event Systems
    Li, Jinglun
    Takai, Shigemasa
    IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2023, 68 (06) : 3529 - 3544
  • [8] Model-based diagnosis of discrete event systems
    Flochova, Jana
    Jiroveanu, George
    Boel, Rene K.
    ITI 2006: PROCEEDINGS OF THE 28TH INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY INTERFACES, 2006, : 519 - +
  • [9] Learning to Paint With Model-based Deep Reinforcement Learning
    Huang, Zhewei
    Heng, Wen
    Zhou, Shuchang
    2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 8708 - 8717
  • [10] Calibrated Model-Based Deep Reinforcement Learning
    Malik, Ali
    Kuleshov, Volodymyr
    Song, Jiaming
    Nemer, Danny
    Seymour, Harlan
    Ermon, Stefano
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 97, 2019, 97