A model-based deep reinforcement learning approach to the nonblocking coordination of modular supervisors of discrete event systems

被引:4
|
作者
Yang, Junjun [1 ]
Tan, Kaige [2 ]
Feng, Lei [2 ]
Li, Zhiwu [1 ,3 ]
机构
[1] Xidian Univ, Sch Electromech Engn, Xian 710071, Peoples R China
[2] KTH Royal Inst Technol, Dept Machine Design, S-10044 Stockholm, Sweden
[3] Macau Univ Sci & Technol, Inst Syst Engn, Taipa, Macau, Peoples R China
基金
国家重点研发计划; 中国国家自然科学基金;
关键词
Deep reinforcement learning; Discrete event system; Local modular control; Supervisory control theory; COMPLEXITY; DESIGN;
D O I
10.1016/j.ins.2023.02.033
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Modular supervisory control may lead to conflicts among the modular supervisors for large-scale discrete event systems. The existing methods for ensuring nonblocking control of modular supervisors either exploit favorable structures in the system model to guarantee the nonblocking property of modular supervisors or employ hierarchical model abstraction methods for reducing the computational complexity of designing a nonblocking coordinator. The nonblocking modular control problem is, in general, NP-hard. This study integrates supervisory control theory and a model-based deep reinforcement learning method to synthesize a nonblocking coordinator for the modular supervisors. The deep reinforcement learning method significantly reduces the computational complexity by avoiding the computation of synchronization of multiple modular supervisors and the plant models. The supervisory control function is approximated by the deep neural network instead of a large-sized finite automaton. Furthermore, the proposed model-based deep reinforcement learning method is more efficient than the standard deep Q network algorithm.
引用
收藏
页码:305 / 321
页数:17
相关论文
共 50 条
  • [21] An Efficient Approach to Model-Based Hierarchical Reinforcement Learning
    Li, Zhuoru
    Narayan, Akshay
    Leong, Tze-Yun
    THIRTY-FIRST AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2017, : 3583 - 3589
  • [22] A Model-based Factored Bayesian Reinforcement Learning Approach
    Wu, Bo
    Feng, Yanpeng
    Zheng, Hongyan
    APPLIED SCIENCE, MATERIALS SCIENCE AND INFORMATION TECHNOLOGIES IN INDUSTRY, 2014, 513-517 : 1092 - 1095
  • [23] Model-Based Diagnosis of Discrete Event Systems with an Incomplete System Model
    Zhao, Xiangfu
    Ouyang, Dantong
    ECAI 2008, PROCEEDINGS, 2008, 178 : 189 - +
  • [24] Model-based inverse reinforcement learning for deterministic systems
    Self, Ryan
    Abudia, Moad
    Mahmud, S. M. Nahid
    Kamalapurkar, Rushikesh
    AUTOMATICA, 2022, 140
  • [25] Decentralized supervisory control of discrete event systems based on reinforcement learning
    Yamasaki, T
    Ushio, T
    IEICE TRANSACTIONS ON FUNDAMENTALS OF ELECTRONICS COMMUNICATIONS AND COMPUTER SCIENCES, 2005, E88A (11) : 3045 - 3050
  • [26] Optimal LLP supervisor for discrete event systems based on reinforcement learning
    Umemoto, Hijiri
    Yamasaki, Tatsushi
    2015 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC 2015): BIG DATA ANALYTICS FOR HUMAN-CENTRIC SYSTEMS, 2015, : 545 - 550
  • [27] Incremental model evolution and reusability of supervisors for discrete event systems
    Chen, YL
    Lafortune, S
    Lin, F
    AUTOMATICA, 2000, 36 (02) : 243 - 259
  • [28] Knowledge Transfer using Model-Based Deep Reinforcement Learning
    Boloka, Tlou
    Makondo, Ndivhuwo
    Rosman, Benjamin
    2021 SOUTHERN AFRICAN UNIVERSITIES POWER ENGINEERING CONFERENCE/ROBOTICS AND MECHATRONICS/PATTERN RECOGNITION ASSOCIATION OF SOUTH AFRICA (SAUPEC/ROBMECH/PRASA), 2021,
  • [29] Model-based deep reinforcement learning for wind energy bidding
    Sanayha, Manassakan
    Vateekul, Peerapon
    INTERNATIONAL JOURNAL OF ELECTRICAL POWER & ENERGY SYSTEMS, 2022, 136
  • [30] Deep Reinforcement Learning with Model-based Acceleration for Hyperparameter Optimization
    Chen, SenPeng
    Wu, Jia
    Chen, XiuYun
    2019 IEEE 31ST INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI 2019), 2019, : 170 - 177