Permissive Supervisor Synthesis for Markov Decision Processes Through Learning

被引：10

作者：

Wu, Bo ^{[1
]}

Zhang, Xiaobin ^{[1
]}

Lin, Hai ^{[1
]}

机构：

[1] Univ Notre Dame, Dept Elect Engn, Notre Dame, IN 46556 USA

来源：

IEEE TRANSACTIONS ON AUTOMATIC CONTROL | 2019年 / 64卷 / 08期

关键词：

Automata learning; formal methods; model checking; supervisor synthesis; GUARANTEE; ASSUME;

D O I：

10.1109/TAC.2018.2879505

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

This paper considers the permissive supervisor synthesis for probabilistic systems modeled as Markov Decision Processes (MDP). Such systems are prevalent in power grids, transportation networks, communication networks, and robotics. We propose a novel supervisor synthesis framework using automata learning and compositional model checking to generate the permissive local supervisors in a distributed manner. With the recent advances in assume-guarantee reasoning verification for MDPs, constructing the composed system can be avoided to alleviate the state space explosion. Our framework learns the supervisors iteratively using counterexamples from the verification and is guaranteed to terminate in finite steps and to be correct.

引用

页码：3332 / 3338

页数：7

共 50 条

[1] Counterexample-guided permissive supervisor synthesis for probabilistic systems through learning
Wu, Bo
Lin, Hai
[J]. 2015 AMERICAN CONTROL CONFERENCE (ACC), 2015, : 2894 - 2899
[2] Learning Parameterized Policies for Markov Decision Processes through Demonstrations
Hanawal, Manjesh K.
Liu, Hao
Zhu, Henghui
Paschalidis, Ioannis Ch.
[J]. 2016 IEEE 55TH CONFERENCE ON DECISION AND CONTROL (CDC), 2016, : 7087 - 7092
[3] Learning to Collaborate in Markov Decision Processes
Radanovic, Goran
Devidze, Rati
Parkes, David C.
Singla, Adish
[J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 97, 2019, 97
[4] Learning in Constrained Markov Decision Processes
Singh, Rahul
Gupta, Abhishek
Shroff, Ness B.
[J]. IEEE TRANSACTIONS ON CONTROL OF NETWORK SYSTEMS, 2023, 10 (01): : 441 - 453
[5] Counterexample-guided Distributed Permissive Supervisor Synthesis for Probabilistic Multi-agent Systems through Learning
Wu, Bo
Lin, Hai
[J]. 2016 AMERICAN CONTROL CONFERENCE (ACC), 2016, : 5519 - 5524
[6] Design Synthesis Through a Markov Decision Process and Reinforcement Learning Framework
Ororbia, Maximilian E.
Warn, Gordon P.
[J]. JOURNAL OF COMPUTING AND INFORMATION SCIENCE IN ENGINEERING, 2022, 22 (02)
[7] Blackwell Online Learning for Markov Decision Processes
Li, Tao
Peng, Guanze
Zhu, Quanyan
[J]. 2021 55TH ANNUAL CONFERENCE ON INFORMATION SCIENCES AND SYSTEMS (CISS), 2021,
[8] Online Learning in Kernelized Markov Decision Processes
Chowdhury, Sayak Ray
Gopalan, Aditya
[J]. 22ND INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 89, 2019, 89
[9] Learning Factored Markov Decision Processes with Unawareness
Innes, Craig
Lascarides, Alex
[J]. 35TH UNCERTAINTY IN ARTIFICIAL INTELLIGENCE CONFERENCE (UAI 2019), 2020, 115 : 123 - 133
[10] Bayesian Learning of Noisy Markov Decision Processes
Singh, Sumeetpal S.
Chopin, Nicolas
Whiteley, Nick
[J]. ACM TRANSACTIONS ON MODELING AND COMPUTER SIMULATION, 2013, 23 (01):

← 1 2 3 4 5 →