Permissive Supervisor Synthesis for Markov Decision Processes Through Learning

Cited by: 10
Authors
Wu, Bo [1]
Zhang, Xiaobin [1]
Lin, Hai [1]
Affiliations
[1] Univ Notre Dame, Dept Elect Engn, Notre Dame, IN 46556 USA
Keywords
Automata learning; formal methods; model checking; supervisor synthesis; GUARANTEE; ASSUME;
DOI
10.1109/TAC.2018.2879505
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
This paper considers permissive supervisor synthesis for probabilistic systems modeled as Markov decision processes (MDPs). Such systems are prevalent in power grids, transportation networks, communication networks, and robotics. We propose a novel supervisor synthesis framework that uses automata learning and compositional model checking to generate permissive local supervisors in a distributed manner. Building on recent advances in assume-guarantee verification for MDPs, the framework avoids constructing the composed system, which alleviates state-space explosion. It learns the supervisors iteratively from counterexamples returned by the verification step, and it is guaranteed to terminate in a finite number of steps and to be correct.
Pages: 3332-3338
Number of pages: 7