Constructing MDP Abstractions Using Data With Formal Guarantees

被引:8
|
作者
Lavaei, Abolfazl [1 ]
Soudjani, Sadegh [2 ]
Frazzoli, Emilio [1 ]
Zamani, Majid [3 ,4 ]
机构
[1] Swiss Fed Inst Technol, Inst Dynam Syst & Control, CH-8092 Zurich, Switzerland
[2] Newcastle Univ, Sch Comp, Newcastle Upon Tyne NE4 5TG, Tyne & Wear, England
[3] Univ Colorado, Comp Sci Dept, Boulder, CO 80309 USA
[4] Ludwig Maximilians Univ Munchen, Dept Comp Sci, D-80539 Munich, Germany
来源
基金
瑞士国家科学基金会; 英国工程与自然科学研究理事会;
关键词
Stochastic processes; Trajectory; Control systems; Stochastic systems; Probabilistic logic; Markov processes; Picture archiving and communication systems; Data-driven synthesis; MDP abstractions; stochastic bisimulation functions; formal guarantees; STOCHASTIC-SYSTEMS; VERIFICATION; SAFETY;
D O I
10.1109/LCSYS.2022.3188535
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This letter is concerned with a data-driven technique for constructing finite Markov decision processes (MDPs) as finite abstractions of discrete-time stochastic control systems with unknown dynamics while providing formal closeness guarantees. The proposed scheme is based on notions of stochastic bisimulation functions (SBF) to capture the probabilistic distance between state trajectories of an unknown stochastic system and those of finite MDP. In our proposed setting, we first reformulate corresponding conditions of SBF as a robust convex program (RCP). We then propose a scenario convex program (SCP) associated to the original RCP by collecting a finite number of data from trajectories of the system. We ultimately construct an SBF between the data-driven finite MDP and the unknown stochastic system with a given confidence level by establishing a probabilistic relation between optimal values of the SCP and the RCP. We also propose two different approaches for the construction of finite MDPs from data. We illustrate the efficacy of our results over a nonlinear jet engine compressor with unknown dynamics. We construct a data-driven finite MDP as a suitable substitute of the original system to synthesize controllers maintaining the system in a safe set with some probability of satisfaction and a desirable confidence level.
引用
收藏
页码:460 / 465
页数:6
相关论文
共 50 条
  • [41] VLSI SIMULATION AND DATA ABSTRACTIONS
    KATZENELSON, J
    WEITZ, E
    IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS, 1986, 5 (03) : 371 - 378
  • [42] An automatic method for the dynamic construction of abstractions of states of a formal model
    A. V. Kolchin
    Cybernetics and Systems Analysis, 2010, 46 (4) : 583 - 601
  • [43] QUANTITATIVE APPROXIMATION OF THE PROBABILITY DISTRIBUTION OF A MARKOV PROCESS BY FORMAL ABSTRACTIONS
    Soudjani, Sadegh Esmeil Zadeh
    Abate, Alessandro
    LOGICAL METHODS IN COMPUTER SCIENCE, 2015, 11 (03)
  • [44] Data-Driven Controller Synthesis via Co-Buchi Barrier Certificates With Formal Guarantees
    Ajeleye, Daniel
    Zamani, Majid
    IEEE CONTROL SYSTEMS LETTERS, 2024, 8 : 958 - 963
  • [45] Publishing Attributed Social Graphs with Formal Privacy Guarantees
    Jorgensen, Zach
    Yu, Ting
    Cormode, Graham
    SIGMOD'16: PROCEEDINGS OF THE 2016 INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA, 2016, : 107 - 122
  • [46] Formal Guarantees on the Robustness of a Classifier against Adversarial Manipulation
    Hein, Matthias
    Andriushchenko, Maksym
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 30 (NIPS 2017), 2017, 30
  • [47] AN AUTOMATIC METHOD FOR THE DYNAMIC CONSTRUCTION OF ABSTRACTIONS OF STATES OF A FORMAL MODEL
    Kolchin, A. V.
    CYBERNETICS AND SYSTEMS ANALYSIS, 2010, 46 (04) : 583 - 601
  • [48] Abstractions for dynamic data distribution
    Deltz, SJ
    Chamberlain, BL
    Snyder, L
    NINTH INTERNATIONAL WORKSHOP ON HIGH-LEVEL PARALLEL PROGRAMMING MODELS AND SUPPORTIVE ENVIRONMENTS, PROCEEDINGS, 2004, : 42 - 51
  • [49] Aggregation and Control of Populations of Thermostatically Controlled Loads by Formal Abstractions
    Soudjani, Sadegh Esmaeil Zadeh
    Abate, Alessandro
    IEEE TRANSACTIONS ON CONTROL SYSTEMS TECHNOLOGY, 2015, 23 (03) : 975 - 990
  • [50] Programming with shared data abstractions
    Dobson, S
    Goodeve, D
    SOLVING IRREGULARLY STRUCTURED PROBLEMS IN PARALLEL, 1997, 1253 : 93 - 102