Constructing MDP Abstractions Using Data With Formal Guarantees

Cited by: 8
Authors
Lavaei, Abolfazl [1]
Soudjani, Sadegh [2]
Frazzoli, Emilio [1]
Zamani, Majid [3, 4]
Affiliations
[1] Swiss Fed Inst Technol, Inst Dynam Syst & Control, CH-8092 Zurich, Switzerland
[2] Newcastle Univ, Sch Comp, Newcastle Upon Tyne NE4 5TG, Tyne & Wear, England
[3] Univ Colorado, Comp Sci Dept, Boulder, CO 80309 USA
[4] Ludwig Maximilians Univ Munchen, Dept Comp Sci, D-80539 Munich, Germany
Source
Funding
Swiss National Science Foundation; UK Engineering and Physical Sciences Research Council;
Keywords
Stochastic processes; Trajectory; Control systems; Stochastic systems; Probabilistic logic; Markov processes; Picture archiving and communication systems; Data-driven synthesis; MDP abstractions; stochastic bisimulation functions; formal guarantees; STOCHASTIC-SYSTEMS; VERIFICATION; SAFETY;
DOI
10.1109/LCSYS.2022.3188535
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract
This letter is concerned with a data-driven technique for constructing finite Markov decision processes (MDPs) as finite abstractions of discrete-time stochastic control systems with unknown dynamics while providing formal closeness guarantees. The proposed scheme is based on notions of stochastic bisimulation functions (SBFs) to capture the probabilistic distance between state trajectories of an unknown stochastic system and those of a finite MDP. In our proposed setting, we first reformulate the corresponding conditions of an SBF as a robust convex program (RCP). We then propose a scenario convex program (SCP) associated with the original RCP by collecting a finite number of data samples from trajectories of the system. We ultimately construct an SBF between the data-driven finite MDP and the unknown stochastic system with a given confidence level by establishing a probabilistic relation between the optimal values of the SCP and the RCP. We also propose two different approaches for the construction of finite MDPs from data. We illustrate the efficacy of our results on a nonlinear jet engine compressor with unknown dynamics. We construct a data-driven finite MDP as a suitable substitute for the original system to synthesize controllers maintaining the system in a safe set with some probability of satisfaction and a desirable confidence level.
Pages: 460-465
Page count: 6
Related papers
50 in total
  • [31] Efficient Processing of Streaming Data using Multiple Abstractions
    Qadeer, Abdul
    Heidemann, John
    2021 IEEE 14TH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING (CLOUD 2021), 2021, : 157 - 167
  • [32] Data-Driven Abstractions via Binary-Tree Gaussian Processes for Formal Verification
    Schon, Oliver
    Naseer, Shammakh
    Wooding, Ben
    Soudjani, Sadegh
    IFAC PAPERSONLINE, 2024, 58 (11): : 115 - 122
  • [33] Constructing Control System Abstractions from Modular Components
    Kim, Eric S.
    Arcak, Murat
    Zamani, Majid
    HSCC 2018: PROCEEDINGS OF THE 21ST INTERNATIONAL CONFERENCE ON HYBRID SYSTEMS: COMPUTATION AND CONTROL (PART OF CPS WEEK), 2018, : 137 - 146
  • [34] Abstractions of data types
    Ferucio Laurenţiu Ţiplea
    Constantin Enea
    Acta Informatica, 2006, 42 : 639 - 671
  • [35] Abstractions of data types
    Tiplea, FL
    Enea, C
    ACTA INFORMATICA, 2006, 42 (8-9) : 639 - 671
  • [36] Formal Method to Derive Interoperability Requirements and Guarantees
    El-Gendy, Hazem
    Amer, Magdi
    Talkhan, Ihab
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2013, 4 (01) : 9 - 14
  • [37] LiDAR Point Cloud Registration with Formal Guarantees
    Marchi, Matteo
    Bunton, Jonathan
    Gharesifard, Bahman
    Tabuada, Paulo
    2022 IEEE 61ST CONFERENCE ON DECISION AND CONTROL (CDC), 2022, : 3462 - 3467
  • [38] Constructing (Bi)Similar Finite State Abstractions using Asynchronous l-Complete Approximations
    Schmuck, Anne-Kathrin
    Raisch, Joerg
    2014 IEEE 53RD ANNUAL CONFERENCE ON DECISION AND CONTROL (CDC), 2014, : 6744 - 6751
  • [39] Poster Abstract: Data-Driven Estimation of Collision Risks for Autonomous Vehicles with Formal Guarantees
    Lavaei, Abolfazl
    Di Lillo, Luigi
    Atzei, Margherita
    Censi, Andrea
    Frazzoli, Emilio
    HSCC 2022: PROCEEDINGS OF THE 25TH ACM INTERNATIONAL CONFERENCE ON HYBRID SYSTEMS: COMPUTATION AND CONTROL (PART OF CPS-IOT WEEK 2022), 2022,
  • [40] RELIABLE SOFTWARE THROUGH REQUIREMENTS DEFINITION USING DATA ABSTRACTIONS
    BEZANSON, WR
    MICROELECTRONICS AND RELIABILITY, 1978, 17 (01): : 85 - 91