Constructing MDP Abstractions Using Data With Formal Guarantees

被引:8
|
作者
Lavaei, Abolfazl [1 ]
Soudjani, Sadegh [2 ]
Frazzoli, Emilio [1 ]
Zamani, Majid [3 ,4 ]
机构
[1] Swiss Fed Inst Technol, Inst Dynam Syst & Control, CH-8092 Zurich, Switzerland
[2] Newcastle Univ, Sch Comp, Newcastle Upon Tyne NE4 5TG, Tyne & Wear, England
[3] Univ Colorado, Comp Sci Dept, Boulder, CO 80309 USA
[4] Ludwig Maximilians Univ Munchen, Dept Comp Sci, D-80539 Munich, Germany
来源
基金
瑞士国家科学基金会; 英国工程与自然科学研究理事会;
关键词
Stochastic processes; Trajectory; Control systems; Stochastic systems; Probabilistic logic; Markov processes; Picture archiving and communication systems; Data-driven synthesis; MDP abstractions; stochastic bisimulation functions; formal guarantees; STOCHASTIC-SYSTEMS; VERIFICATION; SAFETY;
D O I
10.1109/LCSYS.2022.3188535
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This letter is concerned with a data-driven technique for constructing finite Markov decision processes (MDPs) as finite abstractions of discrete-time stochastic control systems with unknown dynamics while providing formal closeness guarantees. The proposed scheme is based on notions of stochastic bisimulation functions (SBF) to capture the probabilistic distance between state trajectories of an unknown stochastic system and those of finite MDP. In our proposed setting, we first reformulate corresponding conditions of SBF as a robust convex program (RCP). We then propose a scenario convex program (SCP) associated to the original RCP by collecting a finite number of data from trajectories of the system. We ultimately construct an SBF between the data-driven finite MDP and the unknown stochastic system with a given confidence level by establishing a probabilistic relation between optimal values of the SCP and the RCP. We also propose two different approaches for the construction of finite MDPs from data. We illustrate the efficacy of our results over a nonlinear jet engine compressor with unknown dynamics. We construct a data-driven finite MDP as a suitable substitute of the original system to synthesize controllers maintaining the system in a safe set with some probability of satisfaction and a desirable confidence level.
引用
收藏
页码:460 / 465
页数:6
相关论文
共 50 条
  • [1] Data-Driven Controller Synthesis via Finite Abstractions With Formal Guarantees
    Ajeleye, Daniel
    Lavaei, Abolfazl
    Zamani, Majid
    IEEE CONTROL SYSTEMS LETTERS, 2023, 7 : 3453 - 3458
  • [2] Constructing appropriate data abstractions for mining classification knowledge
    Okubo, Y
    Kudoh, Y
    Haraguchi, M
    WEB KNOWLEDGE MANAGEMENT AND DECISION SUPPORTS, 2003, 2543 : 276 - 289
  • [3] MDP Abstractions from Data: Large-Scale Stochastic Networks
    Lavaei, Abolfazl
    2023 62ND IEEE CONFERENCE ON DECISION AND CONTROL, CDC, 2023, : 6058 - 6063
  • [4] A PRACTICAL ASSESSMENT OF FORMAL SPECIFICATION APPROACHES FOR DATA ABSTRACTIONS
    VENTOURIS, KP
    PINTELAS, PE
    JOURNAL OF SYSTEMS AND SOFTWARE, 1992, 17 (02) : 169 - 188
  • [5] Data-Driven Abstractions With Probabilistic Guarantees for Linear PETC Systems
    Peruffo, Andrea
    Mazo, Manuel, Jr.
    IEEE CONTROL SYSTEMS LETTERS, 2022, 7 : 115 - 120
  • [6] Guarantees for Sound Abstractions for Generalized Planning
    Bonet, Blai
    Fuentetaja, Raquel
    E-Martin, Yolanda
    Borrajo, Daniel
    PROCEEDINGS OF THE TWENTY-EIGHTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2019, : 1566 - 1573
  • [7] Constructing Symbolic Abstractions From Data Without Model Assumptions
    Devonport, Alex
    Saoud, Adnane
    Arcak, Murat
    2ND INTERNATIONAL WORKSHOP ON COMPUTATION-AWARE ALGORITHMIC DESIGN FOR CYBER-PHYSICAL SYSTEMS (CAADCPS 2022), 2022, : 9 - 10
  • [8] Safe abstractions of data encodings in formal security protocol models
    Pironti, Alfredo
    Sisto, Riccardo
    FORMAL ASPECTS OF COMPUTING, 2014, 26 (01) : 125 - 167
  • [9] Symbolic Abstractions with Guarantees: A Data-Driven Divide-and-Conquer Strategy
    Lavaei, Abolfazl
    2023 62ND IEEE CONFERENCE ON DECISION AND CONTROL, CDC, 2023, : 7994 - 7999
  • [10] Formal checking of properties in complex systems using abstractions
    Moundanos, D
    Abraham, JA
    NINTH GREAT LAKES SYMPOSIUM ON VLSI, PROCEEDINGS, 1999, : 280 - 283