Certified policy synthesis for general Markov decision processes: An application in building automation systems

Cited by: 11
Authors
Haesaert, Sofie [1]
Cauchi, Nathalie [2]
Abate, Alessandro [2]
Affiliations
[1] Tech Univ Eindhoven, Dept Elect Engn, Eindhoven, Netherlands
[2] Univ Oxford, Dept Comp Sci, Wolfson Bldg,Parks Rd, Oxford, England
Keywords
Verification; Synthesis; General Markov decision processes; Safety; Building automation systems; Temperature control; MODEL-PREDICTIVE CONTROL; PROBABILITY-MEASURES; ENERGY MANAGEMENT; REDUCTION; EXISTENCE
DOI
10.1016/j.peva.2017.09.005
Chinese Library Classification
TP3 [Computing technology, computer technology]
Discipline classification code
0812
Abstract
In this paper, we present an industrial application of new approximate similarity relations for Markov models, and show that they are key for the synthesis of control strategies. Typically, modern engineering systems are modelled using complex, high-order models, which makes correct-by-design controller construction computationally hard. Using the new approximate similarity relations, this complexity is reduced and we provide certificates on the performance of the synthesised policies. The application deals with stochastic models for the thermal dynamics in a "smart building" setup: such a building automation system can be described by discrete-time Markov decision processes evolving over an uncountable state space and endowed with an output quantifying the room temperature. The new similarity relations draw a quantitative connection between different levels of model abstraction, and allow control strategies synthesised on simpler models to be quantitatively refined over more complex ones. The new relations, underpinned by the use of metrics, allow in particular for a useful trade-off between deviations over probability distributions on states and distances between model outputs. We develop a software toolbox supporting the application and the computational implementation of these new relations. (C) 2017 Elsevier B.V. All rights reserved.
Pages: 75-103
Page count: 29
Related papers
50 in total
  • [1] Switched Linear Systems Meet Markov Decision Processes: Stability Guaranteed Policy Synthesis
    Wu, Bo
    Cubuktepe, Murat
    Topcu, Ufuk
    2019 IEEE 58TH CONFERENCE ON DECISION AND CONTROL (CDC), 2019, : 2509 - 2515
  • [2] Policy Iteration for Parameterized Markov Decision Processes and Its Application
    Xia, Li
    Jia, Qing-Shan
    2013 9TH ASIAN CONTROL CONFERENCE (ASCC), 2013,
  • [3] Privacy-Preserving Policy Synthesis in Markov Decision Processes
    Gohari, Parham
    Hale, Matthew
    Topcu, Ufuk
    2020 59TH IEEE CONFERENCE ON DECISION AND CONTROL (CDC), 2020, : 6266 - 6271
  • [4] VERIFICATION OF GENERAL MARKOV DECISION PROCESSES BY APPROXIMATE SIMILARITY RELATIONS AND POLICY REFINEMENT
    Haesaert, Sofie
    Soudjani, Sadegh Esmaeil Zadeh
    Abate, Alessandro
    SIAM JOURNAL ON CONTROL AND OPTIMIZATION, 2017, 55 (04) : 2333 - 2367
  • [5] Verification of General Markov Decision Processes by Approximate Similarity Relations and Policy Refinement
    Haesaert, Sofie
    Abate, Alessandro
    Van den Hof, Paul M. J.
    QUANTITATIVE EVALUATION OF SYSTEMS, QEST 2016, 2016, 9826 : 227 - 243
  • [6] Temporal logic control of general Markov decision processes by approximate policy refinement
    Haesaert, Sofie
    Soudjani, Sadegh
    Abate, Alessandro
    IFAC PAPERSONLINE, 2018, 51 (16) : 73 - 78
  • [7] Steady-State Policy Synthesis in Multichain Markov Decision Processes
    Atia, George
    Beckus, Andre
    Alkhouri, Ismail
    Velasquez, Alvaro
    PROCEEDINGS OF THE TWENTY-NINTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2020, : 4069 - 4075
  • [8] Navigating to the Best Policy in Markov Decision Processes
    Al Marjani, Aymen
    Garivier, Aurelien
    Proutiere, Alexandre
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
  • [9] The policy iteration algorithm for average reward Markov decision processes with general state space
    Meyn, SP
    IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 1997, 42 (12) : 1663 - 1680
  • [10] Geometric Policy Iteration for Markov Decision Processes
    Wu, Yue
    De Loera, Jesus A.
    PROCEEDINGS OF THE 28TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, KDD 2022, 2022, : 2070 - 2078