Specification Revision for Markov Decision Processes with Optimal Trade-off

被引:0
|
作者
Lahijanian, M. [1 ]
Kwiatkowska, M. [1 ]
机构
[1] Univ Oxford, Dept Comp Sci, Oxford, England
基金
英国工程与自然科学研究理事会;
关键词
LOGIC; SYSTEMS;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Optimal control policy synthesis for probabilistic systems from high-level specifications is increasingly often studied. One major question that is commonly faced, however, is what to do when the optimal probability of achieving the specification is not satisfactory? We address this question by viewing the specification as a soft constraint and present a synthesis framework for MDPs that encodes and automates specification revision in a trade-off for higher probability. The method uses co-safe LTL as the specification language and quantifies the revisions to the specification according to user defined proposition costs. The framework computes a control policy that optimizes the trade-off between the probability of satisfaction and the cost of specification revision. The key idea of the method is a rule for the composition of the MDP, the automaton representing the specification, and the proposition costs such that all possible specification revisions along with their costs and probabilities of satisfaction are captured in one structure. The problem is then reduced to multi-objective optimization on an MDP. The power of the method is illustrated though simulations of a complex robotic scenario.
引用
收藏
页码:7411 / 7418
页数:8
相关论文
共 50 条
  • [31] Uncertainty trade-off and disturbance trade-off for quantum measurements
    Srinivas, M. D.
    Mandayam, Prabha
    CURRENT SCIENCE, 2015, 109 (11): : 2044 - 2051
  • [32] Optimal Efficiency-Envy Trade-Off via Optimal Transport
    Yin, Steven
    Kroer, Christian
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
  • [33] Optimal Policies for Quantum Markov Decision Processes
    Ying, Ming-Sheng
    Feng, Yuan
    Ying, Sheng-Gang
    INTERNATIONAL JOURNAL OF AUTOMATION AND COMPUTING, 2021, 18 (03) : 410 - 421
  • [34] Optimal adaptive policies for Markov decision processes
    Burnetas, AN
    Katehakis, MN
    MATHEMATICS OF OPERATIONS RESEARCH, 1997, 22 (01) : 222 - 255
  • [35] Optimal Policies for Quantum Markov Decision Processes
    Ming-Sheng Ying
    Yuan Feng
    Sheng-Gang Ying
    Machine Intelligence Research, 2021, 18 (03) : 410 - 421
  • [36] IDENTIFICATION OF OPTIMAL POLICIES IN MARKOV DECISION PROCESSES
    Sladky, Karel
    KYBERNETIKA, 2010, 46 (03) : 558 - 570
  • [37] Free trade trade-off
    Freedman, M
    FORBES, 2002, 169 (06): : 44 - 44
  • [38] Optimal Policies for Quantum Markov Decision Processes
    Ming-Sheng Ying
    Yuan Feng
    Sheng-Gang Ying
    International Journal of Automation and Computing, 2021, 18 : 410 - 421
  • [39] Bayesian Optimal Decision Making in Risk-Return Trade-off under Spatiotemporal Motor Variability
    Yao, Qirui
    Sakaguchi, Yutaka
    I-PERCEPTION, 2019, 10 : 102 - 103
  • [40] Space-Time Trade-Off in Decision Analysis Software
    Danielson, Mats
    Ekenberg, Love
    NEW TRENDS IN INTELLIGENT SOFTWARE METHODOLOGIES, TOOLS AND TECHNIQUES (SOMET_18), 2018, 303 : 251 - 258