Specification Revision for Markov Decision Processes with Optimal Trade-off

被引:0
|
作者
Lahijanian, M. [1 ]
Kwiatkowska, M. [1 ]
机构
[1] Univ Oxford, Dept Comp Sci, Oxford, England
基金
英国工程与自然科学研究理事会;
关键词
LOGIC; SYSTEMS;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Optimal control policy synthesis for probabilistic systems from high-level specifications is increasingly often studied. One major question that is commonly faced, however, is what to do when the optimal probability of achieving the specification is not satisfactory? We address this question by viewing the specification as a soft constraint and present a synthesis framework for MDPs that encodes and automates specification revision in a trade-off for higher probability. The method uses co-safe LTL as the specification language and quantifies the revisions to the specification according to user defined proposition costs. The framework computes a control policy that optimizes the trade-off between the probability of satisfaction and the cost of specification revision. The key idea of the method is a rule for the composition of the MDP, the automaton representing the specification, and the proposition costs such that all possible specification revisions along with their costs and probabilities of satisfaction are captured in one structure. The problem is then reduced to multi-objective optimization on an MDP. The power of the method is illustrated though simulations of a complex robotic scenario.
引用
下载
收藏
页码:7411 / 7418
页数:8
相关论文
共 50 条
  • [21] Multiresolution implementation of optimal trade-off correlation filters
    Bigue, L
    Ambs, P
    OPTICS IN COMPUTING 98, 1998, 3490 : 34 - 37
  • [22] Utility/privacy trade-off as regularized optimal transport
    Boursier, Etienne
    Perchet, Vianney
    MATHEMATICAL PROGRAMMING, 2024, 203 (1-2) : 703 - 726
  • [23] Utility/privacy trade-off as regularized optimal transport
    Etienne Boursier
    Vianney Perchet
    Mathematical Programming, 2024, 203 : 703 - 726
  • [24] Towards an Optimal Trade-off of Viterbi Decoder Design
    He, Jinjin
    Wang, Zhongfeng
    Cui, Zhigiang
    Li, Li
    ISCAS: 2009 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS, VOLS 1-5, 2009, : 3030 - +
  • [25] Optimal Taxation with a Trade-Off Between Income and Children
    Yoshida, Masatoshi
    JAPANESE ECONOMIC REVIEW, 1998, 49 (04) : 426 - 439
  • [26] Optimal Taxation with a Trade-Off Between Income and Children
    Masatoshi Yoshida
    The Japanese Economic Review, 1998, 49 : 426 - 439
  • [27] Nonlinear Optimal Trade-Off Control for LQG Problem
    Qian, Fucai
    Xie, Guo
    Liu, Ding
    Xie, Wenfang
    2010 AMERICAN CONTROL CONFERENCE, 2010, : 1931 - 1936
  • [28] Optimal Risk Trade-Off in Relative Performance Evaluation
    Wu, Martin G. H.
    JOURNAL OF MANAGEMENT ACCOUNTING RESEARCH, 2019, 31 (01) : 247 - 259
  • [29] OPTIMAL MONETARY-POLICY WITH A TRADE-OFF FUNCTION
    SCARFE, BL
    OXFORD ECONOMIC PAPERS-NEW SERIES, 1979, 31 (01): : 20 - 35
  • [30] Optimal implementation delay of taxation with trade-off for spectrally negative Lévy risk processes
    Wenyuan Wang
    Xueyuan Wu
    Cheng Chi
    European Actuarial Journal, 2021, 11 : 285 - 317