Specification Revision for Markov Decision Processes with Optimal Trade-off

被引:0
|
作者
Lahijanian, M. [1 ]
Kwiatkowska, M. [1 ]
机构
[1] Univ Oxford, Dept Comp Sci, Oxford, England
基金
英国工程与自然科学研究理事会;
关键词
LOGIC; SYSTEMS;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Optimal control policy synthesis for probabilistic systems from high-level specifications is increasingly often studied. One major question that is commonly faced, however, is what to do when the optimal probability of achieving the specification is not satisfactory? We address this question by viewing the specification as a soft constraint and present a synthesis framework for MDPs that encodes and automates specification revision in a trade-off for higher probability. The method uses co-safe LTL as the specification language and quantifies the revisions to the specification according to user defined proposition costs. The framework computes a control policy that optimizes the trade-off between the probability of satisfaction and the cost of specification revision. The key idea of the method is a rule for the composition of the MDP, the automaton representing the specification, and the proposition costs such that all possible specification revisions along with their costs and probabilities of satisfaction are captured in one structure. The problem is then reduced to multi-objective optimization on an MDP. The power of the method is illustrated though simulations of a complex robotic scenario.
引用
收藏
页码:7411 / 7418
页数:8
相关论文
共 50 条
  • [1] Trade-Off in Decision-Making Processes
    Marco, Campi
    [J]. PROCEEDINGS OF THE 2012 24TH CHINESE CONTROL AND DECISION CONFERENCE (CCDC), 2012, : XX - XX
  • [2] Optimal Energy Trade-off Schedules
    Cole, Daniel
    Letsios, Dimitrios
    Nugent, Michael
    Pruhs, Kirk
    [J]. 2012 INTERNATIONAL GREEN COMPUTING CONFERENCE (IGCC), 2012,
  • [3] Optimal energy trade-off schedules
    Barcelo, Neal
    Cole, Daniel
    Letsios, Dimitrios
    Nugent, Michael
    Pruhs, Kirk
    [J]. SUSTAINABLE COMPUTING-INFORMATICS & SYSTEMS, 2013, 3 (03): : 207 - 217
  • [4] RE-STORM: Mapping the Decision-Making Problem and Non-Functional Requirements Trade-off to Partially Observable Markov Decision Processes
    Paucar, Luis H. Garcia
    Bencomo, Nelly
    [J]. 2018 IEEE/ACM 13TH INTERNATIONAL SYMPOSIUM ON SOFTWARE ENGINEERING FOR ADAPTIVE AND SELF-MANAGING SYSTEMS (SEAMS), 2018, : 19 - 25
  • [5] Time-cost trade-off via optimal control theory in Markov PERT networks
    Amir Azaron
    Hideki Katagiri
    Masatoshi Sakawa
    [J]. Annals of Operations Research, 2007, 150 : 47 - 64
  • [6] Time-cost trade-off via optimal control theory in Markov PERT networks
    Azaron, Amir
    Katagiri, Hideki
    Sakawa, Masatoshi
    [J]. ANNALS OF OPERATIONS RESEARCH, 2007, 150 (01) : 47 - 64
  • [7] Optimal trade-off for Merkle tree traversal
    Berman, Piotr
    Karpinski, Marek
    Nekrich, Yakov
    [J]. THEORETICAL COMPUTER SCIENCE, 2007, 372 (01) : 26 - 36
  • [8] Earth loading and hauling optimal trade-off
    Marinelli, Marina
    Lambropoulos, Sergios
    [J]. TRANSPORT RESEARCH ARENA 2012, 2012, 48 : 2325 - 2335
  • [9] Optimal trade-off filter for the correlation of fingerprints
    Roberge, D
    Soutar, C
    Kumar, BVKV
    [J]. OPTICAL ENGINEERING, 1999, 38 (01) : 108 - 113
  • [10] Optimal trade-off between exploration and exploitation
    Simpkins, Alex
    de Callafon, Raymond
    Todorov, Emanuel
    [J]. 2008 AMERICAN CONTROL CONFERENCE, VOLS 1-12, 2008, : 33 - +