A Fully Polynomial Time Approximation Scheme for Constrained MDPs under Local Transitions

被引:0
|
作者
Khonji, Majid [1 ]
机构
[1] Khalifa Univ, Elect Engn & Comp Sci Dept, Abu Dhabi, U Arab Emirates
关键词
MARKOV DECISION-PROCESSES; HEURISTIC-SEARCH;
D O I
10.1109/CDC49753.2023.10383727
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The fixed-horizon constrained Markov Decision Process (C-MDP) is a well-known model for planning in stochastic environments under operating constraints. Chance-constrained MDP (CC-MDP) is a variant that allows bounding the probability of constraint violation, which is desired in many safety-critical applications. CC-MDP can also model a class of MDPs, called Stochastic Shortest Path (SSP), under dead-ends, where there is a trade-off between the probability-togoal and cost-to-goal. This work studies the structure of (C)C-MDP, particularly an important variant that involves local transition. In this variant, the state reachability exhibits a certain degree of locality and independence from the remaining states. More precisely, the number of states, at a given time, that share some reachable future states is always constant. (C)C-MDP under local transition is NP-Hard even for a planning horizon of two. In this work, we propose a fully polynomial-time approximation scheme for (C)C-MDP that computes (near) optimal deterministic policies. Such an algorithm is among the best approximation algorithms attainable in theory and gives insights into the approximability of constrained MDP and its variants.
引用
收藏
页码:1782 / 1789
页数:8
相关论文
共 50 条
  • [1] A fully polynomial bicriteria approximation scheme for the constrained spanning tree problem
    Hong, SP
    Chung, SJ
    Park, BH
    OPERATIONS RESEARCH LETTERS, 2004, 32 (03) : 233 - 239
  • [2] A Fully Polynomial-Time Approximation Scheme for Timing-Constrained Minimum Cost Layer Assignment
    Hu, Shiyan
    Li, Zhuo
    Alpert, Charles J.
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS II-EXPRESS BRIEFS, 2009, 56 (07) : 580 - 584
  • [3] An asymptotic fully polynomial time approximation scheme for bin covering
    Jansen, K
    Solis-Oba, R
    ALGORITHMS AND COMPUTATION, PROCEEDINGS, 2002, 2518 : 175 - 186
  • [4] A fully polynomial time approximation scheme for the Replenishment Storage problem
    Hochbaum, Dorit S.
    Rao, Xu
    OPERATIONS RESEARCH LETTERS, 2020, 48 (06) : 835 - 839
  • [5] A New Fully Polynomial Time Approximation Scheme for the Knapsack Problem
    Hans Kellerer
    Ulrich Pferschy
    Journal of Combinatorial Optimization, 1999, 3 : 59 - 71
  • [6] A Fully Polynomial Time Approximation Scheme for Packing While Traveling
    Neumann, Frank
    Polyakovskiy, Sergey
    Skutella, Martin
    Stougie, Leen
    Wu, Junhua
    ALGORITHMIC ASPECTS OF CLOUD COMPUTING (ALGOCLOUD 2018), 2019, 11409 : 59 - 72
  • [7] An asymptotic fully polynomial time approximation scheme for bin covering
    Jansen, K
    Solis-Oba, R
    THEORETICAL COMPUTER SCIENCE, 2003, 306 (1-3) : 543 - 551
  • [8] A new fully polynomial time approximation scheme for the knapsack problem
    Kellerer, H
    Pferschy, U
    JOURNAL OF COMBINATORIAL OPTIMIZATION, 1999, 3 (01) : 59 - 71
  • [9] A fully polynomial time approximation scheme for the smallest diameter of imprecise points
    Keikha, Vahideh
    Loffler, Maarten
    Mohades, Ali
    THEORETICAL COMPUTER SCIENCE, 2020, 814 : 259 - 270
  • [10] FULLY POLYNOMIAL TIME APPROXIMATION SCHEME FOR TOTAL WEIGHTED TARDINESS MINIMIZATION UNDER A COMMON DUE DATE
    Kacem, Imed
    PROCEEDINGS OF THE 38TH INTERNATIONAL CONFERENCE ON COMPUTERS AND INDUSTRIAL ENGINEERING, VOLS 1-3, 2008, : 161 - 163