Approximation of Stationary Control Policies by Quantized Control in Markov Decision Processes

被引:0
|
作者
Saldi, Noel [1 ]
Linder, Tamas [1 ]
Yueksel, Serdar [1 ]
机构
[1] Queens Univ, Dept Math & Stat, Kingston, ON K7L 3N6, Canada
关键词
FINITE-STATE APPROXIMATIONS; SPACE;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
We consider the problem of approximating optimal stationary control policies by quantized control. Stationary quantizer policies are introduced and it is shown that such policies are epsilon-optimal among stationary policies under mild technical conditions. Quantitative bounds on the approximation error in terms of the rate of the approximating quantizers are also derived. Thus, one can search for epsilon-optimal policies within quantized control policies. These pave the way for applications in optimal design of networked control systems where controller actions need to be quantized, as well as for a new computational method for the generation of approximately optimal Markov decision policies in general (Borel) state and action spaces for both discounted cost and average cost infinite horizon optimal control problems.
引用
收藏
页码:78 / 84
页数:7
相关论文
共 50 条
  • [1] Asymptotic Optimality and Rates of Convergence of Quantized Stationary Policies in Continuous-Time Markov Decision Processes
    Wu, Xiao
    Tang, Yanqiu
    DISCRETE DYNAMICS IN NATURE AND SOCIETY, 2022, 2022
  • [2] Efficient Policies for Stationary Possibilistic Markov Decision Processes
    Ben Amor, Nahla
    El Khalfi, Zeineb
    Fargier, Helene
    Sabaddin, Regis
    SYMBOLIC AND QUANTITATIVE APPROACHES TO REASONING WITH UNCERTAINTY, ECSQARU 2017, 2017, 10369 : 306 - 317
  • [3] Markov decision processes approximation with coupled dynamics via Markov deterministic control systems
    Portillo-Ramirez, Gustavo
    Cruz-Suarez, Hugo
    Lopez-Rios, Ruy
    Blancas-Rivera, Ruben
    OPEN MATHEMATICS, 2023, 21 (01):
  • [4] Deriving Explicit Control Policies for Markov Decision Processes Using Symbolic Regression
    Hristov, A.
    Bosman, J. W.
    Bhulai, S.
    van der Mei, R. D.
    PROCEEDINGS OF THE 13TH EAI INTERNATIONAL CONFERENCE ON PERFORMANCE EVALUATION METHODOLOGIES AND TOOLS ( VALUETOOLS 2020), 2020, : 41 - 47
  • [5] Markov decision processes based optimal control policies for probabilistic boolean networks
    Abul, O
    Alhajj, R
    Polat, F
    BIBE 2004: FOURTH IEEE SYMPOSIUM ON BIOINFORMATICS AND BIOENGINEERING, PROCEEDINGS, 2004, : 337 - 344
  • [6] Notes on equivalent stationary policies in Markov decision processes with total rewards
    Feinberg, Eugene A.
    Sonin, Isaac M.
    ZOR. Zeitschrift fur Operations-Research, 1996, 44 (02): : 205 - 221
  • [7] Approximation of Markov stationary sequences in stochastic control problems
    Fetisov, VN
    AUTOMATION AND REMOTE CONTROL, 2000, 61 (10) : 1665 - 1670
  • [8] Mode-matching control policies for multi-mode Markov decision processes
    Ren, ZY
    Krogh, BH
    PROCEEDINGS OF THE 2001 AMERICAN CONTROL CONFERENCE, VOLS 1-6, 2001, : 95 - 100
  • [9] UNIFORMIZATION FOR SEMI-MARKOV DECISION-PROCESSES UNDER STATIONARY POLICIES
    BEUTLER, FJ
    ROSS, KW
    JOURNAL OF APPLIED PROBABILITY, 1987, 24 (03) : 644 - 656
  • [10] Splitting Randomized Stationary Policies in Total-Reward Markov Decision Processes
    Feinberg, Eugene A.
    Rothblum, Uriel G.
    MATHEMATICS OF OPERATIONS RESEARCH, 2012, 37 (01) : 129 - 153