Approximation of Stationary Control Policies by Quantized Control in Markov Decision Processes

被引：0

作者：

Saldi, Noel ^{[1
]}

Linder, Tamas ^{[1
]}

Yueksel, Serdar ^{[1
]}

机构：

[1] Queens Univ, Dept Math & Stat, Kingston, ON K7L 3N6, Canada

来源：

2013 51ST ANNUAL ALLERTON CONFERENCE ON COMMUNICATION, CONTROL, AND COMPUTING (ALLERTON) | 2013年

关键词：

FINITE-STATE APPROXIMATIONS; SPACE;

D O I：

暂无

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

We consider the problem of approximating optimal stationary control policies by quantized control. Stationary quantizer policies are introduced and it is shown that such policies are epsilon-optimal among stationary policies under mild technical conditions. Quantitative bounds on the approximation error in terms of the rate of the approximating quantizers are also derived. Thus, one can search for epsilon-optimal policies within quantized control policies. These pave the way for applications in optimal design of networked control systems where controller actions need to be quantized, as well as for a new computational method for the generation of approximately optimal Markov decision policies in general (Borel) state and action spaces for both discounted cost and average cost infinite horizon optimal control problems.

引用

页码：78 / 84

页数：7

共 50 条

[21] Approximation of average cost optimal policies for general Markov decision processes with unbounded costs
Gordienko, E
Montes-de-Oca, R
Minjarez-Sosa, A
MATHEMATICAL METHODS OF OPERATIONS RESEARCH, 1997, 45 (02) : 245 - 263
[22] Approximation of average cost optimal policies for general Markov decision processes with unbounded costs
Evgueni Gordienko
Raúl Montes-De-Oca
Adolfo Minjárez-Sosa
Mathematical Methods of Operations Research, 1997, 45 : 245 - 263
[23] Monotone optimal control for a class of Markov decision processes
Zhuang, Weifen
Li, Michael Z. F.
EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 2012, 217 (02) : 342 - 350
[24] Server Frequency Control Using Markov Decision Processes
Chen, Lydia Y.
Gautam, Natarajan
IEEE INFOCOM 2009 - IEEE CONFERENCE ON COMPUTER COMMUNICATIONS, VOLS 1-5, 2009, : 2951 - +
[25] Optimal control in light traffic Markov Decision Processes
INRIA, Sophia Antipolis, France
ZOR, 1 (63-79):
[26] Optimal control in light traffic Markov decision processes
Ger Koole
Olaf Passchier
Mathematical Methods of Operations Research, 1997, 45 : 63 - 79
[27] Decentralized Control of Partially Observable Markov Decision Processes
Amato, Christopher
Chowdhary, Girish
Geramifard, Alborz
Uere, N. Kemal
Kochenderfer, Mykel J.
2013 IEEE 52ND ANNUAL CONFERENCE ON DECISION AND CONTROL (CDC), 2013, : 2398 - 2405
[28] Control of Markov Decision Processes from PCTL specifications
Lahijanian, M.
Andersson, S. B.
Belta, C.
2011 AMERICAN CONTROL CONFERENCE, 2011, : 311 - 316
[29] MARKOV DECISION MODEL FOR SELECTING OPTIMAL CREDIT CONTROL POLICIES
LIEBMAN, LH
MANAGEMENT SCIENCE SERIES B-APPLICATION, 1972, 18 (10): : B519 - B525
[30] Optimal control in light traffic Markov decision processes
Koole, G
Passchier, O
MATHEMATICAL METHODS OF OPERATIONS RESEARCH, 1997, 45 (01) : 63 - 79

← 1 2 3 4 5 →