Zero-sum Markov games and worst-case optimal control of queueing systems

Cited by: 26
Authors
Altman, E [1]
Hordijk, A [1]
Affiliations
[1] LEIDEN UNIV,DEPT MATH & COMP SCI,2300 RA LEIDEN,NETHERLANDS
Keywords
zero-sum stochastic games; discounted and expected average cost; worst case control of queueing networks; value iteration; structural properties of optimal policies and value function;
DOI
10.1007/BF01149169
Chinese Library Classification number
TP39 [Applications of computers];
Subject classification code
081203; 0835;
Abstract
Zero-sum stochastic games model situations where two persons, called players, control some dynamic system and have opposite objectives. Typically, one player wishes to minimize a cost that has to be paid to the other player. Such a game may also be used to model problems with a single controller who has only partial information on the system: the dynamics of the system may depend on some parameter that is unknown to the controller and may vary in time in an unpredictable way. A worst-case criterion may then be considered, where the unknown parameter is assumed to be chosen by "nature" (called player 1), and the objective of the controller (player 2) is to design a policy that guarantees the best performance under worst-case behaviour of nature. The purpose of this paper is to present a survey of stochastic games in queues, where both tools and applications are considered. The first part is devoted to the tools. We present some existing tools for solving finite horizon and infinite horizon discounted Markov games with unbounded cost, and develop new ones that are typically applicable in queueing problems. We then present some new tools and theory for expected average cost stochastic games with unbounded cost. In the second part of the paper we present a survey of existing results on worst-case control of queues, and illustrate the structural properties of the best policies of the controller, the worst-case policies of nature, and the value function. Using the theory developed in the first part of the paper, we extend some of the above results, which were known to hold for finite horizon costs or for the discounted cost, to the expected average cost.
Pages: 415-447
Number of pages: 33