Zero-sum Markov games and worst-case optimal control of queueing systems

被引：26

作者：

Altman, E ^{[1
]}

Hordijk, A ^{[1
]}

机构：

[1] LEIDEN UNIV,DEPT MATH & COMP SCI,2300 RA LEIDEN,NETHERLANDS

来源：

QUEUEING SYSTEMS | 1995年 / 21卷 / 3-4期

关键词：

zero-sum stochastic games; discounted and expected average cost; worst case control of queueing networks; value iteration; structural properties of optimal policies and value function;

D O I：

10.1007/BF01149169

中图分类号：

TP39 [计算机的应用];

学科分类号：

081203 ; 0835 ;

摘要：

Zero-sum stochastic games model situations where two persons, called players, control some dynamic system, and both have opposite objectives. One player wishes typically to minimize a cost which has to be paid to the other player. Such a game may also be used to model problems with a single controller who has only partial information on the system: the dynamic of the system may depend on some parameter that is unknown to the controller, and may vary in time in an unpredictable way. A worst-case criterion may be considered, where the unknown parameter is assumed to be chosen by ''nature'' (called player 1), and the objective of the controller (player 2) is then to design a policy that guarantees the best performance under worst-case behaviour of nature. The purpose of this paper is to present a survey of stochastic games in queues, where both tools and applications are considered. The first part is devoted to the tools. We present some existing tools for solving finite horizon and infinite horizon discounted Markov games with unbounded cost, and develop new ones that are typically applicable in queueing problems. We then present some new tools and theory of expected average cost stochastic games with unbounded cost. In the second part of the paper we present a survey on existing results on worst-case control of queues, and illustrate the structural properties of best policies of the controller, worst-case policies of nature, and of the value function. Using the theory developed in the first part of the paper, we extend some of the above results, which were known to hold for finite horizon costs or for the discounted cost, to the expected average cost.

引用

页码：415 / 447

页数：33

共 50 条

[21] Limit Optimal Trajectories in Zero-Sum Stochastic Games
Sorin, Sylvain
Vigeral, Guillaume
[J]. DYNAMIC GAMES AND APPLICATIONS, 2020, 10 (02) : 555 - 572
[22] Decomposition Techniques for Markov Zero-sum Games with Nested Information
Zheng, Jiefu
Castanon, David A.
[J]. 2013 IEEE 52ND ANNUAL CONFERENCE ON DECISION AND CONTROL (CDC), 2013, : 574 - 581
[23] Mean-field risk sensitive control and zero-sum games for Markov chains
Choutri, Salah Eddine
Djehiche, Boualem
[J]. BULLETIN DES SCIENCES MATHEMATIQUES, 2019, 152 : 1 - 39
[24] Almost Optimal Algorithms for Two-player Zero-Sum Linear Mixture Markov Games
Chen, Zixiang
Zhou, Dongruo
Gu, Quanquan
[J]. INTERNATIONAL CONFERENCE ON ALGORITHMIC LEARNING THEORY, VOL 167, 2022, 167
[25] Optimal Strategies in Zero-Sum Repeated Games with Incomplete Information: The Dependent Case
Gensbittel, Fabien
Oliu-Barton, Miquel
[J]. DYNAMIC GAMES AND APPLICATIONS, 2020, 10 (04) : 819 - 835
[26] Optimal Strategies in Zero-Sum Repeated Games with Incomplete Information: The Dependent Case
Fabien Gensbittel
Miquel Oliu-Barton
[J]. Dynamic Games and Applications, 2020, 10 : 819 - 835
[27] Zero-sum games with ambiguity
Rosenberg, Dinah
Vieille, Nicolas
[J]. GAMES AND ECONOMIC BEHAVIOR, 2019, 117 : 238 - 249
[28] Zero-sum revision games
Gensbittel, Fabien
Lovo, Stefano
Renault, Jerome
Tomala, Tristan
[J]. GAMES AND ECONOMIC BEHAVIOR, 2018, 108 : 504 - 522
[29] Zero-sum games with charges
Flesch, Janos
Vermeulen, Dries
Zseleva, Anna
[J]. GAMES AND ECONOMIC BEHAVIOR, 2017, 102 : 666 - 686
[30] WAR AND ZERO-SUM GAMES
KIERNAN, BP
[J]. VIRGINIA QUARTERLY REVIEW, 1977, 53 (01) : 17 - 31

← 1 2 3 4 5 →