Zero-sum Markov games and worst-case optimal control of queueing systems

被引:26
|
作者
Altman, E [1 ]
Hordijk, A [1 ]
机构
[1] LEIDEN UNIV,DEPT MATH & COMP SCI,2300 RA LEIDEN,NETHERLANDS
关键词
zero-sum stochastic games; discounted and expected average cost; worst case control of queueing networks; value iteration; structural properties of optimal policies and value function;
D O I
10.1007/BF01149169
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Zero-sum stochastic games model situations where two persons, called players, control some dynamic system, and both have opposite objectives. One player wishes typically to minimize a cost which has to be paid to the other player. Such a game may also be used to model problems with a single controller who has only partial information on the system: the dynamic of the system may depend on some parameter that is unknown to the controller, and may vary in time in an unpredictable way. A worst-case criterion may be considered, where the unknown parameter is assumed to be chosen by ''nature'' (called player 1), and the objective of the controller (player 2) is then to design a policy that guarantees the best performance under worst-case behaviour of nature. The purpose of this paper is to present a survey of stochastic games in queues, where both tools and applications are considered. The first part is devoted to the tools. We present some existing tools for solving finite horizon and infinite horizon discounted Markov games with unbounded cost, and develop new ones that are typically applicable in queueing problems. We then present some new tools and theory of expected average cost stochastic games with unbounded cost. In the second part of the paper we present a survey on existing results on worst-case control of queues, and illustrate the structural properties of best policies of the controller, worst-case policies of nature, and of the value function. Using the theory developed in the first part of the paper, we extend some of the above results, which were known to hold for finite horizon costs or for the discounted cost, to the expected average cost.
引用
收藏
页码:415 / 447
页数:33
相关论文
共 50 条
  • [21] Limit Optimal Trajectories in Zero-Sum Stochastic Games
    Sorin, Sylvain
    Vigeral, Guillaume
    [J]. DYNAMIC GAMES AND APPLICATIONS, 2020, 10 (02) : 555 - 572
  • [22] Decomposition Techniques for Markov Zero-sum Games with Nested Information
    Zheng, Jiefu
    Castanon, David A.
    [J]. 2013 IEEE 52ND ANNUAL CONFERENCE ON DECISION AND CONTROL (CDC), 2013, : 574 - 581
  • [23] Mean-field risk sensitive control and zero-sum games for Markov chains
    Choutri, Salah Eddine
    Djehiche, Boualem
    [J]. BULLETIN DES SCIENCES MATHEMATIQUES, 2019, 152 : 1 - 39
  • [24] Almost Optimal Algorithms for Two-player Zero-Sum Linear Mixture Markov Games
    Chen, Zixiang
    Zhou, Dongruo
    Gu, Quanquan
    [J]. INTERNATIONAL CONFERENCE ON ALGORITHMIC LEARNING THEORY, VOL 167, 2022, 167
  • [25] Optimal Strategies in Zero-Sum Repeated Games with Incomplete Information: The Dependent Case
    Gensbittel, Fabien
    Oliu-Barton, Miquel
    [J]. DYNAMIC GAMES AND APPLICATIONS, 2020, 10 (04) : 819 - 835
  • [26] Optimal Strategies in Zero-Sum Repeated Games with Incomplete Information: The Dependent Case
    Fabien Gensbittel
    Miquel Oliu-Barton
    [J]. Dynamic Games and Applications, 2020, 10 : 819 - 835
  • [27] Zero-sum games with ambiguity
    Rosenberg, Dinah
    Vieille, Nicolas
    [J]. GAMES AND ECONOMIC BEHAVIOR, 2019, 117 : 238 - 249
  • [28] Zero-sum revision games
    Gensbittel, Fabien
    Lovo, Stefano
    Renault, Jerome
    Tomala, Tristan
    [J]. GAMES AND ECONOMIC BEHAVIOR, 2018, 108 : 504 - 522
  • [29] Zero-sum games with charges
    Flesch, Janos
    Vermeulen, Dries
    Zseleva, Anna
    [J]. GAMES AND ECONOMIC BEHAVIOR, 2017, 102 : 666 - 686
  • [30] WAR AND ZERO-SUM GAMES
    KIERNAN, BP
    [J]. VIRGINIA QUARTERLY REVIEW, 1977, 53 (01) : 17 - 31