Monotone optimal control for a class of Markov decision processes

被引:13
|
作者
Zhuang, Weifen [1 ]
Li, Michael Z. F. [2 ]
机构
[1] Xiamen Univ, Sch Management, Xiamen 361005, Fujian, Peoples R China
[2] Nanyang Technol Univ, Nanyang Business Sch, Singapore 639798, Singapore
关键词
Markov processes; D-multimodularity; Monotone optimal control; Substitution; Complement; STOCK PRODUCTION SYSTEM; PRODUCTION-INVENTORY SYSTEM; OPTIMAL POLICY STRUCTURE; LOST-SALES; STRUCTURAL-PROPERTIES; REVENUE MANAGEMENT; QUEUING-SYSTEMS; CONVEXITY; SERVICE; DEMAND;
D O I
10.1016/j.ejor.2011.09.021
中图分类号
C93 [管理学];
学科分类号
12 ; 1201 ; 1202 ; 120202 ;
摘要
This paper provides a unified framework to study monotone optimal control for a class of Markov decision processes through D-multimodularity. We demonstrate that each system in this class can be classified as either a substitution-type or a complement-type system according to the possible transition set, which can be used as a classification mechanism that integrates a variety of models in the literature. We develop a generic proof of the structural properties of both types of system. In particular, we show that D-multimodularity is a generally sufficient condition for monotone optimal control of different types of system in this class. With this unified theory, there is no need to pursue each problem ad hoc and the structural properties of this class of MDPs follow with ease. (C) 2011 Elsevier B.V. All rights reserved.
引用
收藏
页码:342 / 350
页数:9
相关论文
共 50 条
  • [21] IDENTIFICATION OF OPTIMAL POLICIES IN MARKOV DECISION PROCESSES
    Sladky, Karel
    KYBERNETIKA, 2010, 46 (03) : 558 - 570
  • [22] Optimal Policies for Quantum Markov Decision Processes
    Ming-Sheng Ying
    Yuan Feng
    Sheng-Gang Ying
    International Journal of Automation and Computing, 2021, 18 : 410 - 421
  • [23] Optimal Control of Logically Constrained Partially Observable and Multiagent Markov Decision Processes
    Kalagarla, Krishna C.
    Kartik, Dhruva
    Shen, Dongming
    Jain, Rahul
    Nayyar, Ashutosh
    Nuzzo, Pierluigi
    IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2025, 70 (01) : 263 - 277
  • [24] Nearly optimal control of singularly perturbed Markov decision processes in discrete time
    Liu, RH
    Zhang, Q
    Yin, G
    APPLIED MATHEMATICS AND OPTIMIZATION, 2001, 44 (02): : 105 - 129
  • [25] Markov decision processes based optimal control policies for probabilistic boolean networks
    Abul, O
    Alhajj, R
    Polat, F
    BIBE 2004: FOURTH IEEE SYMPOSIUM ON BIOINFORMATICS AND BIOENGINEERING, PROCEEDINGS, 2004, : 337 - 344
  • [26] MONOTONE OPTIMAL POLICIES IN DISCOUNTED MARKOV DECISION PROCESSES WITH TRANSITION PROBABILITIES INDEPENDENT OF THE CURRENT STATE: EXISTENCE AND APPROXIMATION
    Flores-Hernandez, Rosa M.
    KYBERNETIKA, 2013, 49 (05) : 705 - 719
  • [27] Optimal control of Markov Regenerative Processes
    Pfening, A
    Telek, M
    1998 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS, VOLS 1-5, 1998, : 663 - 668
  • [28] CLASS OF NON-MARKOV DECISION-PROCESSES
    GLAZEBROOK, KD
    JOURNAL OF APPLIED PROBABILITY, 1978, 15 (04) : 689 - 698
  • [29] Optimal control of stochastic hybrid systems based on locally consistent Markov decision processes
    Koutsoukos, XD
    2005 IEEE INTERNATIONAL SYMPOSIUM ON INTELLIGENT CONTROL & 13TH MEDITERRANEAN CONFERENCE ON CONTROL AND AUTOMATION, VOLS 1 AND 2, 2005, : 435 - 440
  • [30] Quickest Change Detection Approach to Optimal Control in Markov Decision Processes with Model Changes
    Banerjee, Taposh
    Liu, Miao
    How, Jonathan P.
    2017 AMERICAN CONTROL CONFERENCE (ACC), 2017, : 399 - 405