Towards the optimal control of Markov chains with constraints

被引:18
|
作者
Miller, Boris [1 ,2 ]
Miller, Gregory [3 ]
Siemenikhin, Konstantin [4 ]
机构
[1] Monash Univ, Sch Math Sci, Clayton, Vic 3800, Australia
[2] Inst Informat Transmiss Problems, Moscow 127994, Russia
[3] RAS, Inst Informat Problems, Moscow 119333, Russia
[4] Moscow Inst Aviat Technol, Probabil Theory Dept, Moscow 125993, Russia
基金
澳大利亚研究理事会;
关键词
Markov chains; Constraints; Optimal control; Maximum principle; DECISION-PROCESSES;
D O I
10.1016/j.automatica.2010.06.003
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
An optimal control problem with constraints is considered on a finite interval fora non-stationary Markov chain with a finite state space. The constraints are given as a set of inequalities. The optimal solution existence is proved under a natural assumption that the set of admissible controls is non-empty. The stochastic control problem is reduced to a deterministic one and it is shown that the optimal solution satisfies the maximum principle, moreover it can be chosen within a class of Markov controls. On the basis of this result an approach to the numerical solution is proposed and its implementation is illustrated by examples. (c) 2010 Published by Elsevier Ltd
引用
收藏
页码:1495 / 1502
页数:8
相关论文
共 50 条
  • [21] Finding Provably Optimal Markov Chains
    Spel, Jip
    Junges, Sebastian
    Katoen, Joost-Pieter
    TOOLS AND ALGORITHMS FOR THE CONSTRUCTION AND ANALYSIS OF SYSTEMS, PT I, TACAS 2021, 2021, 12651 : 173 - 190
  • [22] Optimal switching problem for Markov chains
    Yushkevich, AA
    MARKOV PROCESSES AND CONTROLLED MARKOV CHAINS, 2002, : 255 - 286
  • [23] On optimal condition numbers for Markov chains
    Kirkland, Stephen J.
    Neumann, Michael
    Sze, Nung-Sing
    NUMERISCHE MATHEMATIK, 2008, 110 (04) : 521 - 537
  • [24] OPTIMAL STOPPING FOR FUNCTIONS OF MARKOV CHAINS
    RUIZMONC.A
    ANNALS OF MATHEMATICAL STATISTICS, 1967, 38 (06): : 1939 - &
  • [25] Updating, transition constraints and possibilistic markov chains
    Dubois, Didier
    Dupin de Saintcyr, Florence
    Prade, Henri
    Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 2014, 945 : 261 - 272
  • [26] Updating, transition constraints and possibilistic Markov chains
    Dubois, D
    Cyr, FDD
    Prade, H
    ADVANCES IN INTELLIGENT COMPUTING - IPMU '94, 1995, 945 : 263 - 272
  • [27] Optimal control of time-inhomogeneous Markov chains with application to dam management
    McInnes, Daniel
    Miller, Boris
    2013 3RD AUSTRALIAN CONTROL CONFERENCE (AUCC), 2013, : 230 - 237
  • [28] Optimal Control of Probability on a Target Set for Continuous-Time Markov Chains
    Ma, Chenglin
    Zhao, Huaizhong
    IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2024, 69 (02) : 1202 - 1209
  • [29] AGGREGATION AND OPTIMAL-CONTROL OF NEARLY COMPLETELY DECOMPOSABLE MARKOV-CHAINS
    ALDHAHERI, RW
    KHALIL, HK
    PROCEEDINGS OF THE 28TH IEEE CONFERENCE ON DECISION AND CONTROL, VOLS 1-3, 1989, : 1277 - 1282
  • [30] Optimal control problem regularization for the Markov process with finite number of states and constraints
    Miller, B. M.
    Miller, G. B.
    Semenikhin, K. V.
    AUTOMATION AND REMOTE CONTROL, 2016, 77 (09) : 1589 - 1611