IMPULSIVE CONTROL FOR CONTINUOUS-TIME MARKOV DECISION PROCESSES

被引:0
|
作者
Dufour, Francois [1 ,2 ]
Piunovskiy, Alexei B. [3 ]
机构
[1] Univ Bordeaux, IMB, Bordeaux, France
[2] INRIA Bordeaux Sud Quest, F-33405 Talence, France
[3] Univ Liverpool, Liverpool L69 3BX, Merseyside, England
基金
英国工程与自然科学研究理事会;
关键词
Impulsive control; continuous control; continuous-time Markov decision process; discounted cost; DRIFT PROCESSES;
D O I
暂无
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
In this paper our objective is to study continuous-time Markov decision processes on a general Borel state space with both impulsive and continuous controls for the infinite time horizon discounted cost. The continuous-time controlled process is shown to be nonexplosive under appropriate hypotheses. The so-called Bellman equation associated to this control problem is studied. Sufficient conditions ensuring the existence and the uniqueness of a bounded measurable solution to this optimality equation are provided. Moreover, it is shown that the value function of the optimization problem under consideration satisfies this optimality equation. Sufficient conditions are also presented to ensure on the one hand the existence of an optimal control strategy, and on the other hand the existence of a epsilon-optimal control strategy. The decomposition of the state space into two disjoint subsets is exhibited where, roughly speaking, one should apply a gradual action or an impulsive action correspondingly to obtain an optimal or epsilon-optimal strategy. An interesting consequence of our previous results is as follows: the set of strategies that allow interventions, at time t = 0 and only immediately after natural jumps is a sufficient set for the control problem tinder consideraion.
引用
收藏
页码:106 / 127
页数:22
相关论文
共 50 条
  • [1] Impulsive Control for Continuous-Time Markov Decision Processes: A Linear Programming Approach
    Dufour, F.
    Piunovskiy, A. B.
    [J]. APPLIED MATHEMATICS AND OPTIMIZATION, 2016, 74 (01): : 129 - 161
  • [2] Impulsive Control for Continuous-Time Markov Decision Processes: A Linear Programming Approach
    F. Dufour
    A. B. Piunovskiy
    [J]. Applied Mathematics & Optimization, 2016, 74 : 129 - 161
  • [3] The Transformation Method for Continuous-Time Markov Decision Processes
    Piunovskiy, Alexey
    Zhang, Yi
    [J]. JOURNAL OF OPTIMIZATION THEORY AND APPLICATIONS, 2012, 154 (02) : 691 - 712
  • [4] Continuous-Time Markov Decision Processes with Controlled Observations
    Huang, Yunhan
    Kavitha, Veeraruna
    Zhu, Quanyan
    [J]. 2019 57TH ANNUAL ALLERTON CONFERENCE ON COMMUNICATION, CONTROL, AND COMPUTING (ALLERTON), 2019, : 32 - 39
  • [5] Continuous-Time Markov Decision Processes with Exponential Utility
    Zhang, Yi
    [J]. SIAM JOURNAL ON CONTROL AND OPTIMIZATION, 2017, 55 (04) : 2636 - 2660
  • [6] REALIZABLE STRATEGIES IN CONTINUOUS-TIME MARKOV DECISION PROCESSES
    Piunovskiy, Alexey
    [J]. SIAM JOURNAL ON CONTROL AND OPTIMIZATION, 2018, 56 (01) : 473 - 495
  • [7] The Transformation Method for Continuous-Time Markov Decision Processes
    Alexey Piunovskiy
    Yi Zhang
    [J]. Journal of Optimization Theory and Applications, 2012, 154 : 691 - 712
  • [8] Delayed Nondeterminism in Continuous-Time Markov Decision Processes
    Neuhaeusser, Martin R.
    Stoelinga, Marielle
    Katoen, Joost-Pieter
    [J]. FOUNDATIONS OF SOFTWARE SCIENCE AND COMPUTATIONAL STRUCTURES, PROCEEDINGS, 2009, 5504 : 364 - +
  • [9] Sufficiency of Markov Policies for Continuous-Time Jump Markov Decision Processes
    Feinberg, Eugene A.
    Mandava, Manasa
    Shiryaev, Albert N.
    [J]. MATHEMATICS OF OPERATIONS RESEARCH, 2022, 47 (02) : 1266 - 1286
  • [10] MARKOV DECISION-PROCESSES WITH BOTH CONTINUOUS AND IMPULSIVE CONTROL
    YUSHKEVICH, AA
    [J]. LECTURE NOTES IN CONTROL AND INFORMATION SCIENCES, 1986, 81 : 234 - 246