Asymptotic optimality of a class of controlled non-Markov processes

被引:0
|
作者
Dufour, Francois [1 ]
Tran, Ky [2 ]
Wang, Le Yi [3 ]
Yin, George [4 ]
机构
[1] Univ Bordeau, Inst Polytech Bordeaux, Team ASTRAL, INRIA Bordeaux Sud Ouest,Inst Math Bordeaux, Bordeaux, France
[2] State Univ New York, Dept Appl Math & Stat, Incheon, South Korea
[3] Wayne State Univ, Dept Elect & Comp Engn, Detroit, MI USA
[4] Univ Connecticut, Dept Math, Storrs, CT 06269 USA
基金
美国国家科学基金会; 新加坡国家研究基金会;
关键词
Non-Markov controlled switching process; two-time scale system; weak convergence; asymptotic optimality; SINGULAR PERTURBATION; SYSTEMS; DIFFUSIONS;
D O I
10.1080/00036811.2024.2368078
中图分类号
O29 [应用数学];
学科分类号
070104 ;
摘要
Motivated by applications in power systems and problems arising in simulation of large scale complex system optimizations, this work is concerned with controlled stochastic switching systems. The system of interest displays a multi-time scale structure. In contrast to the so-called singularly perturbed diffusions and multi-scale Markov decision processes, controlled non-Markov processes (also known as non-Markov decision processes) are treated. The novelty of our work is the treatment of the non-Markov controlled processes and the time-scale used. The fast and slow processes are coupled through a stochastic differential equation. Using averaging, it is first shown that the non-Markov switching process has a weak limit that is a Markov decision process. Then asymptotic optimal control of the non-Markov process is obtained by using the limit process.
引用
收藏
页数:13
相关论文
共 50 条