A Sensitivity-Based Construction Approach to Sample-Path Variance Minimization of Markov Decision Processes

被引:0
|
作者
Huang, Yonghao [1 ]
Chen, Xi [1 ]
机构
[1] Tsinghua Univ, TNList, Dept Automat, Ctr Intelligent & Networked Syst, Beijing 100084, Peoples R China
基金
中国国家自然科学基金;
关键词
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
We study the limiting average variance along the sample path as the secondary criterion for Markov decision processes, with the long-run average performance as the primary criterion. By applying the sensitivity-based approach, we intuitively construct the difference formula for the sample path variance under different policies. Thereby, a sufficient condition for the sample-path variance optimality can be easily derived. This work extends the sensitivity-based construction approach to the Markov decision processes with the nonstandard performance criterion. Compared with the pure mathematical verification, the sensitivity-based construction approach shows more intuition and provides insights on the sample-path structure of Markov decision processes.
引用
收藏
页码:215 / 220
页数:6
相关论文
共 50 条