ON THE CONSTRUCTION OF epsilon-OPTIMAL STRATEGIES IN PARTIALLY OBSERVED MDPs

被引:2
|
作者
Runggaldier, Wolfgang J. [1 ]
机构
[1] Univ Padua, Dipartimento Matemat Pura & Applicata, I-35131 Padua, Italy
关键词
Partially observable MDPs; epsilon-optimal strategies; approximation techniques;
D O I
10.1007/BF02055576
中图分类号
C93 [管理学]; O22 [运筹学];
学科分类号
070105 ; 12 ; 1201 ; 1202 ; 120202 ;
摘要
The purpose of the paper is to give a survey of methods, partly derived by the author in joint work with other researchers, concerning the problem of constructing epsilon-optimal strategies for partially observable MDPs. The methods basically consist in transforming the problem into one of approximation: Starting from the original problem a sequence of approximating problems is constructed such that: (i) For each approximating problem an optimal strategy can actually be computed. (ii) Given epsilon > 0, there exists an approximating problem such that the optimal strategy for the latter is epsilon-optimal for the original problem.
引用
收藏
页码:81 / 95
页数:15
相关论文
共 50 条