ON THE CONSTRUCTION OF epsilon-OPTIMAL STRATEGIES IN PARTIALLY OBSERVED MDPs

Cited by: 2
Author
Runggaldier, Wolfgang J. [1]
Affiliation
[1] Univ Padua, Dipartimento Matemat Pura & Applicata, I-35131 Padua, Italy
Keywords
Partially observable MDPs; epsilon-optimal strategies; approximation techniques
DOI
10.1007/BF02055576
Chinese Library Classification (CLC) number
C93 [Management Science]; O22 [Operations Research]
Discipline classification code
070105; 12; 1201; 1202; 120202
Abstract
The purpose of the paper is to give a survey of methods, partly derived by the author in joint work with other researchers, concerning the problem of constructing epsilon-optimal strategies for partially observable MDPs. The methods basically consist in transforming the problem into one of approximation: starting from the original problem, a sequence of approximating problems is constructed such that (i) for each approximating problem an optimal strategy can actually be computed, and (ii) given epsilon > 0, there exists an approximating problem such that the optimal strategy for the latter is epsilon-optimal for the original problem.
Pages: 81-95
Number of pages: 15
Related papers
50 in total
  • [1] STATIONARY EPSILON-OPTIMAL STRATEGIES IN STOCHASTIC GAMES
    THUIJSMAN, F
    VRIEZE, K
    OR SPEKTRUM, 1993, 15 (01) : 9 - 15
  • [2] SEARCH FOR EPSILON-OPTIMAL STRATEGIES OF OBSERVATION OF A MARKOV PROCESS
    PIUNOVSKII, AB
    SAKSONOV, EA
    AUTOMATION AND REMOTE CONTROL, 1981, 42 (01) : 23 - 31
  • [3] ON EXISTENCE OF EPSILON-OPTIMAL HOMOGENEUOS MARKOFF STRATEGIES FOR CONTROLLED CHAIN
    KRYLOV, NV
    DOKLADY AKADEMII NAUK SSSR, 1964, 155 (04): : 747 - &
  • [4] PROGRAM ITERATIONS AND UNIVERSAL EPSILON-OPTIMAL STRATEGIES IN POSITIONAL DIFFERENTIAL GAME
    CHISTYAKOV, SV
    DOKLADY AKADEMII NAUK SSSR, 1991, 319 (06): : 1333 - 1335
  • [5] An epsilon-Optimal Portfolio with Stochastic Volatility
    Gabih, Abdelali
    Grecksch, Wilfried
    MONTE CARLO METHODS AND APPLICATIONS, 2005, 11 (01): : 1 - 14
  • [6] EPSILON-OPTIMAL STUBBORN LEARNING-MECHANISMS
    CHRISTENSEN, JPR
    OOMMEN, BJ
    IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS, 1990, 20 (05): : 1209 - 1216
  • [7] EPSILON-OPTIMAL DISCRETIZED PURSUIT LEARNING AUTOMATA
    OOMMEN, BJ
    LANCTOT, JK
    1989 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS, VOLS 1-3: CONFERENCE PROCEEDINGS, 1989, : 6 - 12
  • [8] APPROXIMATIONS FOR DISCRETE-TIME ADAPTIVE-CONTROL - CONSTRUCTION OF EPSILON-OPTIMAL CONTROLS
    RUNGGALDIER, WJ
    ZANE, O
    MATHEMATICS OF CONTROL SIGNALS AND SYSTEMS, 1991, 4 (03) : 269 - 291
  • [9] SOME EPSILON-OPTIMAL ROW-COLUMN DESIGNS
    JACROUX, M
    SANKHYA-SERIES B-APPLIED AND INTERDISCIPLINARY STATISTICS, 1986, 48 : 31 - 39
  • [10] Fast and Epsilon-Optimal Discretized Pursuit Learning Automata
    Zhang, JunQi
    Wang, Cheng
    Zhou, MengChu
    IEEE TRANSACTIONS ON CYBERNETICS, 2015, 45 (10) : 2089 - 2099