A Special Case of Partially Observable Markov Decision Processes Problem by Event-Based Optimization

被引:0
|
作者
Zhang, Junyu [1 ]
机构
[1] Sun Yat Sen Univ, Sch Math & Computat Sci, Guangzhou 510275, Guangdong, Peoples R China
关键词
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
In this paper, we discuss a kind of partially observable Markov decision process (POMDP) problem by the event-based optimization which is proposed in [4]. A POMDP ([7] and [8]) is a generalization of a standard completely observable Markov decision process that allows imperfect information about states of the system. Policy iteration algorithms for POMDPs have proved to be impractical as it is very difficult to implement. Thus, most work with POMDPs has used value iteration. But for a special case of POMDP, we can formulate it to an MDP problem. Then we can use our sensitivity view to derive the corresponding average reward difference formula. Based on that and the idea of event-based optimization, we use a single sample path to estimate aggregated potentials. Then we develop policy iteration (PI) algorithms.
引用
收藏
页码:1522 / 1526
页数:5
相关论文
共 50 条
  • [1] Oracular partially observable Markov decision processes: A very special case
    Armstrong-Crews, Nicholas
    Veloso, Manuela
    PROCEEDINGS OF THE 2007 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION, VOLS 1-10, 2007, : 2477 - +
  • [2] Stochastic optimization of controlled partially observable Markov decision processes
    Bartlett, PL
    Baxter, J
    PROCEEDINGS OF THE 39TH IEEE CONFERENCE ON DECISION AND CONTROL, VOLS 1-5, 2000, : 124 - 129
  • [3] Scheduling optimization for scalable video streaming based on partially observable markov decision processes
    Fan, Feng-Jun
    Zou, Jun-Ni
    Wang, Min
    Xiong, Hong-Kai
    Shanghai Jiaotong Daxue Xuebao/Journal of Shanghai Jiaotong University, 2010, 44 (03): : 393 - 397
  • [4] Partially Observable Markov Decision Processes and Robotics
    Kurniawati, Hanna
    ANNUAL REVIEW OF CONTROL ROBOTICS AND AUTONOMOUS SYSTEMS, 2022, 5 : 253 - 277
  • [5] A tutorial on partially observable Markov decision processes
    Littman, Michael L.
    JOURNAL OF MATHEMATICAL PSYCHOLOGY, 2009, 53 (03) : 119 - 125
  • [6] Quantum partially observable Markov decision processes
    Barry, Jennifer
    Barry, Daniel T.
    Aaronson, Scott
    PHYSICAL REVIEW A, 2014, 90 (03):
  • [7] PARTIALLY OBSERVABLE MARKOV DECISION PROCESSES WITH PARTIALLY OBSERVABLE RANDOM DISCOUNT FACTORS
    Martinez-Garcia, E. Everardo
    Minjarez-Sosa, J. Adolfo
    Vega-Amaya, Oscar
    KYBERNETIKA, 2022, 58 (06) : 960 - 983
  • [8] Deciding the Value 1 Problem for #-acyclic Partially Observable Markov Decision Processes
    Gimbert, Hugo
    Oualhadj, Youssouf
    SOFSEM 2014: THEORY AND PRACTICE OF COMPUTER SCIENCE, 2014, 8327 : 281 - 292
  • [9] Partially observable Markov decision processes for risk-based screening
    Mrozack, Alex
    Liao, Xuejun
    Skatter, Sondre
    Carin, Lawrence
    ANOMALY DETECTION AND IMAGING WITH X-RAYS (ADIX), 2016, 9847
  • [10] Active learning in partially observable Markov decision processes
    Jaulmes, R
    Pineau, J
    Precup, D
    MACHINE LEARNING: ECML 2005, PROCEEDINGS, 2005, 3720 : 601 - 608