Observation-based Performance Sensitivity Analysis for POMDPs

被引:0
|
作者
Ji, Zhe [1 ]
Jiang, Xiaofeng [1 ]
Xi, Hongsheng [1 ]
机构
[1] Univ Sci & Technol China, Sch Informat Sci & Technol, Dept Auto, Hefei 230027, Peoples R China
关键词
POMDPs; Performance sensitivity analysis; Performance difference formula; Performance derivative formula;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper, the performance sensitivity analysis for Markov decision processes (MDPs) are generalized to study the partially observable Markov decision processes (POMDPs). The performance derivative formula and the performance difference formula based on observation are derived in this paper. The derivation does not need any overly strict assumptions. In order to find the optimal policy based on observation, an observation-based policy iteration algorithm is designed. An example is presented to show the applicability of the algorithm finally.
引用
收藏
页码:1671 / 1676
页数:6
相关论文
共 50 条
  • [41] An automatic observation-based aerosol typing method for EARLINET
    Papagiannopoulos, Nikolaos
    Mona, Lucia
    Amodeo, Aldo
    D'Amico, Giuseppe
    Claramunt, Pilar Guma
    Pappalardo, Gelsomina
    Alados-Arboledas, Lucas
    Luis Guerrero-Rascado, Juan
    Amiridis, Vassilis
    Kokkalis, Panagiotis
    Apituley, Arnoud
    Baars, Holger
    Schwarz, Anja
    Wandinger, Ulla
    Binietoglou, Ioannis
    Nicolae, Doina
    Bortoli, Daniele
    Comeron, Adolfo
    Rodriguez-Gomez, Alejandro
    Sicard, Michael
    Papayannis, Alex
    Wiegner, Matthias
    ATMOSPHERIC CHEMISTRY AND PHYSICS, 2018, 18 (21) : 15879 - 15901
  • [42] An observation-based algorithm to identify the characteristics of a dynamic system
    Drozdov, AL
    AUTOMATION AND REMOTE CONTROL, 2000, 61 (05) : 768 - 776
  • [43] MANAGEMENT OF SURGICAL WORKFLOW - AN OBSERVATION-BASED ASSESSMENT STUDY
    Bartnicka, Joanna
    BUSINESS AND NON-PROFIT ORGANIZATIONS FACING INCREASED COMPETITION AND GROWING CUSTOMERS' DEMANDS, VOL 17, 2018, 17 : 11 - 22
  • [44] Observation-based training for neuroprosthetic control of grasping by amputees
    Agashe, Harshavardhan A.
    Contreras-Vidal, Jose L.
    2014 36TH ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY (EMBC), 2014, : 3989 - 3992
  • [45] Plant location and the advent of slab casting by US steel minimills: An observation-based analysis
    Giarratani, Frank
    Gruver, Gene
    Jackson, Randall
    ECONOMIC GEOGRAPHY, 2006, 82 (04) : 401 - 419
  • [46] An observation-based instrument to measure what children with disabilities do on the playground: a Rasch analysis
    Grady-Dominguez, Patricia
    Bundy, Anita
    Ragen, Jo
    Wyver, Shirley
    Villeneuve, Michelle
    Naughton, Geraldine
    Tranter, Paul
    Eakman, Aaron
    Hepburn, Susan
    Beetham, Kassia
    INTERNATIONAL JOURNAL OF PLAY, 2019, 8 (01) : 79 - 93
  • [47] Introducing observation-based physics into the WAM wave model
    Kousal, Joshua
    Liu, Qingxiang
    Bidlot, Jean-Raymond
    Behrens, Arno
    Guenther, Heinz
    Staneva, Joanna
    Babanin, Alexander V.
    PROCEEDINGS OF ASME 2022 41ST INTERNATIONAL CONFERENCE ON OCEAN, OFFSHORE & ARCTIC ENGINEERING, OMAE2022, VOL 2, 2022,
  • [48] Observation-Based Decomposition of Radiative Perturbations and Radiative Kernels
    Thorsen, Tyler J.
    Kato, Seiji
    Loeb, Norman G.
    Rose, Fred G.
    JOURNAL OF CLIMATE, 2018, 31 (24) : 10039 - 10058
  • [49] Observation-based logic of knowledge, belief, desire and intention
    Su, Kaile
    Yue, Weiya
    Sattar, Abdul
    Orgun, Mehmet A.
    Luo, Xiangyu
    KNOWLEDGE SCIENCE, ENGINEERING AND MANAGEMENT, 2006, 4092 : 366 - 378
  • [50] Analysis of pickup-delivery vehicles movements: A vehicle observation-based freight survey
    Moufad, Imane
    Jawab, Fouad
    LOGISTIQUA2020: 2020 IEEE 13TH INTERNATIONAL COLLOQUIUM OF LOGISTICS AND SUPPLY CHAIN MANAGEMENT (LOGISTIQUA 2020), 2020,