Observation-based Performance Sensitivity Analysis for POMDPs

Cited by: 0
Authors
Ji, Zhe [1 ]
Jiang, Xiaofeng [1 ]
Xi, Hongsheng [1 ]
Affiliations
[1] Univ Sci & Technol China, Sch Informat Sci & Technol, Dept Auto, Hefei 230027, Peoples R China
Keywords
POMDPs; Performance sensitivity analysis; Performance difference formula; Performance derivative formula
DOI
Not available
Chinese Library Classification
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
In this paper, performance sensitivity analysis for Markov decision processes (MDPs) is generalized to partially observable Markov decision processes (POMDPs). Observation-based performance derivative and performance difference formulas are derived. The derivation does not require overly strict assumptions. To find the optimal observation-based policy, an observation-based policy iteration algorithm is designed. Finally, an example is presented to show the applicability of the algorithm.
Pages: 1671 - 1676
Page count: 6