A method for speeding up value iteration in partially observable Markov decision processes

被引:0
|
作者
Zhang, NL [1 ]
Lee, SS [1 ]
Zhang, WH [1 ]
机构
[1] Hong Kong Univ Sci & Technol, Dept Comp Sci, Hong Kong, Peoples R China
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We present a technique for speeding up the convergence of value iteration for partially observable Markov decisions processes (POMDPs). The underlying idea is similar to that behind modified policy iteration for fully observable Markov decision processes (MDPs). The technique can be easily incorporated into any existing POMDP value iteration algorithms. Experiments have been conducted on several test problems with one POMDP value iteration algorithm called incremental pruning. We find that the technique can make incremental pruning run several orders of magnitude faster.
引用
收藏
页码:696 / 703
页数:8
相关论文
共 50 条
  • [1] Speeding up the convergence of value iteration in partially observable Markov decision processes
    Zhang, NL
    Zhang, WH
    [J]. JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 2001, 14 : 29 - 51
  • [2] Perception-Aware Point-Based Value Iteration for Partially Observable Markov Decision Processes
    Ghasemi, Mahsa
    Topcu, Ufuk
    [J]. PROCEEDINGS OF THE TWENTY-EIGHTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2019, : 2371 - 2377
  • [3] A Fast Approximation Method for Partially Observable Markov Decision Processes
    Bingbing Liu
    Yu Kang
    Xiaofeng Jiang
    Jiahu Qin
    [J]. Journal of Systems Science and Complexity, 2018, 31 : 1423 - 1436
  • [4] A Fast Approximation Method for Partially Observable Markov Decision Processes
    LIU Bingbing
    KANG Yu
    JIANG Xiaofeng
    QIN Jiahu
    [J]. Journal of Systems Science & Complexity, 2018, 31 (06) : 1423 - 1436
  • [5] A Fast Approximation Method for Partially Observable Markov Decision Processes
    Liu Bingbing
    Kang Yu
    Jiang Xiaofeng
    Qin Jiahu
    [J]. JOURNAL OF SYSTEMS SCIENCE & COMPLEXITY, 2018, 31 (06) : 1423 - 1436
  • [6] Value-Function Approximations for Partially Observable Markov Decision Processes
    Hauskrecht, Milos
    [J]. Journal of Artificial Intelligence Research, 2001, 13 (00): : 33 - 94
  • [7] Value-function approximations for partially observable Markov decision processes
    Hauskrecht, M
    [J]. JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 2000, 13 : 33 - 94
  • [8] Partially Observable Markov Decision Processes and Robotics
    Kurniawati, Hanna
    [J]. ANNUAL REVIEW OF CONTROL ROBOTICS AND AUTONOMOUS SYSTEMS, 2022, 5 : 253 - 277
  • [9] Quantum partially observable Markov decision processes
    Barry, Jennifer
    Barry, Daniel T.
    Aaronson, Scott
    [J]. PHYSICAL REVIEW A, 2014, 90 (03):
  • [10] A tutorial on partially observable Markov decision processes
    Littman, Michael L.
    [J]. JOURNAL OF MATHEMATICAL PSYCHOLOGY, 2009, 53 (03) : 119 - 125