A method for speeding up value iteration in partially observable Markov decision processes

被引：0

作者：

Zhang, NL ^{[1
]}

Lee, SS ^{[1
]}

Zhang, WH ^{[1
]}

机构：

[1] Hong Kong Univ Sci & Technol, Dept Comp Sci, Hong Kong, Peoples R China

来源：

UNCERTAINTY IN ARTIFICIAL INTELLIGENCE, PROCEEDINGS | 1999年

关键词：

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

We present a technique for speeding up the convergence of value iteration for partially observable Markov decisions processes (POMDPs). The underlying idea is similar to that behind modified policy iteration for fully observable Markov decision processes (MDPs). The technique can be easily incorporated into any existing POMDP value iteration algorithms. Experiments have been conducted on several test problems with one POMDP value iteration algorithm called incremental pruning. We find that the technique can make incremental pruning run several orders of magnitude faster.

引用

页码：696 / 703

页数：8

共 50 条

[1] Speeding up the convergence of value iteration in partially observable Markov decision processes
Zhang, NL
Zhang, WH
[J]. JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 2001, 14 : 29 - 51
[2] Perception-Aware Point-Based Value Iteration for Partially Observable Markov Decision Processes
Ghasemi, Mahsa
Topcu, Ufuk
[J]. PROCEEDINGS OF THE TWENTY-EIGHTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2019, : 2371 - 2377
[3] A Fast Approximation Method for Partially Observable Markov Decision Processes
Bingbing Liu
Yu Kang
Xiaofeng Jiang
Jiahu Qin
[J]. Journal of Systems Science and Complexity, 2018, 31 : 1423 - 1436
[4] A Fast Approximation Method for Partially Observable Markov Decision Processes
LIU Bingbing
KANG Yu
JIANG Xiaofeng
QIN Jiahu
[J]. Journal of Systems Science & Complexity, 2018, 31 (06) : 1423 - 1436
[5] A Fast Approximation Method for Partially Observable Markov Decision Processes
Liu Bingbing
Kang Yu
Jiang Xiaofeng
Qin Jiahu
[J]. JOURNAL OF SYSTEMS SCIENCE & COMPLEXITY, 2018, 31 (06) : 1423 - 1436
[6] Value-Function Approximations for Partially Observable Markov Decision Processes
Hauskrecht, Milos
[J]. Journal of Artificial Intelligence Research, 2001, 13 (00): : 33 - 94
[7] Value-function approximations for partially observable Markov decision processes
Hauskrecht, M
[J]. JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 2000, 13 : 33 - 94
[8] Partially Observable Markov Decision Processes and Robotics
Kurniawati, Hanna
[J]. ANNUAL REVIEW OF CONTROL ROBOTICS AND AUTONOMOUS SYSTEMS, 2022, 5 : 253 - 277
[9] Quantum partially observable Markov decision processes
Barry, Jennifer
Barry, Daniel T.
Aaronson, Scott
[J]. PHYSICAL REVIEW A, 2014, 90 (03):
[10] A tutorial on partially observable Markov decision processes
Littman, Michael L.
[J]. JOURNAL OF MATHEMATICAL PSYCHOLOGY, 2009, 53 (03) : 119 - 125

← 1 2 3 4 5 →