Partially Observable Markov Decision Processes and Robotics

Cited by: 37
Authors
Kurniawati, Hanna [1 ]
Affiliations
[1] Australian Natl Univ, Sch Comp, Canberra, ACT, Australia
Keywords
POMDP; planning under uncertainty; motion planning; VALUE-ITERATION; MOTION UNCERTAINTY; STATE; ALGORITHMS; COMPLEXITY; HORIZON; POMDPS; SPACE; TASKS
DOI
10.1146/annurev-control-042920-092451
Chinese Library Classification (CLC)
TP [Automation and computer technology]
Discipline code
0812
Abstract
Planning under uncertainty is critical to robotics. The partially observable Markov decision process (POMDP) is a mathematical framework for such planning problems. POMDPs are powerful because of their careful quantification of the nondeterministic effects of actions and the partial observability of the states. But for the same reason, they are notorious for their high computational complexity and have been deemed impractical for robotics. However, over the past two decades, the development of sampling-based approximate solvers has led to tremendous advances in POMDP-solving capabilities. Although these solvers do not generate the optimal solution, they can compute good POMDP solutions that significantly improve the robustness of robotics systems within reasonable computational resources, thereby making POMDPs practical for many realistic robotics problems. This article presents a review of POMDPs, emphasizing computational issues that have hindered their practicality in robotics and ideas in sampling-based solvers that have alleviated such difficulties, together with lessons learned from applying POMDPs to physical robots.
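To make the abstract's "partial observability of the states" concrete: a POMDP agent never knows its true state and instead maintains a belief, a probability distribution over states, updated by Bayes' rule after each action and observation. The sketch below is illustrative only and not taken from the article; the two-state robot-localization example, the model tables `T` and `Z`, and all probabilities are hypothetical.

```python
def belief_update(belief, action, observation, T, Z):
    """Bayes filter at the core of POMDP planning:
    b'(s') ∝ Z(o | s', a) * sum_s T(s' | s, a) * b(s)."""
    states = list(belief)
    new_belief = {}
    for s2 in states:
        # Prediction step: push the belief through the transition model.
        predicted = sum(T[(s, action)][s2] * belief[s] for s in states)
        # Correction step: weight by the likelihood of the observation.
        new_belief[s2] = Z[(s2, action)][observation] * predicted
    norm = sum(new_belief.values())
    return {s: p / norm for s, p in new_belief.items()}

# Hypothetical two-state example: the robot is at the "goal" cell or "lost".
T = {  # T[(s, action)][s'] = transition probability
    ("goal", "move"): {"goal": 0.9, "lost": 0.1},
    ("lost", "move"): {"goal": 0.3, "lost": 0.7},
}
Z = {  # Z[(s', action)][o] = observation probability
    ("goal", "move"): {"beep": 0.8, "silence": 0.2},
    ("lost", "move"): {"beep": 0.1, "silence": 0.9},
}
# Starting from total uncertainty, hearing a "beep" after moving
# sharply concentrates the belief on the goal state.
b = belief_update({"goal": 0.5, "lost": 0.5}, "move", "beep", T, Z)
```

The computational difficulty the abstract refers to stems from planning over this continuous space of beliefs rather than over discrete states; the sampling-based solvers it surveys sidestep exhaustive enumeration by sampling reachable beliefs (or representing them with particles) instead.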
Pages: 253-277 (25 pages)