An On-Line POMDP Solver for Continuous Observation Spaces

Cited by: 18

Authors:

Hoerger, Marcus [1]

Kurniawati, Hanna [1]

Affiliations:

[1] Australian Natl Univ, Coll Engn & Comp Sci, Canberra, ACT 7500, Australia

Source:

2021 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2021) | 2021

Keywords:

DOI:

10.1109/ICRA48506.2021.9560943

Chinese Library Classification:

TP [Automation technology, computer technology];

Discipline classification code:

0812;

Abstract:

Planning under partial observability is essential for autonomous robots. A principled way to address such planning problems is the Partially Observable Markov Decision Process (POMDP). Although solving POMDPs is computationally intractable, substantial advancements have been achieved in developing approximate POMDP solvers in the past two decades. However, computing robust solutions for problems with continuous observation spaces remains challenging. Most on-line solvers rely on discretising the observation space or artificially limiting the number of observations that are considered during planning to compute tractable policies. In this paper, we propose a new on-line POMDP solver, called Lazy Belief Extraction for Continuous Observation POMDPs (LABECOP), that combines methods from Monte-Carlo-Tree-Search and particle filtering to construct a policy representation that does not require discretised observation spaces and avoids limiting the number of observations considered during planning. Experiments on three different problems involving continuous observation spaces indicate that LABECOP performs similarly to or better than state-of-the-art POMDP solvers.

Pages: 7643 - 7649

Number of pages: 7
