An On-Line POMDP Solver for Continuous Observation Spaces

Cited by: 18
Authors
Hoerger, Marcus [1]
Kurniawati, Hanna [1]
Affiliations
[1] Australian Natl Univ, Coll Engn & Comp Sci, Canberra, ACT 7500, Australia
DOI
10.1109/ICRA48506.2021.9560943
Chinese Library Classification (CLC) number
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
Planning under partial observability is essential for autonomous robots. A principled way to address such planning problems is the Partially Observable Markov Decision Process (POMDP). Although solving POMDPs is computationally intractable, substantial advancements have been achieved in developing approximate POMDP solvers in the past two decades. However, computing robust solutions for problems with continuous observation spaces remains challenging. To compute tractable policies, most on-line solvers rely on discretising the observation space or artificially limiting the number of observations that are considered during planning. In this paper we propose a new on-line POMDP solver, called Lazy Belief Extraction for Continuous Observation POMDPs (LABECOP), that combines methods from Monte-Carlo-Tree-Search and particle filtering to construct a policy representation which doesn't require discretised observation spaces and avoids limiting the number of observations considered during planning. Experiments on three different problems involving continuous observation spaces indicate that LABECOP performs similarly to or better than state-of-the-art POMDP solvers.
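As a rough illustration of the particle-filtering ingredient named in the abstract, the sketch below updates a particle belief from a single continuous observation by likelihood weighting, without binning the observation. It is not LABECOP and not the authors' code: the 1-D state, the Gaussian sensor model, and every name in it (`gaussian_likelihood`, `update_belief`, `resample`, `sigma`) are assumptions made only for this example.

```python
import math
import random


def gaussian_likelihood(observation, expected, sigma):
    """Density of N(expected, sigma^2) at a continuous observation (assumed sensor model)."""
    z = (observation - expected) / sigma
    return math.exp(-0.5 * z * z) / (sigma * math.sqrt(2.0 * math.pi))


def update_belief(particles, weights, observation, sigma=0.5):
    """Reweight particles by the observation likelihood and normalise.

    The observation is used as a real number; it is never discretised.
    """
    new_weights = [w * gaussian_likelihood(observation, s, sigma)
                   for s, w in zip(particles, weights)]
    total = sum(new_weights)
    if total == 0.0:
        # The observation is implausible for every particle: keep the prior weights.
        return list(weights)
    return [w / total for w in new_weights]


def resample(particles, weights, rng):
    """Draw an equally weighted particle set in proportion to the weights."""
    return rng.choices(particles, weights=weights, k=len(particles))


# Hypothetical 1-D example: the state is a position in [0, 10], observed with noise.
rng = random.Random(0)
particles = [rng.uniform(0.0, 10.0) for _ in range(200)]
weights = [1.0 / len(particles)] * len(particles)

weights = update_belief(particles, weights, observation=3.2)
belief_mean = sum(s * w for s, w in zip(particles, weights))
resampled = resample(particles, weights, rng)
```

Because the weights come directly from the observation density, any real-valued observation can be handled without enumerating or discretising the observation space. How LABECOP combines this kind of update with tree search is described in the paper itself, not here.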
Pages: 7643-7649
Number of pages: 7