An On-Line POMDP Solver for Continuous Observation Spaces

被引:18
|
作者
Hoerger, Marcus [1 ]
Kurniawati, Hanna [1 ]
机构
[1] Australian Natl Univ, Coll Engn & Comp Sci, Canberra, ACT 7500, Australia
关键词
D O I
10.1109/ICRA48506.2021.9560943
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Planning under partial obervability is essential for autonomous robots. A principled way to address such planning problems is the Partially Observable Markov Decision Process (POMDP). Although solving POMDPs is computationally intractable, substantial advancements have been achieved in developing approximate POMDP solvers in the past two decades. However, computing robust solutions for problems with continuous observation spaces remains challenging. Most on-line solvers rely on discretising the observation space or artificially limiting the number of observations that are considered during planning to compute tractable policies. In this paper we propose a new on-line POMDP solver, called Lazy Belief Extraction for Continuous Observation POMDPs (LABECOP), that combines methods from Monte-Carlo-Tree-Search and particle filtering to construct a policy reprentation which doesn't require discretised observation spaces and avoids limiting the number of observations considered during planning. Experiments on three different problems involving continuous observation spaces indicate that LABECOP performs similar or better than stateof-the-art POMDP solvers.
引用
收藏
页码:7643 / 7649
页数:7
相关论文
共 50 条
  • [11] From Off-Line to Continuous On-line Maintenance
    Pezze, Mauro
    2012 28TH IEEE INTERNATIONAL CONFERENCE ON SOFTWARE MAINTENANCE (ICSM), 2012, : 2 - 3
  • [12] Predictive on-line monitoring of continuous processes
    Chen, G
    McAvoy, TJ
    JOURNAL OF PROCESS CONTROL, 1998, 8 (5-6) : 409 - 420
  • [13] Predictive on-line monitoring of continuous processes
    Univ of Maryland, College Park, United States
    J Process Control, 5-6 (409-420):
  • [14] CONTINUOUS, ON-LINE, COMPUTER EKG ANALYSIS
    WAGGONER, DM
    WALLACE, AG
    CLARK, DO
    RIPPERTO.LA
    CIRCULATION, 1969, 40 (4S3) : I210 - &
  • [15] ADAPTIVE ON-LINE OPTIMIZATION FOR CONTINUOUS BIOREACTORS
    Rolf, Michael J.
    Lim, Henry C.
    CHEMICAL ENGINEERING COMMUNICATIONS, 1984, 29 (1-6) : 229 - 255
  • [17] On-line Monitoring Method and Observation System of Transmission Line Icing
    Geng Xin
    Hu Xiaoguang
    ICIEA: 2009 4TH IEEE CONFERENCE ON INDUSTRIAL ELECTRONICS AND APPLICATIONS, VOLS 1-6, 2009, : 1256 - 1260
  • [18] On-line estimation of smooth signals with partial observation
    P. L. Chow
    R. Z. Khasminskii
    Problems of Information Transmission, 2006, 42 : 330 - 339
  • [19] On-line observation of interlaminar damage by ultrasonic inspection
    Dong, YJ
    Ye, N
    Bai, YL
    COMPOSITES SCIENCE AND TECHNOLOGY, 1999, 59 (06) : 957 - 961
  • [20] The Lesson Observation On-line (Evidence Portfolio) Platform
    Cooper, David G.
    AUSTRALIAN JOURNAL OF TEACHER EDUCATION, 2015, 40 (01): : 83 - 93