An On-Line POMDP Solver for Continuous Observation Spaces

被引:18
|
作者
Hoerger, Marcus [1 ]
Kurniawati, Hanna [1 ]
机构
[1] Australian Natl Univ, Coll Engn & Comp Sci, Canberra, ACT 7500, Australia
关键词
D O I
10.1109/ICRA48506.2021.9560943
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Planning under partial obervability is essential for autonomous robots. A principled way to address such planning problems is the Partially Observable Markov Decision Process (POMDP). Although solving POMDPs is computationally intractable, substantial advancements have been achieved in developing approximate POMDP solvers in the past two decades. However, computing robust solutions for problems with continuous observation spaces remains challenging. Most on-line solvers rely on discretising the observation space or artificially limiting the number of observations that are considered during planning to compute tractable policies. In this paper we propose a new on-line POMDP solver, called Lazy Belief Extraction for Continuous Observation POMDPs (LABECOP), that combines methods from Monte-Carlo-Tree-Search and particle filtering to construct a policy reprentation which doesn't require discretised observation spaces and avoids limiting the number of observations considered during planning. Experiments on three different problems involving continuous observation spaces indicate that LABECOP performs similar or better than stateof-the-art POMDP solvers.
引用
收藏
页码:7643 / 7649
页数:7
相关论文
共 50 条
  • [41] LINEARIZATION OF A PROBLEM IN ON-LINE CONTROL OF CONTINUOUS PRODUCTION
    TSODIKOV, YM
    AUTOMATION AND REMOTE CONTROL, 1971, 32 (02) : 281 - &
  • [42] On-Line Inference for Fuzzy Controllers in Continuous Domains
    Yan, Fei
    Lu, Shou-rong
    FUZZY INFORMATION AND ENGINEERING, VOLUME 2, 2009, 62 : 1111 - +
  • [43] On-line color monitoring in continuous textile dyeing
    Kazmi, SZ
    Grady, FL
    Mock, GN
    Hodge, GL
    ISA TRANSACTIONS, 1996, 35 (01) : 33 - 43
  • [44] Determinants of Acceptance of On-line Courses for Continuous Studies
    Xian, Xuelin
    2019 INTERNATIONAL SYMPOSIUM ON EDUCATIONAL TECHNOLOGY (ISET 2019), 2019, : 284 - 286
  • [45] On-line parameter estimation in a continuous polymerization process
    Sirohi, A
    Choi, KY
    INDUSTRIAL & ENGINEERING CHEMISTRY RESEARCH, 1996, 35 (04) : 1332 - 1343
  • [46] On-line or not on-line?
    Michel Brusin
    Materials and Structures, 2000, 33 : 218 - 218
  • [47] On-line or not on-line?
    Brusin, M
    MATERIALS AND STRUCTURES, 2000, 33 (228) : 218 - 218
  • [48] Uncertainty management for on-line optimisation of a POMDP-based large-scale spoken dialogue system
    Daubigney, Lucie
    Gasic, Milica
    Chandramohan, Senthilkumar
    Geist, Matthieu
    Pietquin, Olivier
    Young, Steve
    12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 1308 - +
  • [49] On-line regression competitive with reproducing kernel Hilbert spaces
    Vovk, Vladimir
    THEORY AND APPLICATIONS OF MODELS OF COMPUTATION, PROCEEDINGS, 2006, 3959 : 452 - 463
  • [50] Solving POMDPs with Continuous or Large Discrete Observation Spaces
    Hoey, Jesse
    Poupart, Pascal
    19TH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE (IJCAI-05), 2005, : 1332 - 1338