Anytime point-based approximations for large POMDPs

被引:197
|
作者
Pineau, Joelle [1 ]
Gordon, Geoffrey
Thrun, Sebastian
机构
[1] McGill Univ, Sch Comp Sci, Montreal, PQ H3A 2A7, Canada
[2] Carnegie Mellon Univ, Machine Learning Dept, Pittsburgh, PA 15232 USA
[3] Stanford Univ, Dept Comp Sci, Stanford, CA 94305 USA
关键词
D O I
10.1613/jair.2078
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The Partially Observable Markov Decision Process has long been recognized as a rich framework for real-world planning and control problems, especially in robotics. However exact solutions in this framework are typically computationally intractable for all but the smallest problems. A well-known technique for speeding up POMDP solving involves performing value backups at specific belief points, rather than over the entire belief simplex. The efficiency of this approach, however, depends greatly on the selection of points. This paper presents a set of novel techniques for selecting informative belief points which work well in practice. The point selection procedure is combined with point-based value backups to form an effective anytime POMDP algorithm called Point-Based Value Iteration (PBVI). The first aim of this paper is to introduce this algorithm and present a theoretical analysis justifying the choice of belief selection technique. The second aim of this paper is to provide a thorough empirical comparison between PBVI and other state-of-the-art POMDP methods, in particular the Perseus algorithm, in an effort to highlight their similarities and differences. Evaluation is performed using both standard POMDP domains and realistic robotic tasks.
引用
收藏
页码:335 / 380
页数:46
相关论文
共 50 条
  • [41] Algorithms to support point-based cadcam
    Cripps, RJ
    [J]. INTERNATIONAL JOURNAL OF MACHINE TOOLS & MANUFACTURE, 2003, 43 (04): : 425 - 432
  • [42] Review of exceptional point-based sensors
    Wiersig, Jan
    [J]. PHOTONICS RESEARCH, 2020, 8 (09) : 1457 - 1467
  • [43] Prioritizing Point-Based POMDP Solvers
    Shani, Guy
    Brafman, Ronen I.
    Shimony, Solomon Eyal
    [J]. IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART B-CYBERNETICS, 2008, 38 (06): : 1592 - 1605
  • [44] PointINS: Point-Based Instance Segmentation
    Qi, Lu
    Wang, Yi
    Chen, Yukang
    Chen, Ying-Cong
    Zhang, Xiangyu
    Sun, Jian
    Jia, Jiaya
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2022, 44 (10) : 6377 - 6392
  • [45] Point-based multiscale surface representation
    Pauly, Mark
    Kobbelt, Leif P.
    Gross, Markus
    [J]. ACM TRANSACTIONS ON GRAPHICS, 2006, 25 (02): : 177 - 193
  • [46] Preprocessing for point-based algorithms of POMDP
    National Key Laboratory for Novel Software Technology, Nanjing University, Nanjing 210093, China
    不详
    [J]. Ruan Jian Xue Bao, 2008, 6 (1309-1316):
  • [47] Perceptual Optimization for Point-Based Point Cloud Rendering
    YIN Yujie
    CHEN Zhang
    [J]. ZTE Communications, 2023, 21 (04) : 47 - 53
  • [48] Review of exceptional point-based sensors
    JAN WIERSIG
    [J]. Photonics Research, 2020, (09) : 1457 - 1467
  • [49] Interactive point-based isosurface extraction
    Livnat, Y
    Tricoche, X
    [J]. IEEE VISUALIZATION 2004, PROCEEEDINGS, 2004, : 457 - 464
  • [50] Subsurface scattering in point-based rendering
    不详
    [J]. PHOTONICS SPECTRA, 2011, 45 (02) : 61 - 61