Forward Search Value Iteration For POMDPs

被引:0
|
作者
Shani, Guy [1 ]
Brafman, Ronen I. [1 ]
Shimony, Solomon E. [1 ]
机构
[1] Ben Gurion Univ Negev, Dept Comp Sci, Beer Sheva, Israel
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Recent scaling up of POMDP solvers towards realistic applications is largely due to point-based methods which quickly converge to an approximate solution for medium-sized problems. Of this family HSVI, which uses trial-based asynchronous value iteration, can handle the largest domains. In this paper we suggest a new algorithm, FSVI, that uses the underlying MDP to traverse the belief space towards rewards, finding sequences of useful backups, and show how it scales up better than HSVI on larger benchmarks.
引用
收藏
页码:2619 / 2624
页数:6
相关论文
共 50 条
  • [1] Goal-HSVI: Heuristic Search Value Iteration for Goal-POMDPs
    Horak, Karel
    Bosansky, Branislav
    Chatterjee, Krishnendu
    PROCEEDINGS OF THE TWENTY-SEVENTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2018, : 4764 - 4770
  • [2] A Probabilistic Forward Search Value Iteration Algorithm for POMDP
    Liu, Feng
    Lei, Cheng
    Liu, Hanyi
    Wang, Chongjun
    2019 IEEE 31ST INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI 2019), 2019, : 384 - 391
  • [3] nso-HSVI: A not-so-optimistic Heuristic Search Value Iteration Algorithm for POMDPs
    Liu, Feng
    Li, Haibo
    Wang, Chongjun
    2014 IEEE 26TH INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI), 2014, : 689 - 693
  • [4] Point-based value iteration for continuous POMDPs
    Institut de Robòtica i Informàtica Industrial, UPC-CSIC, Llorens i Artigas 4-6, 08028, Barcelona, Spain
    不详
    不详
    J. Mach. Learn. Res., 2006, (2329-2367):
  • [5] Point-based value iteration for continuous POMDPs
    Porta, Josep M.
    Vlassis, Nikos
    Spaan, Matthijs T. J.
    Poupart, Pascal
    JOURNAL OF MACHINE LEARNING RESEARCH, 2006, 7 : 2329 - 2367
  • [6] Point-Based Value Iteration for VAR-POMDPs
    Zheng, Wei
    Lin, Hai
    IEEE CONTROL SYSTEMS LETTERS, 2022, 6 : 7 - 12
  • [7] Point-based Value Iteration for VAR-POMDPs
    Zheng, Wei
    Lin, Hai
    2021 AMERICAN CONTROL CONFERENCE (ACC), 2021, : 1143 - 1148
  • [8] Perseus: Randomized point-based value iteration for POMDPs
    Spaan, MTJ
    Vlassis, N
    JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 2005, 24 : 195 - 220
  • [9] Monte Carlo Value Iteration for Continuous-State POMDPs
    Bai, Haoyu
    Hsu, David
    Lee, Wee Sun
    Ngo, Vien A.
    ALGORITHMIC FOUNDATIONS OF ROBOTICS IX, 2010, 68 : 175 - 191
  • [10] Point-based online value iteration algorithm for POMDPs
    Wu, Bo
    Wu, Min
    She, Jin-Hua
    Ruan Jian Xue Bao/Journal of Software, 2013, 24 (01): : 25 - 36