An integrated approach to solving influence diagrams and finite-horizon partially observable decision processes

被引:1
|
作者
Hansen, Eric A. [1 ]
机构
[1] Mississippi State Univ, Dept Comp Sci & Engn, Mississippi State, MS 39762 USA
基金
美国国家科学基金会;
关键词
Influence diagram; Variable elimination; Partially observable Markov decision process; Dynamic programming; Decision-theoretic planning; ELIMINATION; INFERENCE;
D O I
10.1016/j.artint.2020.103431
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We show how to integrate a variable elimination approach to solving influence diagrams with a value iteration approach to solving finite-horizon partially observable Markov decision processes (POMDPs). The integration of these approaches creates a variable elimination algorithm for influence diagrams that has much more relaxed constraints on elimination order, which allows improved scalability in many cases. The new algorithm can also be viewed as a generalization of the value iteration algorithm for POMDPs that solves non-Markovian as well as Markovian problems, in addition to leveraging a factored representation for improved efficiency. The development of a single algorithm that integrates and generalizes both of these classic algorithms, one for influence diagrams and the other for POMDPs, unifies these two approaches to solving Bayesian decision problems in a way that combines their complementary advantages. (C) 2020 Elsevier B.V. All rights reserved.
引用
下载
收藏
页数:47
相关论文
共 50 条
  • [21] Evaluation of CUSUM Charts for Finite-Horizon Processes
    Nenes, George
    Tagaras, George
    COMMUNICATIONS IN STATISTICS-SIMULATION AND COMPUTATION, 2010, 39 (03) : 578 - 597
  • [22] Decomposition Methods for Solving Finite-Horizon Large MDPs
    el Akraoui, Bouchra
    Daoui, Cherki
    Larach, Abdelhadi
    Rahhali, Khalid
    JOURNAL OF MATHEMATICS, 2022, 2022
  • [23] Multiconstrained Finite-Horizon Piecewise Deterministic Markov Decision Processes with Unbounded Transition Rates
    Huang, Yonghui
    Guo, Xianping
    MATHEMATICS OF OPERATIONS RESEARCH, 2020, 45 (02) : 641 - 659
  • [24] PARTIALLY OBSERVABLE MARKOV DECISION PROCESSES WITH PARTIALLY OBSERVABLE RANDOM DISCOUNT FACTORS
    Martinez-Garcia, E. Everardo
    Minjarez-Sosa, J. Adolfo
    Vega-Amaya, Oscar
    KYBERNETIKA, 2022, 58 (06) : 960 - 983
  • [25] A Bayesian Approach for Learning and Planning in Partially Observable Markov Decision Processes
    Ross, Stephane
    Pineau, Joelle
    Chaib-draa, Brahim
    Kreitmann, Pierre
    JOURNAL OF MACHINE LEARNING RESEARCH, 2011, 12 : 1729 - 1770
  • [26] Complexity of finite-horizon Markov decision process problems
    Mundhenk, M
    Goldsmith, J
    Lusena, C
    Allender, E
    JOURNAL OF THE ACM, 2000, 47 (04) : 681 - 720
  • [27] Quantum partially observable Markov decision processes
    Barry, Jennifer
    Barry, Daniel T.
    Aaronson, Scott
    PHYSICAL REVIEW A, 2014, 90 (03):
  • [28] Partially Observable Markov Decision Processes and Robotics
    Kurniawati, Hanna
    ANNUAL REVIEW OF CONTROL ROBOTICS AND AUTONOMOUS SYSTEMS, 2022, 5 : 253 - 277
  • [29] A tutorial on partially observable Markov decision processes
    Littman, Michael L.
    JOURNAL OF MATHEMATICAL PSYCHOLOGY, 2009, 53 (03) : 119 - 125
  • [30] Viscosity solutions approach to finite-horizon continuous-time Markov decision process
    Liao, Zhong-Wei
    Shao, Jinghai
    INTERNATIONAL JOURNAL OF CONTROL, 2024,