A Fast Analytical Algorithm for Solving Markov Decision Processes with Real-Valued Resources

Cited by: 0
Authors
Marecki, Janusz [1 ]
Koenig, Sven [1 ]
Tambe, Milind [1 ]
Affiliation
[1] Univ So Calif, Dept Comp Sci, Los Angeles, CA 90089 USA
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Subject classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Agents often have to construct plans that obey deadlines or, more generally, resource limits for real-valued resources whose consumption can only be characterized by probability distributions, such as execution time or battery power. These planning problems can be modeled with continuous-state Markov decision processes (MDPs), but existing solution methods are either inefficient or provide no guarantee on the quality of the resulting policy. We therefore present CPH, a novel solution method that first approximates, with any desired accuracy, the probability distributions over the resource consumptions with phase-type distributions, which use exponential distributions as building blocks. It then uses value iteration to solve the resulting MDPs, exploiting properties of exponential distributions to calculate the necessary convolutions accurately and efficiently while providing strong guarantees on the quality of the resulting policy. Our experimental feasibility study in a Mars rover domain demonstrates a substantial speedup over Lazy Approximation, which is currently the leading algorithm for solving continuous-state MDPs with quality guarantees.
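The idea of using exponential distributions as building blocks can be illustrated with the simplest phase-type distribution, an Erlang distribution (a chain of identical exponential phases). The sketch below is not the CPH algorithm from the paper; it only shows, under assumed names (`erlang_cdf`, `deadline_probability`) and assumed example numbers, how adding phases sharpens the approximation of a nearly fixed action duration, and how the probability of meeting a deadline then has a closed form.

```python
import math


def erlang_cdf(t, k, rate):
    """P(T <= t) where T is the sum of k i.i.d. Exponential(rate) phases.

    Closed form: 1 - exp(-rate*t) * sum_{n=0}^{k-1} (rate*t)^n / n!
    """
    if t <= 0:
        return 0.0
    x = rate * t
    term = 1.0   # (rate*t)^0 / 0!
    total = 1.0
    for n in range(1, k):
        term *= x / n   # builds (rate*t)^n / n! incrementally
        total += term
    return 1.0 - math.exp(-x) * total


def deadline_probability(mean_duration, deadline, phases):
    """Hypothetical helper: chance that an action finishes before a deadline
    when its duration is modeled by `phases` exponential phases whose total
    mean equals `mean_duration` (variance shrinks as mean_duration**2 / phases).
    """
    rate = phases / mean_duration
    return erlang_cdf(deadline, phases, rate)


if __name__ == "__main__":
    # Assumed example: mean duration 1.0, deadline 1.5.
    for k in (1, 10, 50):
        print(k, deadline_probability(1.0, 1.5, k))
```

With a single phase the model is a plain exponential distribution; with more phases the duration concentrates around its mean, so the deadline probability moves toward 1 when the deadline exceeds the mean and toward 0 when it does not.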
Pages: 2536-2541
Number of pages: 6