A Fast Analytical Algorithm for Solving Markov Decision Processes with Real-Valued Resources

被引:0
|
作者
Marecki, Janusz [1 ]
Koenig, Sven [1 ]
Tambe, Milind [1 ]
机构
[1] Univ So Calif, Dept Comp Sci, Los Angeles, CA 90089 USA
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Agents often have to construct plans that obey deadlines or, more generally, resource limits for real-valued resources whose consumption can only be characterized by probability distributions, such as execution time or battery power. These planning problems can be modeled with continuous state Markov decision processes (MDPs) but existing solution methods are either inefficient or provide no guarantee on the quality of the resulting policy. We therefore present CPH, a novel solution method that solves the planning problems by first approximating with any desired accuracy the probability distributions over the resource consumptions with phase-type distributions, which use exponential distributions as building blocks. It then uses value iteration to solve the resulting MDPs by exploiting properties of exponential distributions to calculate the necessary convolutions accurately and efficiently while providing strong guarantees on the quality of the resulting policy. Our experimental feasibility study in a Mars rover domain demonstrates a substantial speedup over Lazy Approximation, which is currently the leading algorithm for solving continuous state MDPs with quality guarantees.
引用
收藏
页码:2536 / 2541
页数:6
相关论文
共 50 条
  • [31] Exploiting Linkage Information in Real-Valued Optimization with the Real-Valued Gene-Pool Optimal Mixing Evolutionary Algorithm
    Bouter, Anton
    Alderliesten, Tanja
    Witteveen, Cees
    Bosman, Peter A. N.
    PROCEEDINGS OF THE 2017 GENETIC AND EVOLUTIONARY COMPUTATION CONFERENCE (GECCO'17), 2017, : 705 - 712
  • [32] A DECIMATION IN FREQUENCY FFT ALGORITHM FOR REAL-VALUED SEQUENCES
    MECKELBURG, HJ
    LIPKA, D
    AEU-ARCHIV FUR ELEKTRONIK UND UBERTRAGUNGSTECHNIK-INTERNATIONAL JOURNAL OF ELECTRONICS AND COMMUNICATIONS, 1987, 41 (03): : 176 - 178
  • [33] A real-valued periodic orthogonal sequence derived from a real-valued shift-orthogonal finite-length sequence and its fast periodic correlation algorithm
    Matsumoto, Takahiro
    Tanada, Yoshihiro
    ELECTRONICS AND COMMUNICATIONS IN JAPAN PART III-FUNDAMENTAL ELECTRONIC SCIENCE, 2007, 90 (10): : 18 - 28
  • [34] Fast learning complex-valued classifiers for real-valued classification problems
    Savitha, R.
    Suresh, S.
    Sundararajan, N.
    INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2013, 4 (05) : 469 - 476
  • [35] Real-coded ECGA for solving decomposable real-valued optimization problems
    Li, Minqiang
    Goldberg, David E.
    Sastry, Kumara
    Yu, Tian-Li
    2007 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION, VOLS 1-10, PROCEEDINGS, 2007, : 2194 - +
  • [36] Optimization and Application of Real-valued Negative Selection Algorithm
    Xu Aiqiang
    Liu Yong
    Zhao Xiuli
    Yang Chunying
    Li Tingjun
    PEEA 2011, 2011, 23
  • [37] Fast learning complex-valued classifiers for real-valued classification problems
    R. Savitha
    S. Suresh
    N. Sundararajan
    International Journal of Machine Learning and Cybernetics, 2013, 4 : 469 - 476
  • [38] Fast array direction finding algorithm based on real-valued closed-form rooting
    Meng X.
    Li X.
    Yan F.
    Xue J.
    Harbin Gongye Daxue Xuebao/Journal of Harbin Institute of Technology, 2019, 51 (11): : 40 - 46
  • [39] Three-way decision for incomplete real-valued data
    Wen, Haili
    Xia, Fei
    Tang, Hongxiang
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2020, 39 (05) : 7843 - 7862
  • [40] An efficient pipelined architecture for real-valued Fast Fourier Transform
    Kumar, M. Aravind
    Chari, K. Manjunatha
    INTERNATIONAL JOURNAL OF ELECTRONICS, 2017, 104 (04) : 692 - 708