A Fast Analytical Algorithm for Solving Markov Decision Processes with Real-Valued Resources

被引：0

作者：

Marecki, Janusz ^{[1
]}

Koenig, Sven ^{[1
]}

Tambe, Milind ^{[1
]}

机构：

[1] Univ So Calif, Dept Comp Sci, Los Angeles, CA 90089 USA

来源：

20TH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE | 2007年

关键词：

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Agents often have to construct plans that obey deadlines or, more generally, resource limits for real-valued resources whose consumption can only be characterized by probability distributions, such as execution time or battery power. These planning problems can be modeled with continuous state Markov decision processes (MDPs) but existing solution methods are either inefficient or provide no guarantee on the quality of the resulting policy. We therefore present CPH, a novel solution method that solves the planning problems by first approximating with any desired accuracy the probability distributions over the resource consumptions with phase-type distributions, which use exponential distributions as building blocks. It then uses value iteration to solve the resulting MDPs by exploiting properties of exponential distributions to calculate the necessary convolutions accurately and efficiently while providing strong guarantees on the quality of the resulting policy. Our experimental feasibility study in a Mars rover domain demonstrates a substantial speedup over Lazy Approximation, which is currently the leading algorithm for solving continuous state MDPs with quality guarantees.

引用

页码：2536 / 2541

页数：6

共 50 条

[1] A FAST FOURIER TRANSFORM ALGORITHM FOR REAL-VALUED SERIES
BERGLAND, GD
COMMUNICATIONS OF THE ACM, 1968, 11 (10) : 703 - +
[2] Recurrent Extensions of Real-Valued Self-Similar Markov Processes
Panti, H.
Pardo, J. C.
Rivero, V. M.
POTENTIAL ANALYSIS, 2020, 53 (03) : 899 - 920
[3] Recurrent Extensions of Real-Valued Self-Similar Markov Processes
H. Pantí
J. C. Pardo
V. M. Rivero
Potential Analysis, 2020, 53 : 899 - 920
[4] The Lamperti representation of real-valued self-similar Markov processes
Chaumont, Loic
Panti, Henry
Rivero, Victor
BERNOULLI, 2013, 19 (5B) : 2494 - 2523
[5] Fast algorithm for root-MUSIC with real-valued eigendecomposition
Liu Congfeng
Liao Guisheng
PROCEEDINGS OF 2006 CIE INTERNATIONAL CONFERENCE ON RADAR, VOLS 1 AND 2, 2006, : 947 - +
[6] Fast computation algorithm for the discrete Fourier transform of a real-valued sequence
Tsuchiya, M
ELECTRONICS AND COMMUNICATIONS IN JAPAN PART III-FUNDAMENTAL ELECTRONIC SCIENCE, 1997, 80 (09): : 11 - 20
[7] FAST FOURIER-TRANSFORM ALGORITHM FOR SYMMETRIC REAL-VALUED SERIES
ZIEGLER, H
IEEE TRANSACTIONS ON AUDIO AND ELECTROACOUSTICS, 1972, AU20 (05): : 353 - &
[8] An adaptive sampling algorithm for solving Markov decision processes
Chang, HS
Fu, MC
Hu, JQ
Marcus, SI
OPERATIONS RESEARCH, 2005, 53 (01) : 126 - 139
[9] TRANSIENT PHENOMENA FOR REAL-VALUED MARKOV-CHAINS
KORSHUNOV, DA
THEORY OF PROBABILITY AND ITS APPLICATIONS, 1993, 38 (01) : 149 - 152
[10] Software manipulations to speed up a real-valued fast Fourier transform algorithm
Luetkenhoener, B., 1600, (29):

← 1 2 3 4 5 →