An Incremental Sampling-based Algorithm for Stochastic Optimal Control

被引：0

作者：

Vu Anh Huynh ^{[1
]}

Karaman, Sertac ^{[1
]}

Frazzoli, Emilio ^{[1
]}

机构：

[1] MIT, Lab Informat & Decis Syst, Cambridge, MA 02139 USA

来源：

2012 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA) | 2012年

关键词：

JACOBI-BELLMAN EQUATIONS; APPROXIMATION;

D O I：

暂无

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

In this paper, we consider a class of continuous-time, continuous-space stochastic optimal control problems. Building upon recent advances in Markov chain approximation methods and sampling-based algorithms for deterministic path planning, we propose a novel algorithm called the incremental Markov Decision Process (iMDP) to compute incrementally control policies that approximate arbitrarily well an optimal policy in terms of the expected cost. The main idea behind the algorithm is to generate a sequence of finite discretizations of the original problem through random sampling of the state space. At each iteration, the discretized problem is a Markov Decision Process that serves as an incrementally refined model of the original problem. We show that with probability one, (i) the sequence of the optimal value functions for each of the discretized problems converges uniformly to the optimal value function of the original stochastic optimal control problem, and (ii) the original optimal value function can be computed efficiently in an incremental manner using asynchronous value iterations. Thus, the proposed algorithm provides an anytime approach to the computation of optimal control policies of the continuous problem. The effectiveness of the proposed approach is demonstrated on motion planning and control problems in cluttered environments in the presence of process noise.

引用

页码：2865 / 2872

页数：8

共 50 条

[1] An incremental sampling-based algorithm for stochastic optimal control
Huynh, Vu Anh
Karaman, Sertac
Frazzoli, Emilio
[J]. INTERNATIONAL JOURNAL OF ROBOTICS RESEARCH, 2016, 35 (04): : 305 - 333
[2] Information-Theoretic Stochastic Optimal Control via Incremental Sampling-based Algorithms
Arslan, Oktay
Theodorou, Evangelos A.
Tsiotras, Panagiotis
[J]. 2014 IEEE SYMPOSIUM ON ADAPTIVE DYNAMIC PROGRAMMING AND REINFORCEMENT LEARNING (ADPRL), 2014, : 71 - 78
[3] Sampling-Based Nonlinear Stochastic Optimal Control for Neuromechanical Systems
Reed, Emily A.
Pereira, Marcus A.
Valero-Cuevas, Francisco J.
Theodorou, Evangelos A.
[J]. 42ND ANNUAL INTERNATIONAL CONFERENCES OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY: ENABLING INNOVATIVE TECHNOLOGIES FOR GLOBAL HEALTHCARE EMBC'20, 2020, : 4694 - 4699
[4] Sampling-based Stochastic Optimal Control with Metric Interval Temporal Logic Specifications
Montana, Felipe J.
Liu, Jun
Dodd, Tony J.
[J]. 2016 IEEE CONFERENCE ON CONTROL APPLICATIONS (CCA), 2016,
[5] Provably near-optimal sampling-based policies for stochastic inventory control models
Levi, Retsef
Roundy, Robin O.
Shmoys, David B.
[J]. MATHEMATICS OF OPERATIONS RESEARCH, 2007, 32 (04) : 821 - 839
[6] Optimal Kinodynamic Motion Planning using Incremental Sampling-based Methods
Karaman, Sertac
Frazzoli, Emilio
[J]. 49TH IEEE CONFERENCE ON DECISION AND CONTROL (CDC), 2010, : 7681 - 7687
[7] Incremental Sampling-based Algorithm for Minimum-violation Motion Planning
Castro, Luis I. Reyes
Chaudhari, Pratik
Tumova, Jana
Karaman, Sertac
Frazzoli, Emilio
Rus, Daniela
[J]. 2013 IEEE 52ND ANNUAL CONFERENCE ON DECISION AND CONTROL (CDC), 2013, : 3217 - 3224
[8] Optimal sampling-based neural networks for uncertainty quantification and stochastic optimization
Gupta, Subham
Paudel, Achyut
Thapa, Mishal
Mulani, Sameer B.
Walters, Robert W.
[J]. AEROSPACE SCIENCE AND TECHNOLOGY, 2023, 133
[9] A Martingale Approach and Time-Consistent Sampling-based Algorithms for Risk Management in Stochastic Optimal Control
Vu Anh Huynh
Kogan, Leonid
Frazzoli, Emilio
[J]. 2014 IEEE 53RD ANNUAL CONFERENCE ON DECISION AND CONTROL (CDC), 2014, : 1858 - 1865
[10] Asymptotically-optimal Path Planning for Manipulation using Incremental Sampling-based Algorithms
Perez, Alejandro
Karaman, Sertac
Shkolnik, Alexander
Frazzoli, Emilio
Teller, Seth
Walter, Matthew R.
[J]. 2011 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS, 2011, : 4307 - 4313

← 1 2 3 4 5 →