An Incremental Sampling-based Algorithm for Stochastic Optimal Control

Cited by: 0
Authors
Huynh, Vu Anh [1]
Karaman, Sertac [1]
Frazzoli, Emilio [1]
Affiliations
[1] MIT, Lab Informat & Decis Syst, Cambridge, MA 02139 USA
Keywords
JACOBI-BELLMAN EQUATIONS; APPROXIMATION;
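DOI
Not available
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract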
In this paper, we consider a class of continuous-time, continuous-space stochastic optimal control problems. Building upon recent advances in Markov chain approximation methods and sampling-based algorithms for deterministic path planning, we propose a novel algorithm called the incremental Markov Decision Process (iMDP) to incrementally compute control policies that approximate an optimal policy arbitrarily well in terms of the expected cost. The main idea behind the algorithm is to generate a sequence of finite discretizations of the original problem through random sampling of the state space. At each iteration, the discretized problem is a Markov Decision Process that serves as an incrementally refined model of the original problem. We show that with probability one, (i) the sequence of the optimal value functions for each of the discretized problems converges uniformly to the optimal value function of the original stochastic optimal control problem, and (ii) the original optimal value function can be computed efficiently in an incremental manner using asynchronous value iterations. Thus, the proposed algorithm provides an anytime approach to the computation of optimal control policies of the continuous problem. The effectiveness of the proposed approach is demonstrated on motion planning and control problems in cluttered environments in the presence of process noise.
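The idea summarized in the abstract (sample the state space, treat the samples as a finite Markov Decision Process, and refine the value function with asynchronous Bellman updates) can be illustrated with a toy sketch. The code below is not the authors' iMDP algorithm. The 1-D exit-time problem, the function name `incremental_value_iteration`, the nearest-sample snapping, and the two-point approximation of the noise are all assumptions made purely for illustration.

```python
import bisect
import random


def incremental_value_iteration(n_iters=500, dt=0.01, sigma=0.3,
                                updates_per_iter=20, seed=0):
    """Toy 1-D example (hypothetical, not the paper's algorithm).

    Dynamics: dx = u dt + sigma dw with u in {-1, +1}.
    Cost: expected time until x leaves the interval (0, 1).
    """
    rng = random.Random(seed)
    states = [0.0, 1.0]            # sorted samples; the endpoints are terminal
    value = {0.0: 0.0, 1.0: 0.0}   # current value estimate for each sample

    def nearest(x):
        # Snap x to the closest sampled state; points outside [0, 1]
        # are mapped to the corresponding boundary state.
        x = min(max(x, 0.0), 1.0)
        i = bisect.bisect_left(states, x)
        candidates = states[max(i - 1, 0):i + 1]
        return min(candidates, key=lambda s: abs(s - x))

    def bellman_update(x):
        # One asynchronous Bellman backup at a single sampled state.
        if x in (0.0, 1.0):
            return
        spread = sigma * dt ** 0.5  # two-point approximation of the noise
        best = float("inf")
        for u in (-1.0, 1.0):
            drift = x + u * dt
            expected_next = 0.5 * (value[nearest(drift + spread)]
                                   + value[nearest(drift - spread)])
            best = min(best, dt + expected_next)
        value[x] = best

    for _ in range(n_iters):
        x_new = rng.random()        # refine the discretization by one sample
        if x_new in value:
            continue
        bisect.insort(states, x_new)
        value[x_new] = 0.0
        bellman_update(x_new)
        # Update only a random subset of states instead of sweeping all of them.
        for x in rng.sample(states, min(updates_per_iter, len(states))):
            bellman_update(x)

    return states, value


if __name__ == "__main__":
    states, value = incremental_value_iteration()
    print(len(states), "sampled states; largest value estimate:",
          max(value.values()))
```

Because the loop can be stopped after any iteration and still return a value estimate on the current samples, the sketch also hints at the anytime character described in the abstract. It carries none of the convergence guarantees that the paper establishes.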
Pages: 2865-2872
Number of pages: 8
Related papers
50 in total
  • [1] An incremental sampling-based algorithm for stochastic optimal control
    Huynh, Vu Anh
    Karaman, Sertac
    Frazzoli, Emilio
    [J]. INTERNATIONAL JOURNAL OF ROBOTICS RESEARCH, 2016, 35 (04): 305 - 333
  • [2] Information-Theoretic Stochastic Optimal Control via Incremental Sampling-based Algorithms
    Arslan, Oktay
    Theodorou, Evangelos A.
    Tsiotras, Panagiotis
    [J]. 2014 IEEE SYMPOSIUM ON ADAPTIVE DYNAMIC PROGRAMMING AND REINFORCEMENT LEARNING (ADPRL), 2014, : 71 - 78
  • [3] Sampling-Based Nonlinear Stochastic Optimal Control for Neuromechanical Systems
    Reed, Emily A.
    Pereira, Marcus A.
    Valero-Cuevas, Francisco J.
    Theodorou, Evangelos A.
    [J]. 42ND ANNUAL INTERNATIONAL CONFERENCES OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY: ENABLING INNOVATIVE TECHNOLOGIES FOR GLOBAL HEALTHCARE EMBC'20, 2020, : 4694 - 4699
  • [4] Sampling-based Stochastic Optimal Control with Metric Interval Temporal Logic Specifications
    Montana, Felipe J.
    Liu, Jun
    Dodd, Tony J.
    [J]. 2016 IEEE CONFERENCE ON CONTROL APPLICATIONS (CCA), 2016,
  • [5] Provably near-optimal sampling-based policies for stochastic inventory control models
    Levi, Retsef
    Roundy, Robin O.
    Shmoys, David B.
    [J]. MATHEMATICS OF OPERATIONS RESEARCH, 2007, 32 (04) : 821 - 839
  • [6] Optimal Kinodynamic Motion Planning using Incremental Sampling-based Methods
    Karaman, Sertac
    Frazzoli, Emilio
    [J]. 49TH IEEE CONFERENCE ON DECISION AND CONTROL (CDC), 2010, : 7681 - 7687
  • [7] Incremental Sampling-based Algorithm for Minimum-violation Motion Planning
    Castro, Luis I. Reyes
    Chaudhari, Pratik
    Tumova, Jana
    Karaman, Sertac
    Frazzoli, Emilio
    Rus, Daniela
    [J]. 2013 IEEE 52ND ANNUAL CONFERENCE ON DECISION AND CONTROL (CDC), 2013, : 3217 - 3224
  • [8] Optimal sampling-based neural networks for uncertainty quantification and stochastic optimization
    Gupta, Subham
    Paudel, Achyut
    Thapa, Mishal
    Mulani, Sameer B.
    Walters, Robert W.
    [J]. AEROSPACE SCIENCE AND TECHNOLOGY, 2023, 133
  • [9] A Martingale Approach and Time-Consistent Sampling-based Algorithms for Risk Management in Stochastic Optimal Control
    Vu Anh Huynh
    Kogan, Leonid
    Frazzoli, Emilio
    [J]. 2014 IEEE 53RD ANNUAL CONFERENCE ON DECISION AND CONTROL (CDC), 2014, : 1858 - 1865
  • [10] Asymptotically-optimal Path Planning for Manipulation using Incremental Sampling-based Algorithms
    Perez, Alejandro
    Karaman, Sertac
    Shkolnik, Alexander
    Frazzoli, Emilio
    Teller, Seth
    Walter, Matthew R.
    [J]. 2011 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS, 2011, : 4307 - 4313