On the Time Discretization of the Feynman-Kac Forward-Backward Stochastic Differential Equations for Value Function Approximation

被引:4
|
作者
Hawkins, Kelsey P. [1 ]
Pakniyat, Ali [2 ]
Tsiotras, Panagiotis [3 ]
机构
[1] Toyota Res Inst, Ann Arbor, MI 48109 USA
[2] Univ Alabama, Fac Mech Engn, Tuscaloosa, AL USA
[3] Georgia Inst Technol, Fac Inst Robot & Intelligent Machines, Atlanta, GA 30332 USA
关键词
D O I
10.1109/CDC45484.2021.9683583
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Novel numerical estimators are proposed for the forward-backward stochastic differential equations (FBSDE) appearing in the Feynman-Kac representation of the value function. In contrast to the current numerical approaches based on discretization of the continuous-time FBSDE results, we propose a converse approach, by first obtaining a discrete-time approximation of the on-policy value function, and then developing a discrete-time result which resembles the continuous-time counterpart. This approach yields improved numerical estimators in the function approximation phase, and demonstrates enhanced error analysis for those value function estimators. Numerical results and error analysis are demonstrated on a scalar nonlinear stochastic optimal control problem, and they show improvements in the performance of the proposed estimators in comparison with the state-of-the-art methodologies.
引用
收藏
页码:892 / 897
页数:6
相关论文
共 50 条