Approximate Dynamic Programming Using Support Vector Regression

被引:5
|
作者
Bethke, Brett [1 ]
How, Jonathan P. [1 ]
Ozdaglar, Asuman [2 ]
机构
[1] MIT, Dept Aeronaut & Astronaut, Cambridge, MA 02139 USA
[2] MIT, Dept Elect Engn, Cambridge, MA 02139 USA
关键词
D O I
10.1109/CDC.2008.4739322
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper presents a new approximate policy iteration algorithm based on support vector regression (SVR). It provides an overview of commonly used cost approximation architectures in approximate dynamic programming problems, explains some difficulties encountered by these architectures, and argues that SVR-based architectures can avoid some of these difficulties. A key contribution of this paper is to present an extension of the SVR problem to carry out approximate policy iteration by forcing the Bellman error to zero at selected states. The algorithm does not require trajectory simulations to be performed and is able to utilize a rich set of basis functions in a computationally efficient way. Computational results for an example problem are shown.
引用
收藏
页码:3811 / 3816
页数:6
相关论文
共 50 条
  • [1] Sequential approximate optimization using support vector regression
    Lee, Yongbin
    Oh, Sangyup
    Park, Changhyun
    Choi, Dong-Hoon
    [J]. CJK-OSM 4: THE FOURTH CHINA-JAPAN-KOREA JOINT SYMPOSIUM ON OPTIMIZATION OF STRUCTURAL AND MECHANICAL SYSTEMS, 2006, : 171 - 176
  • [2] Dynamic Engine Modeling Through Linear Programming Support Vector Regression
    Lu, Zhao
    Sun, Jing
    Lee, Dongkyoung
    Butts, Ken
    [J]. 2009 AMERICAN CONTROL CONFERENCE, VOLS 1-9, 2009, : 2070 - +
  • [3] Solving a quadratic programming problem using support vector regression algorithm
    Cariola, Fernandez
    Beate, Cecilia
    Meza, Rosaura
    [J]. BULLETIN OF COMPUTATIONAL APPLIED MATHEMATICS, 2019, 7 (01): : 51 - 53
  • [4] Linear Programming Twin Support Vector Regression
    Tanveer, M.
    [J]. FILOMAT, 2017, 31 (07) : 2123 - 2142
  • [5] An Support Vector Regression Based Linear Programming
    Yu Jun
    Mao Beixing
    Meng Jintao
    [J]. 2011 AASRI CONFERENCE ON ARTIFICIAL INTELLIGENCE AND INDUSTRY APPLICATION (AASRI-AIIA 2011), VOL 2, 2011, : 133 - 135
  • [6] Dynamic load identification using support vector regression machine
    Yang, Jieming
    Li, Min
    Zhou, Chengzhao
    [J]. Zhendong Ceshi Yu Zhenduan/Journal of Vibration, Measurement and Diagnosis, 2006, 26 (SUPPL.): : 258 - 261
  • [7] Approximate Dynamic Programming Using Bellman Residual Elimination and Gaussian Process Regression
    Bethke, Brett
    How, Jonathan P.
    [J]. 2009 AMERICAN CONTROL CONFERENCE, VOLS 1-9, 2009, : 745 - +
  • [8] Choosing the Cost Vector of the Linear Programming Approach to Approximate Dynamic Programming
    de Farias, Daniela Pucci
    Weber, Theophane
    [J]. 47TH IEEE CONFERENCE ON DECISION AND CONTROL, 2008 (CDC 2008), 2008, : 67 - 72
  • [9] Multikernel semiparametric linear programming support vector regression
    Zhao, Yong-Ping
    Sun, Jian-Guo
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2011, 38 (03) : 1611 - 1618
  • [10] One norm linear programming support vector regression
    Tanveer, Mohammad
    Mangal, Mohit
    Ahmad, Izhar
    Shao, Yuan-Hai
    [J]. NEUROCOMPUTING, 2016, 173 : 1508 - 1518