Performance Evaluation of Direct Heuristic Dynamic Programming using Control-Theoretic Measures

Cited by: 11
Authors
Yang, Lei [1 ]
Si, Jennie [1 ]
Tsakalis, Konstantinos S. [1 ]
Rodriguez, Armando A. [1 ]
Affiliations
[1] Arizona State Univ, Dept Elect Engn, Tempe, AZ 85287 USA
Funding
U.S. National Science Foundation
Keywords
Approximate dynamic programming (ADP); Direct heuristic dynamic programming (direct HDP); Linear quadratic regulator (LQR); On-line learning control; Sensitivity and complementary sensitivity; ADAPTIVE CRITIC DESIGNS; LEARNING CONTROL; REINFORCEMENT;
DOI
10.1007/s10846-008-9307-5
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Approximate dynamic programming (ADP) has been widely studied from several important perspectives: algorithm development, learning efficiency measured by success or failure statistics, convergence rate, and learning error bounds. Given that many learning benchmarks used in ADP or reinforcement learning studies are control problems, it is important to examine the learning controllers from a control-theoretic perspective. This paper uses direct heuristic dynamic programming (direct HDP) and three typical benchmark examples to introduce a unique analytical framework that can be applied to other learning control paradigms and other complex control problems. Sensitivity analysis and linear quadratic regulator (LQR) design are used in the paper for two purposes: to quantify direct HDP performance and to provide guidance toward designing better learning controllers. The use of LQR, however, does not prevent direct HDP from serving as a learning controller for nonlinear dynamic system control problems. To this end, applications of direct HDP to nonlinear control problems beyond sensitivity analysis and the confines of LQR have been developed and compared, wherever appropriate, with an LQR design.
Pages: 177-201
Number of pages: 25
Related papers
  • [1] Performance Evaluation of Direct Heuristic Dynamic Programming using Control-Theoretic Measures
    Lei Yang
    Jennie Si
    Konstantinos S. Tsakalis
    Armando A. Rodriguez
    Journal of Intelligent and Robotic Systems, 2009, 55 : 177 - 201
  • [2] Performance analysis of direct heuristic dynamic programming using control-theoretic measures
    Yang, Lei
    Si, Jennie
    Tsakalis, Konstantinos S.
    Rodriguez, Armando A.
    2007 IEEE INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, VOLS 1-6, 2007, : 2503 - 2508
  • [3] Analyzing and enhancing direct NDP designs using a control-theoretic approach
    Yang, L
    Si, J
    Tsakalis, KS
    Rodriguez, AA
    PROCEEDINGS OF THE 2003 IEEE INTERNATIONAL SYMPOSIUM ON INTELLIGENT CONTROL, 2003, : 529 - 532
  • [4] Control-theoretic dynamic voltage scaling for embedded controllers
    Xia, F.
    Tian, Y.-C.
    Sun, Y.
    Dong, J.
    IET COMPUTERS AND DIGITAL TECHNIQUES, 2008, 2 (05): : 377 - 385
  • [5] Longitudinal Control of Hypersonic Vehicles Based on Direct Heuristic Dynamic Programming Using ANFIS
    Luo, Xiong
    Chen, Yi
    Si, Jennie
    Liu, Feng
    PROCEEDINGS OF THE 2014 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2014, : 3685 - 3692
  • [6] Control-Theoretic Dynamic Thermal Management of Automotive Electronics Control Units
    Park, Sangyoung
    Han, Soohee
    Chang, Naehyuck
    IEEE JOURNAL ON EMERGING AND SELECTED TOPICS IN CIRCUITS AND SYSTEMS, 2011, 1 (02) : 102 - 108
  • [7] Mitigating SIP Overload Using a Control-Theoretic Approach
    Hong, Yang
    Huang, Changcheng
    Yan, James
    2010 IEEE GLOBAL TELECOMMUNICATIONS CONFERENCE GLOBECOM 2010, 2010,
  • [8] Convergence of direct heuristic dynamic programming in power system stability control
    Lu, Chao
    Si, Jennie
    Xie, Xiaorong
    Song, Jie
    2007 IEEE INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, VOLS 1-6, 2007, : 908 - +
  • [9] Stability of Direct Heuristic Dynamic Programming for Nonlinear Tracking Control Using PID Neural Network
    Luo, Xiong
    Si, Jennie
    2013 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2013,
  • [10] A CONTROL-THEORETIC APPROACH TO RATE ADAPTATION FOR DYNAMIC HTTP STREAMING
    Zhou, Chao
    Zhang, Xinggong
    Huo, Longshe
    Guo, Zongming
    2012 IEEE VISUAL COMMUNICATIONS AND IMAGE PROCESSING (VCIP), 2012,