Approximate Dynamic Programming for Continuous State and Control Problems

被引:6
|
作者
Si, Jennie [1 ]
Yang, Lei [1 ]
Lu, Chao [2 ]
Sun, Jian [2 ]
Mei, Shengwei [2 ]
机构
[1] Arizona State Univ, Dept Elect Engn, Tempe, AZ 85287 USA
[2] Tsinghua Univ, Dept Elect Engn, Beijing 100084, Peoples R China
基金
中国国家自然科学基金;
关键词
Approximate Dynamic Programming (ADP); direct heuristic dynamic programming (direct HDP); nonlinear tracking control; Power system stability control; LEARNING CONTROL; REINFORCEMENT; SYSTEMS;
D O I
10.1109/MED.2009.5164745
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Dynamic programming (DP) is an approach to computing the optimal control policy over time under nonlinearity and uncertainty by employing the principle of optimality introduced by Richard Bellman. Instead of enumerating all possible control sequences, dynamic programming only searches admissible state and/or action values that satisfy the principle of optimality. Therefore, the computation complexity can be much improved over the direct enumeration method. However, the computational efforts and the data storage requirement increase exponentially with the dimensionality of the system, which are reflected in the three curses: the state space, the observation space, and the action space. Thus, the traditional DP approach was limited to solving small size problems. This paper aims at providing an overview of latest development of a class of approximate/adaptive dynamic programming algorithms including those applicable to continuous state and continuous control problems. The paper will especially review direct heuristic dynamic programming (direct (HDP), its design and applications, which include large and complex continuous state and control problems. In addition to the basic principle of direct HDP, the paper includes two application studies of the direct IMP - one is when it is used in a nonlinear tracking problem, and the other is on a power grid coordination control problem based on China southern network.
引用
收藏
页码:1415 / 1420
页数:6
相关论文
共 50 条
  • [1] Approximate dynamic programming for stochastic linear control problems on compact state spaces
    Woerner, Stefan
    Laumanns, Marco
    Zenklusen, Rico
    Fertis, Apostolos
    [J]. EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 2015, 241 (01) : 85 - 98
  • [2] Approximate Dynamic Programming for Building Control Problems with Occupant Interactions
    Lee, Donghwan
    Lee, Seungjae
    Karava, Panagiota
    Hu, Jianghai
    [J]. 2018 ANNUAL AMERICAN CONTROL CONFERENCE (ACC), 2018, : 3945 - 3950
  • [3] Approximate Dynamic Programming of Continuous Annealing process
    Zhang, Yingwei
    Guo, Chao
    Chen, Xue
    Teng, Yongdong
    [J]. 2009 IEEE INTERNATIONAL CONFERENCE ON AUTOMATION AND LOGISTICS ( ICAL 2009), VOLS 1-3, 2009, : 353 - 358
  • [4] Implementation of Dynamic Programming for Optimal Control Problems With Continuous States
    van Berkel, Koos
    de Jager, Bram
    Hofman, Theo
    Steinbuch, Maarten
    [J]. IEEE TRANSACTIONS ON CONTROL SYSTEMS TECHNOLOGY, 2015, 23 (03) : 1172 - 1179
  • [5] APPROXIMATE METHOD FOR SOLVING DYNAMIC PROGRAMMING PROBLEMS
    ALEKSEYE.OG
    [J]. ENGINEERING CYBERNETICS, 1971, 9 (03): : 447 - &
  • [6] Undiscounted control policy generation for continuous-valued optimal control by approximate dynamic programming
    Lock, Jonathan
    McKelvey, Tomas
    [J]. INTERNATIONAL JOURNAL OF CONTROL, 2022, 95 (10) : 2854 - 2864
  • [7] Approximate dynamic programming approach for process control
    Lee, Jay H.
    Wong, Weechin
    [J]. JOURNAL OF PROCESS CONTROL, 2010, 20 (09) : 1038 - 1048
  • [8] Approximate dynamic programming for ship course control
    Bai, Xuerui
    Yi, Jianqiang
    Zhao, Dongbin
    [J]. ADVANCES IN NEURAL NETWORKS - ISNN 2007, PT 1, PROCEEDINGS, 2007, 4491 : 349 - +
  • [9] Approximate Dynamic Programming for Stochastic Resource Allocation Problems
    Ali Forootani
    Raffaele Iervolino
    Massimo Tipaldi
    Joshua Neilson
    [J]. IEEE/CAA Journal of Automatica Sinica, 2020, 7 (04) : 975 - 990
  • [10] Approximate Dynamic Programming for Output Feedback Control
    Jiang Yu
    Jiang Zhong-Ping
    [J]. PROCEEDINGS OF THE 29TH CHINESE CONTROL CONFERENCE, 2010, : 5815 - 5820