Approximate Dynamic Programming for Continuous State and Control Problems

被引:6
|
作者
Si, Jennie [1 ]
Yang, Lei [1 ]
Lu, Chao [2 ]
Sun, Jian [2 ]
Mei, Shengwei [2 ]
机构
[1] Arizona State Univ, Dept Elect Engn, Tempe, AZ 85287 USA
[2] Tsinghua Univ, Dept Elect Engn, Beijing 100084, Peoples R China
基金
中国国家自然科学基金;
关键词
Approximate Dynamic Programming (ADP); direct heuristic dynamic programming (direct HDP); nonlinear tracking control; Power system stability control; LEARNING CONTROL; REINFORCEMENT; SYSTEMS;
D O I
10.1109/MED.2009.5164745
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Dynamic programming (DP) is an approach to computing the optimal control policy over time under nonlinearity and uncertainty by employing the principle of optimality introduced by Richard Bellman. Instead of enumerating all possible control sequences, dynamic programming only searches admissible state and/or action values that satisfy the principle of optimality. Therefore, the computation complexity can be much improved over the direct enumeration method. However, the computational efforts and the data storage requirement increase exponentially with the dimensionality of the system, which are reflected in the three curses: the state space, the observation space, and the action space. Thus, the traditional DP approach was limited to solving small size problems. This paper aims at providing an overview of latest development of a class of approximate/adaptive dynamic programming algorithms including those applicable to continuous state and continuous control problems. The paper will especially review direct heuristic dynamic programming (direct (HDP), its design and applications, which include large and complex continuous state and control problems. In addition to the basic principle of direct HDP, the paper includes two application studies of the direct IMP - one is when it is used in a nonlinear tracking problem, and the other is on a power grid coordination control problem based on China southern network.
引用
收藏
页码:1415 / 1420
页数:6
相关论文
共 50 条
  • [31] APPROXIMATE SOLUTIONS FOR CONTINUOUS-TIME QUADRATIC FRACTIONAL PROGRAMMING PROBLEMS
    Lur, Yung-Yih
    Ho, Wen-Hsien
    Lu, Tien-Hung
    Wen, Ching-Feng
    [J]. TAIWANESE JOURNAL OF MATHEMATICS, 2014, 18 (06): : 1791 - 1826
  • [32] Accelerated Continuous-Time Approximate Dynamic Programming via Data-Assisted Hybrid Control
    Ochoa, Daniel E.
    Poveda, Jorge, I
    [J]. IFAC PAPERSONLINE, 2022, 55 (12): : 561 - 566
  • [33] Approximate Dynamic Programming For Linear Systems with State and Input Constraints
    Chakrabarty, Ankush
    Quirynen, Rien
    Danielson, Claus
    Gao, Weinan
    [J]. 2019 18TH EUROPEAN CONTROL CONFERENCE (ECC), 2019, : 524 - 529
  • [34] Lattice point sets for state sampling in approximate dynamic programming
    Cervellera, Cristiano
    Gaggero, Mauro
    Maccio, Danilo
    [J]. OPTIMAL CONTROL APPLICATIONS & METHODS, 2017, 38 (06): : 1193 - 1207
  • [35] Biologically inspired scheme for continuous-time approximate dynamic programming
    Vrabie, Draguna
    Lewis, Frank
    Abu-Khalaf, Murad
    [J]. TRANSACTIONS OF THE INSTITUTE OF MEASUREMENT AND CONTROL, 2008, 30 (3-4) : 207 - 223
  • [36] APPROXIMATE SOLUTION OF CERTAIN PROBLEMS OF OPTIMAL CONTROL AND DISCRETE PROGRAMMING
    GABASHVI.NV
    LOMINADZ.NN
    CHKHAIDZ.LL
    [J]. ENGINEERING CYBERNETICS, 1972, 10 (06): : 1002 - 1011
  • [37] Approximate Dynamic Programming for Large-scale Unit Commitment Problems
    Long, Danli
    [J]. 10TH ASIA-PACIFIC POWER AND ENERGY ENGINEERING CONFERENCE (APPEEC 2018), 2018, : 353 - 362
  • [38] A METHOD FOR APPROXIMATE SOLUTIONS TO STOCHASTIC DYNAMIC PROGRAMMING PROBLEMS USING EXPECTATIONS
    NORMAN, JM
    WHITE, DJ
    [J]. OPERATIONS RESEARCH, 1968, 16 (02) : 296 - &
  • [39] An approximate dynamic programming approach to the admission control of elective patients
    Zhang, Jian
    Dridi, Mahjoub
    El Moudni, Abdellah
    [J]. COMPUTERS & OPERATIONS RESEARCH, 2021, 132
  • [40] Control of a networked microgrid system with an approximate dynamic programming approach
    Zhuo, Wenhao
    [J]. PROCEEDINGS OF THE 38TH CHINESE CONTROL CONFERENCE (CCC), 2019, : 6571 - 6576