Approximate Dynamic Programming for Continuous State and Control Problems

被引：6

作者：

Si, Jennie ^{[1
]}

Yang, Lei ^{[1
]}

Lu, Chao ^{[2
]}

Sun, Jian ^{[2
]}

Mei, Shengwei ^{[2
]}

机构：

[1] Arizona State Univ, Dept Elect Engn, Tempe, AZ 85287 USA

[2] Tsinghua Univ, Dept Elect Engn, Beijing 100084, Peoples R China

来源：

MED: 2009 17TH MEDITERRANEAN CONFERENCE ON CONTROL & AUTOMATION, VOLS 1-3 | 2009年

基金：

中国国家自然科学基金;

关键词：

Approximate Dynamic Programming (ADP); direct heuristic dynamic programming (direct HDP); nonlinear tracking control; Power system stability control; LEARNING CONTROL; REINFORCEMENT; SYSTEMS;

D O I：

10.1109/MED.2009.5164745

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Dynamic programming (DP) is an approach to computing the optimal control policy over time under nonlinearity and uncertainty by employing the principle of optimality introduced by Richard Bellman. Instead of enumerating all possible control sequences, dynamic programming only searches admissible state and/or action values that satisfy the principle of optimality. Therefore, the computation complexity can be much improved over the direct enumeration method. However, the computational efforts and the data storage requirement increase exponentially with the dimensionality of the system, which are reflected in the three curses: the state space, the observation space, and the action space. Thus, the traditional DP approach was limited to solving small size problems. This paper aims at providing an overview of latest development of a class of approximate/adaptive dynamic programming algorithms including those applicable to continuous state and continuous control problems. The paper will especially review direct heuristic dynamic programming (direct (HDP), its design and applications, which include large and complex continuous state and control problems. In addition to the basic principle of direct HDP, the paper includes two application studies of the direct IMP - one is when it is used in a nonlinear tracking problem, and the other is on a power grid coordination control problem based on China southern network.

引用

页码：1415 / 1420

页数：6

共 50 条

[11] Approximate dynamic programming approach for process control
Lee, Jay H.
[J]. INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION AND SYSTEMS (ICCAS 2010), 2010, : 459 - 464
[12] Approximate Dynamic Programming for Output Feedback Control
Jiang Yu
Jiang Zhong-Ping
[J]. PROCEEDINGS OF THE 29TH CHINESE CONTROL CONFERENCE, 2010, : 5815 - 5820
[13] Approximate dynamic programming for stochastic resource allocation problems
Forootani, Ali
Iervolino, Raffaele
Tipaldi, Massimo
Neilson, Joshua
[J]. IEEE-CAA JOURNAL OF AUTOMATICA SINICA, 2020, 7 (04) : 975 - 990
[14] State Aggregation based Linear Programming approach to Approximate Dynamic Programming
Darbha, S.
Krishnamoorthy, K.
Pachter, M.
Chandler, P.
[J]. 49TH IEEE CONFERENCE ON DECISION AND CONTROL (CDC), 2010, : 935 - 941
[15] On Implementation of Dynamic Programming for Optimal Control Problems with Final State Constraints
Sundstroem, O.
Ambuehl, D.
Guzzella, L.
[J]. OIL AND GAS SCIENCE AND TECHNOLOGY-REVUE D IFP ENERGIES NOUVELLES, 2010, 65 (01): : 91 - 102
[16] Approximate Dynamic Programming with Gaussian Processes for Optimal Control of Continuous-Time Nonlinear Systems
Beppu, Hirofumi
Maruta, Ichiro
Fujimoto, Kenji
[J]. IFAC PAPERSONLINE, 2020, 53 (02): : 6715 - 6722
[17] Generalized Dual Dynamic Programming for Infinite Horizon Problems in Continuous State and Action Spaces
Warrington, Joseph
Beuchat, Paul N.
Lygeros, John
[J]. IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2019, 64 (12) : 5012 - 5023
[18] Approximate dynamic programming for high dimensional resource allocation problems
Powell, WB
George, A
Bouzaiene-Ayari, B
Simao, HP
[J]. Proceedings of the International Joint Conference on Neural Networks (IJCNN), Vols 1-5, 2005, : 2989 - 2994
[19] Approximate dynamic programming for network recovery problems with stochastic demand
Ulusan, Aybike
Ergun, Ozlem
[J]. TRANSPORTATION RESEARCH PART E-LOGISTICS AND TRANSPORTATION REVIEW, 2021, 151
[20] An approximate dynamic programming approach to convex quadratic knapsack problems
Hua, ZS
Zhang, B
Liang, L
[J]. COMPUTERS & OPERATIONS RESEARCH, 2006, 33 (03) : 660 - 673

← 1 2 3 4 5 →