H∞ control with constrained input for completely unknown nonlinear systems using data-driven reinforcement learning method

被引：40

作者：

Jiang, He ^{[1
]}

Zhang, Huaguang ^{[1
]}

Luo, Yanhong ^{[1
]}

Cui, Xiaohong ^{[1
]}

机构：

[1] Northeastern Univ, Coll Informat Sci & Engn, Box 134, Shenyang 110819, Peoples R China

来源：

NEUROCOMPUTING | 2017年 / 237卷

基金：

中国国家自然科学基金;

关键词：

Reinforcement learning; Adaptive dynamic programming; Data-driven; Neural networks; OPTIMAL TRACKING CONTROL; DYNAMIC-PROGRAMMING ALGORITHM; DIFFERENTIAL GRAPHICAL GAMES; POLICY UPDATE ALGORITHM; ZERO-SUM GAME; FEEDBACK-CONTROL; CONTROL DESIGN; TIME-SYSTEMS; ITERATION; SYNCHRONIZATION;

D O I：

10.1016/j.neucom.2016.11.041

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

This paper investigates the H-infinity control problem for nonlinear systems with completely unknown dynamics and constrained control input by utilizing a novel data-driven reinforcement learning method. It is known that nonlinear H-infinity control problem relies on the solution of Hamilton-Jacobi-Isaacs (HJI) equation, which is essentially a nonlinear partial differential equation and generally impossible to be solved analytically. In order to overcome this difficulty, firstly, we propose a model-based simultaneoui policy update algorithm to learn the solution of HJI equation iteratively and provide its convergence proof. Then, based on this model-based method, we develop a data-driven model-free algorithm, which only requires the real system sampling data generated by arbitrary different control inputs and external disturbances instead of accurate system models, and prove that these two algorithms are equivalent. To implement this model-free algorithm, three neural networks (NNs) are employed to approximate the iterative performance index function, control policy and disturbance policy, respectively, and the least-square approach is used to minimize the NN approximation residual errors. Finally, the proposed scheme is tested on the rotational/translational actuator nonlinear system.

引用

页码：226 / 234

页数：9

共 50 条

[31] Security data-driven iterative learning control for unknown nonlinear systems with hybrid attacks and fading measurements
Yin, Yanling
Yu, Wei
Bu, Xuhui
Yu, Qiongxia
ISA TRANSACTIONS, 2022, 129 : 1 - 12
[32] A Trust-Region Method for Data-Driven Iterative Learning Control of Nonlinear Systems
Wang, Jia
Hemelhof, Leander
Markovsky, Ivan
Patrinos, Panagiotis
IEEE CONTROL SYSTEMS LETTERS, 2024, 8 : 1847 - 1852
[33] Direct Data-Driven Control of Constrained Systems
Piga, Dario
Formentin, Simone
Bemporad, Alberto
IEEE TRANSACTIONS ON CONTROL SYSTEMS TECHNOLOGY, 2018, 26 (04) : 1422 - 1429
[34] Data-Driven Dynamic Input Transfer for Learning Control in Multi-Agent Systems with Heterogeneous Unknown Dynamics
Lehmann, Dustin
Drebinger, Philipp
Seel, Thomas
Raisch, Joerg
2023 62ND IEEE CONFERENCE ON DECISION AND CONTROL, CDC, 2023, : 2358 - 2365
[35] Data-Driven Coordinated Control of AVR and PSS in Power Systems: A Deep Reinforcement Learning Method
Oshnoei, Arman
Sadeghian, Omid
Mohammadi-Ivatloo, Behnam
Blaabjerg, Frede
Anvari-Moghaddam, Amjad
2021 21ST IEEE INTERNATIONAL CONFERENCE ON ENVIRONMENT AND ELECTRICAL ENGINEERING AND 2021 5TH IEEE INDUSTRIAL AND COMMERCIAL POWER SYSTEMS EUROPE (EEEIC/I&CPS EUROPE), 2021,
[36] Constrained Data-Driven Controller Tuning for Nonlinear Systems
Radac, Mircea-Bogdan
Precup, Radu-Emil
Preitl, Stefan
Dragos, Claudia-Adina
Petriu, Emil M.
39TH ANNUAL CONFERENCE OF THE IEEE INDUSTRIAL ELECTRONICS SOCIETY (IECON 2013), 2013, : 3404 - 3409
[37] Data-driven finite-horizon optimal tracking control scheme for completely unknown discrete-time nonlinear systems
Song, Ruizhuo
Xie, Yulong
Zhang, Zenglian
NEUROCOMPUTING, 2019, 356 : 206 - 216
[38] Data-Driven Robust Control of Unknown MIMO Nonlinear System Subject to Input Saturations and Disturbances
Wang, Li
Gong, Huajun
Liu, Chunsheng
MATHEMATICAL PROBLEMS IN ENGINEERING, 2017, 2017
[39] Data-Driven Learning for H∞ Control of Adaptive Cruise Control Systems
Zhao, Jun
Wang, Zhangu
Lv, Yongfeng
Na, Jing
Liu, Congzhi
Zhao, Ziliang
IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2024, 73 (12) : 18348 - 18362
[40] Reinforcement Learning-Based Nearly Optimal Control for Constrained-Input Partially Unknown Systems Using Differentiator
Guo, Xinxin
Yan, Weisheng
Cui, Rongxin
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2020, 31 (11) : 4713 - 4725

← 1 2 3 4 5 →