H∞ control with constrained input for completely unknown nonlinear systems using data-driven reinforcement learning method

Cited by: 40
Authors
Jiang, He [1]
Zhang, Huaguang [1]
Luo, Yanhong [1]
Cui, Xiaohong [1]
Affiliations
[1] Northeastern Univ, Coll Informat Sci & Engn, Box 134, Shenyang 110819, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Reinforcement learning; Adaptive dynamic programming; Data-driven; Neural networks; OPTIMAL TRACKING CONTROL; DYNAMIC-PROGRAMMING ALGORITHM; DIFFERENTIAL GRAPHICAL GAMES; POLICY UPDATE ALGORITHM; ZERO-SUM GAME; FEEDBACK-CONTROL; CONTROL DESIGN; TIME-SYSTEMS; ITERATION; SYNCHRONIZATION;
DOI
10.1016/j.neucom.2016.11.041
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Subject classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
This paper investigates the H-infinity control problem for nonlinear systems with completely unknown dynamics and constrained control input by utilizing a novel data-driven reinforcement learning method. It is known that the nonlinear H-infinity control problem relies on the solution of the Hamilton-Jacobi-Isaacs (HJI) equation, which is essentially a nonlinear partial differential equation and is generally impossible to solve analytically. To overcome this difficulty, we first propose a model-based simultaneous policy update algorithm that learns the solution of the HJI equation iteratively, and we provide its convergence proof. Then, based on this model-based method, we develop a data-driven model-free algorithm, which requires only real system sampling data generated by arbitrary different control inputs and external disturbances instead of accurate system models, and we prove that the two algorithms are equivalent. To implement the model-free algorithm, three neural networks (NNs) are employed to approximate the iterative performance index function, control policy, and disturbance policy, respectively, and the least-squares approach is used to minimize the NN approximation residual errors. Finally, the proposed scheme is tested on the rotational/translational actuator nonlinear system.
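For orientation, the HJI equation mentioned in the abstract can be written out in the form commonly used in the constrained-input H-infinity literature. The block below is a sketch under stated assumptions, not the paper's own notation: the affine dynamics f, g, k, the output map h, the input bound \lambda, the diagonal weight R, and the attenuation level \gamma are all symbols introduced here for illustration.

```latex
% Assumed system (symbols are illustrative, not taken from the record):
%   \dot{x} = f(x) + g(x)u + k(x)w,  z = h(x),  |u_i| \le \lambda
% Nonquadratic input penalty commonly used to encode the bound |u_i| <= lambda,
% with R > 0 diagonal:
U(u) = 2 \int_{0}^{u} \lambda \left( \tanh^{-1}(v/\lambda) \right)^{\top} R \, \mathrm{d}v

% Value function of the associated two-player zero-sum game:
V(x(t)) = \int_{t}^{\infty} \left( h^{\top}h + U(u) - \gamma^{2} w^{\top} w \right) \mathrm{d}\tau

% HJI equation, with V(0) = 0:
0 = h^{\top}h + U(u^{*}) - \gamma^{2} w^{*\top} w^{*}
    + \nabla V^{\top} \left( f + g u^{*} + k w^{*} \right)

% Saddle-point policies obtained from the stationarity conditions:
u^{*} = -\lambda \tanh\!\left( \tfrac{1}{2\lambda} R^{-1} g^{\top} \nabla V \right),
\qquad
w^{*} = \tfrac{1}{2\gamma^{2}} k^{\top} \nabla V
```

Under these assumptions the tanh form keeps each component of the control within the bound \lambda, and substituting the two policies back into the HJI equation yields a nonlinear partial differential equation in V, which is why an iterative, data-based approximation is attractive.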
Pages: 226-234
Number of pages: 9
Related papers
50 in total
  • [31] Security data-driven iterative learning control for unknown nonlinear systems with hybrid attacks and fading measurements
    Yin, Yanling
    Yu, Wei
    Bu, Xuhui
    Yu, Qiongxia
    ISA TRANSACTIONS, 2022, 129 : 1 - 12
  • [32] A Trust-Region Method for Data-Driven Iterative Learning Control of Nonlinear Systems
    Wang, Jia
    Hemelhof, Leander
    Markovsky, Ivan
    Patrinos, Panagiotis
    IEEE CONTROL SYSTEMS LETTERS, 2024, 8 : 1847 - 1852
  • [33] Direct Data-Driven Control of Constrained Systems
    Piga, Dario
    Formentin, Simone
    Bemporad, Alberto
    IEEE TRANSACTIONS ON CONTROL SYSTEMS TECHNOLOGY, 2018, 26 (04) : 1422 - 1429
  • [34] Data-Driven Dynamic Input Transfer for Learning Control in Multi-Agent Systems with Heterogeneous Unknown Dynamics
    Lehmann, Dustin
    Drebinger, Philipp
    Seel, Thomas
    Raisch, Joerg
    2023 62ND IEEE CONFERENCE ON DECISION AND CONTROL, CDC, 2023, : 2358 - 2365
  • [35] Data-Driven Coordinated Control of AVR and PSS in Power Systems: A Deep Reinforcement Learning Method
    Oshnoei, Arman
    Sadeghian, Omid
    Mohammadi-Ivatloo, Behnam
    Blaabjerg, Frede
    Anvari-Moghaddam, Amjad
    2021 21ST IEEE INTERNATIONAL CONFERENCE ON ENVIRONMENT AND ELECTRICAL ENGINEERING AND 2021 5TH IEEE INDUSTRIAL AND COMMERCIAL POWER SYSTEMS EUROPE (EEEIC/I&CPS EUROPE), 2021,
  • [36] Constrained Data-Driven Controller Tuning for Nonlinear Systems
    Radac, Mircea-Bogdan
    Precup, Radu-Emil
    Preitl, Stefan
    Dragos, Claudia-Adina
    Petriu, Emil M.
    39TH ANNUAL CONFERENCE OF THE IEEE INDUSTRIAL ELECTRONICS SOCIETY (IECON 2013), 2013, : 3404 - 3409
  • [37] Data-driven finite-horizon optimal tracking control scheme for completely unknown discrete-time nonlinear systems
    Song, Ruizhuo
    Xie, Yulong
    Zhang, Zenglian
    NEUROCOMPUTING, 2019, 356 : 206 - 216
  • [38] Data-Driven Robust Control of Unknown MIMO Nonlinear System Subject to Input Saturations and Disturbances
    Wang, Li
    Gong, Huajun
    Liu, Chunsheng
    MATHEMATICAL PROBLEMS IN ENGINEERING, 2017, 2017
  • [39] Data-Driven Learning for H∞ Control of Adaptive Cruise Control Systems
    Zhao, Jun
    Wang, Zhangu
    Lv, Yongfeng
    Na, Jing
    Liu, Congzhi
    Zhao, Ziliang
    IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2024, 73 (12) : 18348 - 18362
  • [40] Reinforcement Learning-Based Nearly Optimal Control for Constrained-Input Partially Unknown Systems Using Differentiator
    Guo, Xinxin
    Yan, Weisheng
    Cui, Rongxin
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2020, 31 (11) : 4713 - 4725