Optimal Synchronization Control of Multiagent Systems With Input Saturation via Off-Policy Reinforcement Learning

被引:102
|
作者
Qin, Jiahu [1 ]
Li, Man [1 ]
Shi, Yang [2 ]
Ma, Qichao [1 ]
Zheng, Wei Xing [3 ]
机构
[1] Univ Sci & Technol China, Dept Automat, Hefei 230027, Anhui, Peoples R China
[2] Univ Victoria, Dept Mech Engn, Victoria, BC V8W 2Y2, Canada
[3] Western Sydney Univ, Sch Comp Engn & Math, Sydney, NSW 2751, Australia
基金
澳大利亚研究理事会; 中国国家自然科学基金;
关键词
Input saturation; multiagent systems; neural networks (NNs); off-policy reinforcement learning (RL); optimal synchronization control; LINEAR-SYSTEMS; NONLINEAR-SYSTEMS; NETWORKS; GAMES;
D O I
10.1109/TNNLS.2018.2832025
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we aim to investigate the optimal synchronization problem for a group of generic linear systems with input saturation. To seek the optimal controller, Hamilton Jacobi-Bellman (HJB) equations involving nonquadratic input energy terms in coupled forms are established. The solutions to these coupled HJB equations are further proven to be optimal and the induced controllers constitute interactive Nash equilibrium. Due to the difficulty to analytically solve HJB equations, especially in coupled forms, and the possible lack of model information of the systems, we apply the data-based off-policy reinforcement learning algorithm to learn the optimal control policies. A byproduct of this off-policy algorithm is shown that it is insensitive to probing noise that is exerted to the system to maintain persistence of excitation condition. In order to implement this off-policy algorithm, we employ actor and critic neural networks to approximate the controllers and the cost functions. Furthermore, the estimated control policies obtained by this presented implementation are proven to converge to the optimal ones under certain conditions. Finally, an illustrative example is provided to verify the effectiveness of the proposed algorithm.
引用
收藏
页码:85 / 96
页数:12
相关论文
共 50 条
  • [1] Off-Policy Reinforcement Learning for Synchronization in Multiagent Graphical Games
    Li, Jinna
    Modares, Hamidreza
    Chai, Tianyou
    Lewis, Frank L.
    Xie, Lihua
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2017, 28 (10) : 2434 - 2445
  • [2] Adaptive Output Synchronization With Designated Convergence Rate of Multiagent Systems Based on Off-Policy Reinforcement Learning
    Huang, Chengjie
    Chen, Ci
    Xie, Kan
    Li, Zhenni
    Xie, Shengli
    IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2024, 54 (08): : 4667 - 4678
  • [3] Optimal Control of Iron-Removal Systems Based on Off-Policy Reinforcement Learning
    Chen, Ning
    Luo, Shuhan
    Dai, Jiayang
    Luo, Biao
    Gui, Weihua
    IEEE ACCESS, 2020, 8 (08): : 149730 - 149740
  • [4] Off-policy reinforcement learning algorithm for robust optimal control of uncertain nonlinear systems
    Amirparast, Ali
    Kamal Hosseini Sani, S.
    INTERNATIONAL JOURNAL OF ROBUST AND NONLINEAR CONTROL, 2024, 34 (08) : 5419 - 5437
  • [5] Optimal Control for Multi-agent Systems Using Off-Policy Reinforcement Learning
    Wang, Hao
    Chen, Zhiru
    Wang, Jun
    Lu, Lijun
    Li, Mingzhe
    2022 4TH INTERNATIONAL CONFERENCE ON CONTROL AND ROBOTICS, ICCR, 2022, : 135 - 140
  • [6] Optimal model-free output synchronization of heterogeneous systems using off-policy reinforcement learning
    Modares, Hamidreza
    Nageshrao, Subramanya P.
    Lopes, Gabriel A. Delgado
    Babuska, Robert
    Lewis, Frank L.
    AUTOMATICA, 2016, 71 : 334 - 341
  • [7] Optimal Robust Control of Nonlinear Uncertain System via Off-Policy Integral Reinforcement Learning
    Wang, Xiaoyang
    Ye, Xiufen
    PROCEEDINGS OF THE 39TH CHINESE CONTROL CONFERENCE, 2020, : 1928 - 1933
  • [8] Online Off-Policy Reinforcement Learning for Optimal Control of Unknown Nonlinear Systems Using Neural Networks
    Zhu, Liao
    Wei, Qinglai
    Guo, Ping
    IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2024, 54 (08): : 5112 - 5122
  • [9] H∞ Optimal Distributed Tracking Control of Network Distributed Systems over Directed Networks via Off-Policy Reinforcement Learning
    Kucuksayacigil, Gulnihal
    2023 EUROPEAN CONTROL CONFERENCE, ECC, 2023,
  • [10] Optimal Robust Output Containment of Unknown Heterogeneous Multiagent System Using Off-Policy Reinforcement Learning
    Zuo, Shan
    Song, Yongduan
    Lewis, Frank L.
    Davoudi, Ali
    IEEE TRANSACTIONS ON CYBERNETICS, 2018, 48 (11) : 3197 - 3207