Reinforcement Learning-Based Cooperative Optimal Output Regulation via Distributed Adaptive Internal Model

被引:62
|
作者
Gao, Weinan [1 ]
Mynuddin, Mohammed [2 ]
Wunsch, Donald C. [3 ]
Jiang, Zhong-Ping [4 ]
机构
[1] Florida Inst Technol, Coll Engn & Sci, Dept Mech & Civil Engn, Melbourne, FL 32901 USA
[2] Univ Cent Florida, Dept Civil Environm & Construct Engn Major Transp, Orlando, FL 32816 USA
[3] Missouri Univ Sci & Technol Missouri S&T, Dept Comp Engn, Rolla, MO 65409 USA
[4] NYU, Tandon Sch Engn, Dept Elect & Comp Engn, Brooklyn, NY 11201 USA
基金
美国国家科学基金会;
关键词
Regulation; Adaptation models; Power system dynamics; Multi-agent systems; Vehicle dynamics; Symmetric matrices; Optimal control; Adaptive optimal control; cooperative output regulation; distributed adaptive internal model; reinforcement learning; MULTIAGENT SYSTEMS; LINEAR-SYSTEMS; LEADER; PRINCIPLE; ITERATION; DESIGN;
D O I
10.1109/TNNLS.2021.3069728
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this article, a data-driven distributed control method is proposed to solve the cooperative optimal output regulation problem of leader-follower multiagent systems. Different from traditional studies on cooperative output regulation, a distributed adaptive internal model is originally developed, which includes a distributed internal model and a distributed observer to estimate the leader's dynamics. Without relying on the dynamics of multiagent systems, we have proposed two reinforcement learning algorithms, policy iteration and value iteration, to learn the optimal controller through online input and state data, and estimated values of the leader's state. By combining these methods, we have established a basis for connecting data-distributed control methods with adaptive dynamic programming approaches in general since these are the theoretical foundation from which they are built.
引用
收藏
页码:5229 / 5240
页数:12
相关论文
共 50 条
  • [21] A reinforcement learning-based scheme for adaptive optimal control of linear stochastic systems
    Wong, Wee Chin
    Lee, Jay H.
    [J]. 2008 AMERICAN CONTROL CONFERENCE, VOLS 1-12, 2008, : 57 - 62
  • [22] Reinforcement Learning-Based Adaptive Optimal Control for Nonlinear Systems With Asymmetric Hysteresis
    Zheng, Licheng
    Liu, Zhi
    Wang, Yaonan
    Chen, C. L. Philip
    Zhang, Yun
    Wu, Zongze
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, 35 (11) : 1 - 10
  • [23] Deep learning-based output tracking via regulation and contraction theory
    Zoboli, Samuele
    Janny, Steeven
    Giaccagli, Mattia
    [J]. IFAC PAPERSONLINE, 2023, 56 (02): : 8111 - 8116
  • [24] Cooperative Model-Based Reinforcement Learning for Approximate Optimal Tracking
    Greene, Max L.
    Bell, Zachary, I
    Nivison, Scott A.
    How, Jonathan P.
    Dixon, Warren E.
    [J]. 2021 AMERICAN CONTROL CONFERENCE (ACC), 2021, : 1973 - 1978
  • [25] Distributed Observer-Based Optimal Cooperative Output Regulation With Control Effectiveness Faults
    Chen, Zitao
    Zhong, Weifeng
    Xie, Shengli
    Zhang, Yun
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS II-EXPRESS BRIEFS, 2024, 71 (03) : 1361 - 1365
  • [26] Model-based reinforcement learning for approximate optimal regulation
    Kamalapurkar, Rushikesh
    Walters, Patrick
    Dixon, Warren E.
    [J]. AUTOMATICA, 2016, 64 : 94 - 104
  • [27] Adaptive Optimal Output Regulation of Discrete-Time Linear Systems: A Reinforcement Learning Approach
    Chakraborty, Sayan
    Gao, Weinan
    Vamvoudakis, Kyriakos G.
    Jiang, Zhong-Ping
    [J]. 2023 62ND IEEE CONFERENCE ON DECISION AND CONTROL, CDC, 2023, : 7950 - 7955
  • [28] Data-driven cooperative optimal output regulation for linear discrete-time multi-agent systems by online distributed adaptive internal model approach
    Xie, Kedi
    Jiang, Yi
    Yu, Xiao
    Lan, Weiyao
    [J]. SCIENCE CHINA-INFORMATION SCIENCES, 2023, 66 (07)
  • [29] Data-driven cooperative optimal output regulation for linear discrete-time multi-agent systems by online distributed adaptive internal model approach
    Kedi XIE
    Yi JIANG
    Xiao YU
    Weiyao LAN
    [J]. Science China(Information Sciences), 2023, 66 (07) : 30 - 45
  • [30] Reinforcement Learning-Based Adaptive Operator Selection
    Durgut, Rafet
    Aydin, Mehmet Emin
    [J]. OPTIMIZATION AND LEARNING, OLA 2021, 2021, 1443 : 29 - 41