Reinforcement Learning-Based Cooperative Optimal Output Regulation via Distributed Adaptive Internal Model

Cited by: 62
Authors
Gao, Weinan [1]
Mynuddin, Mohammed [2]
Wunsch, Donald C. [3]
Jiang, Zhong-Ping [4]
Affiliations
[1] Florida Inst Technol, Coll Engn & Sci, Dept Mech & Civil Engn, Melbourne, FL 32901 USA
[2] Univ Cent Florida, Dept Civil Environm & Construct Engn Major Transp, Orlando, FL 32816 USA
[3] Missouri Univ Sci & Technol Missouri S&T, Dept Comp Engn, Rolla, MO 65409 USA
[4] NYU, Tandon Sch Engn, Dept Elect & Comp Engn, Brooklyn, NY 11201 USA
Funding
U.S. National Science Foundation
Keywords
Regulation; Adaptation models; Power system dynamics; Multi-agent systems; Vehicle dynamics; Symmetric matrices; Optimal control; Adaptive optimal control; cooperative output regulation; distributed adaptive internal model; reinforcement learning; MULTIAGENT SYSTEMS; LINEAR-SYSTEMS; LEADER; PRINCIPLE; ITERATION; DESIGN;
DOI
10.1109/TNNLS.2021.3069728
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
In this article, a data-driven distributed control method is proposed to solve the cooperative optimal output regulation problem of leader-follower multiagent systems. Unlike traditional studies on cooperative output regulation, a distributed adaptive internal model is developed, which comprises a distributed internal model and a distributed observer that estimates the leader's dynamics. Without relying on knowledge of the multiagent system dynamics, two reinforcement learning algorithms, policy iteration and value iteration, are proposed to learn the optimal controller from online input and state data together with estimated values of the leader's state. Combining these methods establishes a basis for connecting data-driven distributed control methods with adaptive dynamic programming approaches, which are the theoretical foundation on which such methods are built.
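As a rough illustration of the policy iteration idea named in the abstract, the sketch below runs the classical model-based (Kleinman) policy iteration for a single-agent continuous-time LQR problem. It is not the article's algorithm: the article's methods are data-driven and distributed, whereas this sketch assumes the matrices A and B are known and ignores the leader, the observer, and the internal model. The function names `lyap` and `policy_iteration` and the double-integrator example are assumptions chosen for illustration.

```python
import numpy as np

def lyap(Ac, M):
    """Solve Ac.T @ P + P @ Ac = -M for P using Kronecker products (small n only)."""
    n = Ac.shape[0]
    I = np.eye(n)
    # Row-major vectorization: vec(Ac.T P) = kron(Ac.T, I) vec(P),
    #                          vec(P Ac)   = kron(I, Ac.T) vec(P)
    L = np.kron(Ac.T, I) + np.kron(I, Ac.T)
    P = np.linalg.solve(L, -M.reshape(-1)).reshape(n, n)
    return 0.5 * (P + P.T)  # symmetrize against round-off

def policy_iteration(A, B, Q, R, K0, tol=1e-10, max_iter=50):
    """Model-based policy iteration; K0 must make A - B @ K0 Hurwitz."""
    K = K0
    for _ in range(max_iter):
        Ac = A - B @ K
        P = lyap(Ac, Q + K.T @ R @ K)         # policy evaluation
        K_next = np.linalg.solve(R, B.T @ P)  # policy improvement
        if np.linalg.norm(K_next - K) < tol:
            return P, K_next
        K = K_next
    return P, K

# Hypothetical example: double integrator with Q = I, R = 1.
A = np.array([[0.0, 1.0], [0.0, 0.0]])
B = np.array([[0.0], [1.0]])
Q = np.eye(2)
R = np.array([[1.0]])
K0 = np.array([[1.0, 1.0]])  # stabilizing initial gain (assumed given)

P, K = policy_iteration(A, B, Q, R, K0)
# For this example the Riccati equation has the closed-form solution
# P* = [[sqrt(3), 1], [1, sqrt(3)]] and K* = [1, sqrt(3)].
```

Each pass evaluates the current gain by solving a Lyapunov equation and then improves it. A data-driven variant would replace the Lyapunov solve with a least-squares fit on measured input and state trajectories, which this sketch does not attempt.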
Pages: 5229-5240
Number of pages: 12