Optimal robust formation control for heterogeneous multi-agent systems based on reinforcement learning

被引:31
|
作者
Yan, Bing [1 ]
Shi, Peng [1 ]
Lim, Cheng-Chew [1 ]
Shi, Zhiyuan [2 ]
机构
[1] Univ Adelaide, Sch Elect & Elect Engn, Adelaide, SA 5005, Australia
[2] Northwestern Polytech Univ, Sch Automat, Xian, Peoples R China
基金
澳大利亚研究理事会;
关键词
adaptive observer; heterogeneous multi-agent systems; reinforcement learning; robust formation control; COOPERATIVE OUTPUT REGULATION; ADAPTIVE OPTIMAL-CONTROL; CONTINUOUS-TIME SYSTEMS; TRACKING CONTROL; COLLISION-AVOIDANCE; LINEAR-SYSTEMS; POLICY; CONSENSUS;
D O I
10.1002/rnc.5828
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this article, a reinforcement learning (RL)-based robust control strategy is proposed for uncertain heterogeneous multi-agent systems to achieve optimal collision-free time-varying formations. Without using any global information, a fully distributed adaptive observer is developed to estimate both dynamics and states of the reference and disturbance systems. The observer parameters are found by an observed model-based or a model-free off-policy RL algorithm. Using the internal model principle, a novel optimal robust formation control strategy is developed based on another proposed off-policy RL algorithm. The algorithm addresses the nonquadratic optimization problem when the system model is completely unknown. Taking the bushfire edge tracking and patrolling task for an unmanned aerial vehicle-unmanned ground vehicle heterogeneous system as an example, the effectiveness and robustness of the developed control strategy are verified by simulations.
引用
收藏
页码:2683 / 2704
页数:22
相关论文
共 50 条
  • [21] Formation Control using Simplified Reinforcement Learning for Multi-agent systems with State Delay
    Shao, Wentai
    Chen, Yutao
    Huang, Jie
    [J]. 2021 PROCEEDINGS OF THE 40TH CHINESE CONTROL CONFERENCE (CCC), 2021, : 2269 - 2274
  • [22] Multi-Agent Reinforcement Learning Based Actuator Control for EV HVAC Systems
    Joo, Sungho
    Lee, Dongmin
    Kim, Minseop
    Lee, Taeho
    Choi, Sanghyeok
    Kim, Seungju
    Lee, Jeyeol
    Kim, Joongjae
    Lim, Yongsub
    Lee, Jeonghoon
    [J]. IEEE ACCESS, 2023, 11 : 7574 - 7587
  • [23] The Formation Control of Mobile Autonomous Multi-Agent Systems Using Deep Reinforcement Learning
    Liu, Qishuai
    Hui, Qing
    [J]. 2019 13TH ANNUAL IEEE INTERNATIONAL SYSTEMS CONFERENCE (SYSCON), 2019,
  • [24] Cooperative robust optimal control of uncertain multi-agent systems
    Zhang, Zhuo
    Zhang, Shouxu
    Li, Huiping
    Yan, Weisheng
    [J]. JOURNAL OF THE FRANKLIN INSTITUTE-ENGINEERING AND APPLIED MATHEMATICS, 2020, 357 (14): : 9467 - 9483
  • [25] Output feedback reinforcement learning based optimal output synchronisation of heterogeneous discrete-time multi-agent systems
    Rizvi, Syed Ali Asad
    Lin, Zongli
    [J]. IET CONTROL THEORY AND APPLICATIONS, 2019, 13 (17): : 2866 - 2876
  • [26] Multi-agent Reinforcement Learning for Control Systems: Challenges and Proposals
    Grana, Manuel
    Fernandez-Gauna, Borja
    [J]. INTELLIGENT DATA ENGINEERING AND AUTOMATED LEARNING - IDEAL 2015, 2015, 9375 : 18 - 25
  • [27] Reinforcement Learning Approach for Cooperative Control of Multi-Agent Systems
    Javalera-Rincon, Valeria
    Puig Cayuela, Vicenc
    Morcego Seix, Bernardo
    Orduna-Cabrera, Fernando
    [J]. PROCEEDINGS OF THE 11TH INTERNATIONAL CONFERENCE ON AGENTS AND ARTIFICIAL INTELLIGENCE (ICAART), VOL 2, 2019, : 80 - 91
  • [28] Formation Control for Heterogeneous Multi-agent Systems with Communication Delays Based on Impulsive Control
    Jia X.
    Xi J.-X.
    Liu G.-B.
    Du B.-Y.
    [J]. 2018, Chinese Institute of Electronics (46): : 2957 - 2963
  • [29] Multi-Agent Reinforcement Learning Based Optimal PV-ESS Control In Grid
    Park, Jaemin
    Kwon, Taehyeon
    Kim, Bongseok
    Hwang, Yujeong
    Sim, Min Kyu
    [J]. 2023 IEEE IAS GLOBAL CONFERENCE ON RENEWABLE ENERGY AND HYDROGEN TECHNOLOGIES, GLOBCONHT, 2023,
  • [30] Formation control of multi-agent systems with heterogeneous communication delays
    Liu, Cheng-Lin
    Tian, Yu-Ping
    [J]. INTERNATIONAL JOURNAL OF SYSTEMS SCIENCE, 2009, 40 (06) : 627 - 636