Resilient adaptive optimal control of distributed multi-agent systems using reinforcement learning

被引:16
|
作者
Moghadam, Rohollah [1 ]
Modares, Hamidreza [2 ]
机构
[1] Missouri Univ Sci, Dept Elect & Comp Engn, Technol, Emerson Hall,301 W 16th St, Rolla, MO 65409 USA
[2] Michigan State Univ, Dept Mech Engn, E Lansing, MI 48824 USA
来源
IET CONTROL THEORY AND APPLICATIONS | 2018年 / 12卷 / 16期
关键词
game theory; Riccati equations; adaptive control; learning (artificial intelligence); multi-agent systems; distributed control; H control; resilient adaptive optimal control; distributed multiagent systems; model-free reinforcement learning; control protocol; leader-follower multiagent systems; optimal control protocols; adversarial input; adverse effects; learning outcome; intact agents; unified RL-based; control frameworks; corrupted sensory data; actual sensory information; leader state; distributed observer; compromised agent; off-policy RL algorithm; control problem; H control problem; ZERO-SUM GAMES; H-INFINITY; CONTINUOUS-TIME; TRACKING CONTROL; CONSENSUS CONTROL; FEEDBACK-CONTROL; DESIGN; AGENTS; SYNCHRONIZATION; ALGORITHM;
D O I
10.1049/iet-cta.2018.0029
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This study presents a unified resilient model-free reinforcement learning (RL) based distributed control protocol for leader-follower multi-agent systems. Although RL has been successfully used to learn optimal control protocols for multi-agent systems, the effects of adversarial inputs are ignored. It is shown in this study, however, that their adverse effects can propagate across the network and impact the learning outcome of other intact agents. To alleviate this problem, a unified RL-based distributed control frameworks is developed for both homogeneous and heterogeneous multi-agent systems to prevent corrupted sensory data from propagating across the network. To this end, only the leader communicates its actual sensory information and other agents estimate the leader' state using a distributed observer and communicate this estimation to their neighbours to achieve consensus on the leader state. The observer cannot be physically affected by any adversarial input. To further improve resiliency, distributed H8 control protocols are designed to attenuate the effect of the adversarial inputs on the compromised agent itself. An off-policy RL algorithm is developed to learn the solutions of the game algebraic Riccati equations arising from solving the H8 control problem. No knowledge of the agent's dynamics is required and it is shown that the proposed RL-based H8 control protocol is resilient against adversarial inputs.
引用
收藏
页码:2165 / 2174
页数:10
相关论文
共 50 条
  • [1] Distributed optimal control of nonlinear multi-agent systems based on integral reinforcement learning
    Xu, Ying
    Li, Kewen
    Li, Yongming
    [J]. OPTIMAL CONTROL APPLICATIONS & METHODS, 2024, : 2596 - 2612
  • [2] Adaptive resilient containment control using reinforcement learning for nonlinear stochastic multi-agent systems under sensor faults
    Mo, Guanzong
    Lyu, Yixin
    [J]. PEERJ COMPUTER SCIENCE, 2024, 10
  • [3] Optimal control in microgrid using multi-agent reinforcement learning
    Li, Fu-Dong
    Wu, Min
    He, Yong
    Chen, Xin
    [J]. ISA TRANSACTIONS, 2012, 51 (06) : 743 - 751
  • [4] Optimal Control for Multi-agent Systems Using Off-Policy Reinforcement Learning
    Wang, Hao
    Chen, Zhiru
    Wang, Jun
    Lu, Lijun
    Li, Mingzhe
    [J]. 2022 4TH INTERNATIONAL CONFERENCE ON CONTROL AND ROBOTICS, ICCR, 2022, : 135 - 140
  • [5] Cooperative Multi-Agent Systems Using Distributed Reinforcement Learning Techniques
    Zemzem, Wiem
    Tagina, Moncef
    [J]. KNOWLEDGE-BASED AND INTELLIGENT INFORMATION & ENGINEERING SYSTEMS (KES-2018), 2018, 126 : 517 - 526
  • [6] Distributed reinforcement learning in multi-agent decision systems
    Giráldez, JI
    Borrajo, D
    [J]. PROGRESS IN ARTIFICIAL INTELLIGENCE-IBERAMIA 98, 1998, 1484 : 148 - 159
  • [7] Distributed output formation tracking control of heterogeneous multi-agent systems using reinforcement learning
    Shi, Yu
    Dong, Xiwang
    Hua, Yongzhao
    Yu, Jianglong
    Ren, Zhang
    [J]. ISA TRANSACTIONS, 2023, 138 : 318 - 328
  • [8] Reinforcement Learning for Multi-Agent Systems with an Application to Distributed Predictive Cruise Control
    Mynuddin, Mohammed
    Gao, Weinan
    Jiang, Zhong-Ping
    [J]. 2020 AMERICAN CONTROL CONFERENCE (ACC), 2020, : 315 - 320
  • [9] Reinforcement learning intermittent optimal formation control for multi-agent systems with disturbances
    Liu, Erliang
    Miao, Guoying
    Hu, Jingyu
    [J]. MEASUREMENT SCIENCE AND TECHNOLOGY, 2024, 35 (12)
  • [10] Distributed Transmission Control for Wireless Networks using Multi-Agent Reinforcement Learning
    Farquhar, Collin
    Kumar, Prem
    Jagannath, Anu
    Jagannath, Jithin
    [J]. BIG DATA IV: LEARNING, ANALYTICS, AND APPLICATIONS, 2022, 12097