Attention-Based Fault-Tolerant Approach for Multi-Agent Reinforcement Learning Systems

被引:8
|
作者
Gu, Shanzhi [1 ]
Geng, Mingyang [1 ]
Lan, Long [2 ]
机构
[1] Natl Univ Def Technol, Coll Comp, Changsha 410073, Peoples R China
[2] Natl Univ Def Technol, High Performance Comp Lab, Changsha 410073, Peoples R China
基金
中国国家自然科学基金;
关键词
reinforcement learning; attention mechanism; fault tolerance; multi-agent;
D O I
10.3390/e23091133
中图分类号
O4 [物理学];
学科分类号
0702 ;
摘要
The aim of multi-agent reinforcement learning systems is to provide interacting agents with the ability to collaboratively learn and adapt to the behavior of other agents. Typically, an agent receives its private observations providing a partial view of the true state of the environment. However, in realistic settings, the harsh environment might cause one or more agents to show arbitrarily faulty or malicious behavior, which may suffice to allow the current coordination mechanisms fail. In this paper, we study a practical scenario of multi-agent reinforcement learning systems considering the security issues in the presence of agents with arbitrarily faulty or malicious behavior. The previous state-of-the-art work that coped with extremely noisy environments was designed on the basis that the noise intensity in the environment was known in advance. However, when the noise intensity changes, the existing method has to adjust the configuration of the model to learn in new environments, which limits the practical applications. To overcome these difficulties, we present an Attention-based Fault-Tolerant (FT-Attn) model, which can select not only correct, but also relevant information for each agent at every time step in noisy environments. The multihead attention mechanism enables the agents to learn effective communication policies through experience concurrent with the action policies. Empirical results showed that FT-Attn beats previous state-of-the-art methods in some extremely noisy environments in both cooperative and competitive scenarios, much closer to the upper-bound performance. Furthermore, FT-Attn maintains a more general fault tolerance ability and does not rely on the prior knowledge about the noise intensity of the environment.
引用
收藏
页数:15
相关论文
共 50 条
  • [1] Multi-agent Fault-tolerant Reinforcement Learning with Noisy Environments
    Luo, Canhui
    Liu, Xuan
    Chen, Xinning
    Luo, Juan
    [J]. 2020 IEEE 26TH INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED SYSTEMS (ICPADS), 2020, : 164 - 171
  • [2] Learning-based Design of Fault-tolerant Cooperative Multi-agent Systems
    Dai, Jin
    Lin, Hai
    [J]. 2015 AMERICAN CONTROL CONFERENCE (ACC), 2015, : 1929 - 1934
  • [3] Formal Specification of Fault-Tolerant Multi-agent Systems
    Troubitsyna, Elena
    [J]. ADVANCES IN PRACTICAL APPLICATIONS OF AGENTS, MULTI-AGENT SYSTEMS, AND SOCIAL GOOD: THE PAAMS COLLECTION, PAAMS 2021, 2021, 12946 : 291 - 302
  • [4] Fault-tolerant cooperative tasking for multi-agent systems
    Karimadini, Mohammad
    Lin, Hai
    [J]. INTERNATIONAL JOURNAL OF CONTROL, 2011, 84 (12) : 2092 - 2107
  • [5] Distributed Fault Identification and Fault-Tolerant Control for Multi-agent Systems
    Feng, Zhi
    Hu, Guoqiang
    [J]. 2014 33RD CHINESE CONTROL CONFERENCE (CCC), 2014, : 1476 - 1481
  • [6] Fault-tolerant Control of Multi-Agent Systems based on Adaptive Fault Hiding Framework
    Yadegar, Meysam
    Meskin, Nader
    Afshar, Ahmad
    [J]. 2017 AMERICAN CONTROL CONFERENCE (ACC), 2017, : 4111 - 4116
  • [7] Fault-tolerant Consensus Control Based on Fault Detection for Linear Multi-agent Systems
    Li, Xiayang
    Wang, Jinzhi
    Wang, Qi
    [J]. 2018 37TH CHINESE CONTROL CONFERENCE (CCC), 2018, : 6806 - 6811
  • [8] A reorganization strategy to build fault-tolerant multi-agent systems
    Mellouli, Sehl
    [J]. ADVANCES IN ARTIFICIAL INTELLIGENCE, 2007, 4509 : 61 - 72
  • [9] Fault-tolerant Consensus Control for Hybrid Multi-agent Systems
    Habibzadeh, Hamed
    Ziaei, Amin
    Kharrati, Hamed
    Rahimi, Afshin
    [J]. 2022 IEEE INTERNATIONAL CONFERENCE ON PROGNOSTICS AND HEALTH MANAGEMENT (ICPHM), 2022, : 64 - 69
  • [10] Fault-Tolerant Tracking Control for Heterogeneous Multi-Agent Systems
    Pham, Thiem, V
    Nguyen, Quynh T. T.
    Messai, Nadhir
    Manamanni, Noureddine
    [J]. 2020 59TH IEEE CONFERENCE ON DECISION AND CONTROL (CDC), 2020, : 2696 - 2701