Communication-robust multi-agent learning by adaptable auxiliary multi-agent adversary generation

被引：0

作者：

Yuan, Lei ^{[1
,2
]}

Chen, Feng ^{[1
]}

Zhang, Zongzhang ^{[1
]}

Yu, Yang ^{[1
,2
]}

机构：

[1] Nanjing Univ, Natl Key Lab Novel Software Technol, Nanjing 210023, Peoples R China

[2] Polixir Technol, Nanjing 211106, Peoples R China

来源：

FRONTIERS OF COMPUTER SCIENCE | 2024年 / 18卷 / 06期

基金：

国家重点研发计划; 中国国家自然科学基金;

关键词：

multi-agent communication; adversarial training; robustness validation; reinforcement learning;

D O I：

10.1007/s11704-023-2733-5

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Communication can promote coordination in cooperative Multi-Agent Reinforcement Learning (MARL). Nowadays, existing works mainly focus on improving the communication efficiency of agents, neglecting that real-world communication is much more challenging as there may exist noise or potential attackers. Thus the robustness of the communication-based policies becomes an emergent and severe issue that needs more exploration. In this paper, we posit that the ego system1) trained with auxiliary adversaries may handle this limitation and propose an adaptable method of Multi-Agent Auxiliary Adversaries Generation for robust Communication, dubbed MA3C, to obtain a robust communication-based policy. In specific, we introduce a novel message-attacking approach that models the learning of the auxiliary attacker as a cooperative problem under a shared goal to minimize the coordination ability of the ego system, with which every information channel may suffer from distinct message attacks. Furthermore, as naive adversarial training may impede the generalization ability of the ego system, we design an attacker population generation approach based on evolutionary learning. Finally, the ego system is paired with an attacker population and then alternatively trained against the continuously evolving attackers to improve its robustness, meaning that both the ego system and the attackers are adaptable. Extensive experiments on multiple benchmarks indicate that our proposed MA3C provides comparable or better robustness and generalization ability than other baselines.

引用

页数：17

共 50 条

[1] Communication-robust multi-agent learning by adaptable auxiliary multi-agent adversary generation
Lei Yuan
Feng Chen
Zongzhang Zhang
Yang Yu
[J]. Frontiers of Computer Science, 2024, 18
[2] Communication-robust multi-agent learning by adaptable auxiliary multi-agent adversary generation
YUAN Lei
CHEN Feng
ZHANG Zongzhang
YU Yang
[J]. Frontiers of Computer Science, 2024, 18 (06)
[3] Multi-Agent Reinforcement Learning With Distributed Targeted Multi-Agent Communication
Xu, Chi
Zhang, Hui
Zhang, Ya
[J]. 2023 35TH CHINESE CONTROL AND DECISION CONFERENCE, CCDC, 2023, : 2915 - 2920
[4] Learning communication for multi-agent systems
Giles, CL
Jim, KC
[J]. INNOVATIVE CONCPTS FOR AGENT-BASED SYSTEMS, 2002, 2564 : 377 - 390
[5] Learning and communication in multi-agent systems
Friedrich, H
Kaiser, M
Rogalla, O
Dillmann, R
[J]. DISTRIBUTED ARTIFICIAL INTELLIGENCE MEETS MACHINE LEARNING: LEARNING IN MULTI-AGENT ENVIRONMENTS, 1997, 1221 : 259 - 275
[6] Research of communication mechanism of the multi-agent in multi-agent robot systems
Gao, Zhijun
Yan, Guozheng
Ding, Guoqing
Huang, Heng
[J]. High Technology Letters, 2002, 8 (01) : 67 - 71
[7] Research of Communication Mechanism of the Multi-agent in Multi-agent Robot Systems
高志军
[J]. High Technology Letters, 2002, (01) : 67 - 71
[8] Multi-agent learning
Eduardo Alonso
[J]. Autonomous Agents and Multi-Agent Systems, 2007, 15 : 3 - 4
[9] Robust Multi-Agent Coordination via Evolutionary Generation of Auxiliary Adversarial Attackers
Yuan, Lei
Zhang, Ziqian
Xue, Ke
Yin, Hao
Chen, Feng
Guan, Cong
Li, Lihe
Qian, Chao
Yu, Yang
[J]. THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 10, 2023, : 11753 - 11762
[10] Multi-agent learning
Alonso, Eduardo
[J]. AUTONOMOUS AGENTS AND MULTI-AGENT SYSTEMS, 2007, 15 (01) : 3 - 4

← 1 2 3 4 5 →