Towards Comprehensive Testing on the Robustness of Cooperative Multi-agent Reinforcement Learning

被引:16
|
作者
Guo, Jun [1 ]
Chen, Yonghong [2 ]
Hao, Yihang [2 ]
Yin, Zixin [1 ]
Yu, Yin [3 ]
Li, Simin [1 ]
机构
[1] Beihang Univ, State Key Lab Software Dev Environm, Beijing, Peoples R China
[2] Yangzhou Collaborat Innovat Res Inst CO LTD, Yangzhou, Jiangsu, Peoples R China
[3] CETC, Res Inst 38, Beijing, Peoples R China
关键词
D O I
10.1109/CVPRW56347.2022.00022
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
While deep neural networks (DNNs) have strengthened the performance of cooperative multi-agent reinforcement learning (c-MARL), the agent policy can be easily perturbed by adversarial examples. Considering the safety critical applications of c-MARL, such as traffic management, power management and unmanned aerial vehicle control, it is crucial to test the robustness of c-MARL algorithm before it was deployed in reality. Existing adversarial attacks for MARL could be used for testing, but is limited to one robustness aspects (e.g., reward, state, action), while c-MARL model could be attacked from any aspect. To overcome the challenge, we propose MARLSafe, the first robustness testing framework for c-MARL algorithms. First, motivated by Markov Decision Process (MDP), MARLSafe consider the robustness of c-MARL algorithms comprehensively from three aspects, namely state robustness, action robustness and reward robustness. Any c-MARL algorithm must simultaneously satisfy these robustness aspects to be considered secure. Second, due to the scarceness of c-MARL attack, we propose c-MARL attacks as robustness testing algorithms from multiple aspects. Experiments on SMAC environment reveals that many state-of-the-art c-MARL algorithms are of low robustness in all aspect, pointing out the urgent need to test and enhance robustness of c-MARL algorithms.
引用
收藏
页码:114 / 121
页数:8
相关论文
共 50 条
  • [21] Reinforcement learning of coordination in cooperative multi-agent systems
    Kapetanakis, S
    Kudenko, D
    [J]. EIGHTEENTH NATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE (AAAI-02)/FOURTEENTH INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE (IAAI-02), PROCEEDINGS, 2002, : 326 - 331
  • [22] Training Cooperative Agents for Multi-Agent Reinforcement Learning
    Bhalla, Sushrut
    Subramanian, Sriram G.
    Crowley, Mark
    [J]. AAMAS '19: PROCEEDINGS OF THE 18TH INTERNATIONAL CONFERENCE ON AUTONOMOUS AGENTS AND MULTIAGENT SYSTEMS, 2019, : 1826 - 1828
  • [23] Towards reinforcement learning for holonic multi-agent systems
    Abdoos, Monireh
    Mozayani, Nasser
    Bazzan, Ana L. C.
    [J]. INTELLIGENT DATA ANALYSIS, 2015, 19 (02) : 211 - 232
  • [24] Learning Distinct Strategies for Heterogeneous Cooperative Multi-agent Reinforcement Learning
    Wan, Kejia
    Xu, Xinhai
    Li, Yuan
    [J]. ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2021, PT IV, 2021, 12894 : 544 - 555
  • [25] Pacesetter Learning for Large Scale Cooperative Multi-Agent Reinforcement Learning
    Zhou, Pingqi
    Li, Chao
    Qiu, Mengwei
    Liu, Jun
    Ma, Chennan
    Yan, Ming
    [J]. ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING, ICANN 2023, PT VI, 2023, 14259 : 115 - 126
  • [26] Learning Fair Policies in Decentralized Cooperative Multi-Agent Reinforcement Learning
    Zimmer, Matthieu
    Glanois, Claire
    Siddique, Umer
    Weng, Paul
    [J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 139, 2021, 139
  • [27] QTRAN: Learning to Factorize with Transformation for Cooperative Multi-Agent Reinforcement learning
    Son, Kyunghwan
    Kim, Daewoo
    Kang, Wan Ju
    Hostallero, David
    Yi, Yung
    [J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 97, 2019, 97
  • [28] Learning Implicit Credit Assignment for Cooperative Multi-Agent Reinforcement Learning
    Zhou, Meng
    Liu, Ziyu
    Sui, Pengwei
    Li, Yixuan
    Chung, Yuk Ying
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 33, NEURIPS 2020, 2020, 33
  • [29] Certified Policy Smoothing for Cooperative Multi-Agent Reinforcement Learning
    Mu, Ronghui
    Ruan, Wenjie
    Marcolino, Leandro Soriano
    Jin, Gaojie
    Ni, Qiang
    [J]. THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 12, 2023, : 15046 - 15054
  • [30] Optimistic Value Instructors for Cooperative Multi-Agent Reinforcement Learning
    Li, Chao
    Zhang, Yupeng
    Wang, Jianqi
    Hu, Yujing
    Dong, Shaokang
    Li, Wenbin
    Lv, Tangjie
    Fan, Changjie
    Gao, Yang
    [J]. THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 16, 2024, : 17453 - 17460