MARNet: Backdoor Attacks Against Cooperative Multi-Agent Reinforcement Learning

被引:7
|
作者
Chen, Yanjiao [1 ]
Zheng, Zhicong [1 ]
Gong, Xueluan [2 ]
机构
[1] Zhejiang Univ, Coll Elect Engn, Hangzhou 310027, Peoples R China
[2] Wuhan Univ, Sch Comp Sci, Wuhan 430072, Peoples R China
基金
中国国家自然科学基金;
关键词
Training; Computer crime; Predator prey systems; Task analysis; Games; Convergence; Q-learning; Backdoor attacks; multi-agent reinforcement learning;
D O I
10.1109/TDSC.2022.3207429
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Recent works have revealed that backdoor attacks against Deep Reinforcement Learning ( DRL) could lead to abnormal action selections of the agent, which may result in failure or even catastrophe in crucial decision processes. However, existing attacks only consider single-agent reinforcement learning (RL) systems, in which the only agent can observe the global state and have full control of the decision process. In this article, we explore a new backdoor attack paradigm in cooperative multi-agent reinforcement learning (CMARL) scenarios, where a group of agents coordinate with each other to achieve a common goal, while each agent can only observe the local state. In the proposed MARNet attack framework, we carefully design a pipeline of trigger design, action poisoning, and reward hacking modules to accommodate the cooperative multi-agent settings. In particular, as only a subset of agents can observe the triggers in their local observations, we maneuver their actions to the worst actions suggested by an expert policy model. Since the global reward in CMARL is aggregated by individual rewards from all agents, we propose to modify the reward in a way that boosts the bad actions of poisoned agents (agents who observe the triggers) but mitigates the influence on non-poisoned agents. We conduct extensive experiments on three classical CMARL algorithms VDN, COMA, and QMIX, in two popular CMARL games Predator Prey and SMAC. The results show that the baselines extended from single-agent DRL backdoor attacks seldom work in CMARL problems while MARNet performs well by reducing the utility under attack by nearly 100%. We apply fine-tuning as a potential defense against MARNet and demonstrate that fine- tuning cannot entirely eliminate the effect of the attack.
引用
收藏
页码:4188 / 4198
页数:11
相关论文
共 50 条
  • [1] Backdoor Attacks on Multi-Agent Reinforcement Learning-based Spectrum Management
    Zhang, Hongyi
    Liu, Mingqian
    Chen, Yunfei
    [J]. IEEE CONFERENCE ON GLOBAL COMMUNICATIONS, GLOBECOM, 2023, : 3361 - 3365
  • [2] Security Analysis of Poisoning Attacks Against Multi-agent Reinforcement Learning
    Xie, Zhiqiang
    Xiang, Yingxiao
    Li, Yike
    Zhao, Shuang
    Tong, Endong
    Niu, Wenjia
    Liu, Jiqiang
    Wang, Jian
    [J]. ALGORITHMS AND ARCHITECTURES FOR PARALLEL PROCESSING, ICA3PP 2021, PT I, 2022, 13155 : 660 - 675
  • [3] Multi-Agent Uncertainty Sharing for Cooperative Multi-Agent Reinforcement Learning
    Chen, Hao
    Yang, Guangkai
    Zhang, Junge
    Yin, Qiyue
    Huang, Kaiqi
    [J]. 2022 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2022,
  • [4] On the Robustness of Cooperative Multi-Agent Reinforcement Learning
    Lin, Jieyu
    Dzeparoska, Kristina
    Zhang, Sai Qian
    Leon-Garcia, Alberto
    Papernot, Nicolas
    [J]. 2020 IEEE SYMPOSIUM ON SECURITY AND PRIVACY WORKSHOPS (SPW 2020), 2020, : 62 - 68
  • [5] Consensus Learning for Cooperative Multi-Agent Reinforcement Learning
    Xu, Zhiwei
    Zhang, Bin
    Li, Dapeng
    Zhang, Zeren
    Zhou, Guangchong
    Chen, Hao
    Fan, Guoliang
    [J]. THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 10, 2023, : 11726 - 11734
  • [6] A Novel Distributed Multi-Agent Reinforcement Learning Algorithm Against Jamming Attacks
    Elleuch, Ibrahim
    Pourranjbar, Ali
    Kaddoum, Georges
    [J]. IEEE COMMUNICATIONS LETTERS, 2021, 25 (10) : 3204 - 3208
  • [7] Learning Cooperative Intrinsic Motivation in Multi-Agent Reinforcement Learning
    Hong, Seung-Jin
    Lee, Sang-Kwang
    [J]. 12TH INTERNATIONAL CONFERENCE ON ICT CONVERGENCE (ICTC 2021): BEYOND THE PANDEMIC ERA WITH ICT CONVERGENCE INNOVATION, 2021, : 1697 - 1699
  • [8] Cooperative Learning of Multi-Agent Systems Via Reinforcement Learning
    Wang, Xin
    Zhao, Chen
    Huang, Tingwen
    Chakrabarti, Prasun
    Kurths, Juergen
    [J]. IEEE TRANSACTIONS ON SIGNAL AND INFORMATION PROCESSING OVER NETWORKS, 2023, 9 : 13 - 23
  • [9] Cooperative Multi-Agent Reinforcement Learning With Approximate Model Learning
    Park, Young Joon
    Lee, Young Jae
    Kim, Seoung Bum
    [J]. IEEE ACCESS, 2020, 8 : 125389 - 125400
  • [10] Cooperative Multi-Agent Reinforcement Learning with Hypergraph Convolution
    Bai, Yunpeng
    Gong, Chen
    Zhang, Bin
    Fan, Guoliang
    Hou, Xinwen
    Lu, Yu
    [J]. 2022 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2022,