Adversarial Attacks on Heterogeneous Multi-Agent Deep Reinforcement Learning System with Time-Delayed Data Transmission

被引:3
|
作者
Fard, Neshat Elhami [1 ]
Selmic, Rastko R. [1 ]
机构
[1] Concordia Univ, Dept Elect & Comp Engn, Montreal, PQ H3G 1M8, Canada
关键词
multi-agent system; deep Q-network (DQN); data transmission; gradient-based attack; defense; CONSENSUS CONTROL;
D O I
10.3390/jsan11030045
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper studies the gradient-based adversarial attacks on cluster-based, heterogeneous, multi-agent, deep reinforcement learning (MADRL) systems with time-delayed data transmission. The structure of the MADRL system consists of various clusters of agents. The deep Q-network (DQN) architecture presents the first cluster's agent structure. The other clusters are considered as the environment of the first cluster's DQN agent. We introduce two novel observations in data transmission, termed on-time and time-delay observations. The proposed observations are considered when the data transmission channel is idle, and the data is transmitted on time or delayed. By considering the distance between the neighboring agents, we present a novel immediate reward function by appending a distance-based reward to the previously utilized reward to improve the MADRL system performance. We consider three types of gradient-based attacks to investigate the robustness of the proposed system data transmission. Two defense methods are proposed to reduce the effects of the discussed malicious attacks. We have rigorously shown the system performance based on the DQN loss and the team reward for the entire team of agents. Moreover, the effects of the various attacks before and after using defense algorithms are demonstrated. The theoretical results are illustrated and verified with simulation examples.
引用
收藏
页数:25
相关论文
共 50 条
  • [1] Time-delayed Data Transmission in Heterogeneous Multi-agent Deep Reinforcement Learning System
    Fard, Elhami
    Selmic, Rastko R.
    [J]. 2022 30TH MEDITERRANEAN CONFERENCE ON CONTROL AND AUTOMATION (MED), 2022, : 636 - 642
  • [2] Data Transmission Resilience to Cyber-attacks on Heterogeneous Multi-agent Deep Reinforcement Learning Systems
    Fard, Neshat Elhami
    Selmic, Rastko R.
    [J]. 2022 17TH INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION, ROBOTICS AND VISION (ICARCV), 2022, : 758 - 764
  • [3] Towards Secure Multi-Agent Deep Reinforcement Learning: Adversarial Attacks and Countermeasures
    Zheng, Changgang
    Zhen, Chen
    Xie, Haiyong
    Yang, Shufan
    [J]. 2022 5TH IEEE CONFERENCE ON DEPENDABLE AND SECURE COMPUTING (IEEE DSC 2022), 2022,
  • [4] Efficient Adversarial Attacks on Online Multi-agent Reinforcement Learning
    Liu, Guanlin
    Lai, Lifeng
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [5] Adversarial attacks in consensus-based multi-agent reinforcement learning
    Figura, Martin
    Kosaraju, Krishna Chaitanya
    Gupta, Vijay
    [J]. 2021 AMERICAN CONTROL CONFERENCE (ACC), 2021, : 3050 - 3055
  • [6] Multi-Agent Adversarial Inverse Reinforcement Learning
    Yu, Lantao
    Song, Jiaming
    Ermon, Stefano
    [J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 97, 2019, 97
  • [7] Adversarial attacks on cooperative multi-agent deep reinforcement learning: a dynamic group-based adversarial example transferability method
    Lixia Zan
    Xiangbin Zhu
    Zhao-Long Hu
    [J]. Complex & Intelligent Systems, 2023, 9 : 7439 - 7450
  • [8] Multi-Agent Deep Q Network to Enhance the Reinforcement Learning for Delayed Reward System
    Kim, Keecheon
    [J]. APPLIED SCIENCES-BASEL, 2022, 12 (07):
  • [9] Adversarial attacks on cooperative multi-agent deep reinforcement learning: a dynamic group-based adversarial example transferability method
    Zan, Lixia
    Zhu, Xiangbin
    Hu, Zhao-Long
    [J]. COMPLEX & INTELLIGENT SYSTEMS, 2023, 9 (06) : 7439 - 7450
  • [10] Deep Reinforcement Learning for Multi-Agent Power Control in Heterogeneous Networks
    Zhang, Lin
    Liang, Ying-Chang
    [J]. IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2021, 20 (04) : 2551 - 2564