Robust Multi-Agent Reinforcement Learning with Model Uncertainty

被引:0
|
作者
Zhang, Kaiqing [1 ,2 ]
Sun, Tao [3 ]
Tao, Yunzhe [3 ]
Genc, Sahika [3 ]
Mallya, Sunil [3 ]
Basar, Tamer [1 ,2 ]
机构
[1] Univ Illinois, Dept ECE, Chicago, IL 60680 USA
[2] Univ Illinois, CSL, Chicago, IL 60680 USA
[3] Amazon Web Serv, Seattle, WA USA
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this work, we study the problem of multi-agent reinforcement learning (MARL) with model uncertainty, which is referred to as robust MARL. This is naturally motivated by some multi-agent applications where each agent may not have perfectly accurate knowledge of the model, e.g., all the reward functions of other agents. Little a priori work on MARL has accounted for such uncertainties, neither in problem formulation nor in algorithm design. In contrast, we model the problem as a robust Markov game, where the goal of all agents is to find policies such that no agent has the incentive to deviate, i.e., reach some equilibrium point, which is also robust to the possible uncertainty of the MARL model. We first introduce the solution concept of robust Nash equilibrium in our setting, and develop a Q-learning algorithm to find such equilibrium policies, with convergence guarantees under certain conditions. In order to handle possibly enormous state-action spaces in practice, we then derive the policy gradients for robust MARL, and develop an actor-critic algorithm with function approximation. Our experiments demonstrate that the proposed algorithm outperforms several baseline MARL methods that do not account for the model uncertainty, in several standard but uncertain cooperative and competitive MARL environments.
引用
收藏
页数:13
相关论文
共 50 条
  • [1] Scalable Robust Multi-Agent Reinforcement Learning for Model Uncertainty
    Jwa, Younkyung
    Gwak, Minseon
    Kwak, Jiin
    Ahn, Chang Wook
    Park, PooGyeon
    2023 62ND IEEE CONFERENCE ON DECISION AND CONTROL, CDC, 2023, : 3402 - 3407
  • [2] Multi-Agent Uncertainty Sharing for Cooperative Multi-Agent Reinforcement Learning
    Chen, Hao
    Yang, Guangkai
    Zhang, Junge
    Yin, Qiyue
    Huang, Kaiqi
    2022 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2022,
  • [3] Uncertainty modified policy for multi-agent reinforcement learning
    Zhao, Xinyu
    Liu, Jianxiang
    Wu, Faguo
    Zhang, Xiao
    Wang, Guojian
    APPLIED INTELLIGENCE, 2024, 54 (22) : 12020 - 12034
  • [4] Robust multi-agent reinforcement learning for noisy environments
    Chen, Xinning
    Liu, Xuan
    Luo, Canhui
    Yin, Jiangjin
    PEER-TO-PEER NETWORKING AND APPLICATIONS, 2022, 15 (02) : 1045 - 1056
  • [5] Robust multi-agent reinforcement learning for noisy environments
    Xinning Chen
    Xuan Liu
    Canhui Luo
    Jiangjin Yin
    Peer-to-Peer Networking and Applications, 2022, 15 : 1045 - 1056
  • [6] Robust Communicative Multi-Agent Reinforcement Learning with Active Defense
    Yu, Lebin
    Qiu, Yunbo
    Yao, Quanming
    Shen, Yuan
    Zhang, Xudong
    Wang, Jian
    THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 16, 2024, : 17575 - 17582
  • [7] Robust experience replay sampling for multi-agent reinforcement learning
    Nicholaus, Isack Thomas
    Kang, Dae-Ki
    PATTERN RECOGNITION LETTERS, 2022, 155 : 135 - 142
  • [8] DATA-DRIVEN ROBUST MULTI-AGENT REINFORCEMENT LEARNING
    Wang, Yudan
    Wang, Yue
    Zhou, Yi
    Velasquez, Alvaro
    Zou, Shaofeng
    2022 IEEE 32ND INTERNATIONAL WORKSHOP ON MACHINE LEARNING FOR SIGNAL PROCESSING (MLSP), 2022,
  • [9] Robust Multi-agent Patrolling Strategies Using Reinforcement Learning
    Lauri, Fabrice
    Koukam, Abderrafiaa
    SWARM INTELLIGENCE BASED OPTIMIZATION (ICSIBO 2014), 2014, 8472 : 157 - 165
  • [10] Multi-Agent Reinforcement Learning
    Stankovic, Milos
    2016 13TH SYMPOSIUM ON NEURAL NETWORKS AND APPLICATIONS (NEUREL), 2016, : 43 - 43