Robust Multi-Agent Reinforcement Learning with Model Uncertainty

被引：0

作者：

Zhang, Kaiqing ^{[1
,2
]}

Sun, Tao ^{[3
]}

Tao, Yunzhe ^{[3
]}

Genc, Sahika ^{[3
]}

Mallya, Sunil ^{[3
]}

Basar, Tamer ^{[1
,2
]}

机构：

[1] Univ Illinois, Dept ECE, Chicago, IL 60680 USA

[2] Univ Illinois, CSL, Chicago, IL 60680 USA

[3] Amazon Web Serv, Seattle, WA USA

来源：

ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 33, NEURIPS 2020 | 2020年 / 33卷

关键词：

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

In this work, we study the problem of multi-agent reinforcement learning (MARL) with model uncertainty, which is referred to as robust MARL. This is naturally motivated by some multi-agent applications where each agent may not have perfectly accurate knowledge of the model, e.g., all the reward functions of other agents. Little a priori work on MARL has accounted for such uncertainties, neither in problem formulation nor in algorithm design. In contrast, we model the problem as a robust Markov game, where the goal of all agents is to find policies such that no agent has the incentive to deviate, i.e., reach some equilibrium point, which is also robust to the possible uncertainty of the MARL model. We first introduce the solution concept of robust Nash equilibrium in our setting, and develop a Q-learning algorithm to find such equilibrium policies, with convergence guarantees under certain conditions. In order to handle possibly enormous state-action spaces in practice, we then derive the policy gradients for robust MARL, and develop an actor-critic algorithm with function approximation. Our experiments demonstrate that the proposed algorithm outperforms several baseline MARL methods that do not account for the model uncertainty, in several standard but uncertain cooperative and competitive MARL environments.

引用

页数：13

共 50 条

[1] Scalable Robust Multi-Agent Reinforcement Learning for Model Uncertainty
Jwa, Younkyung
Gwak, Minseon
Kwak, Jiin
Ahn, Chang Wook
Park, PooGyeon
2023 62ND IEEE CONFERENCE ON DECISION AND CONTROL, CDC, 2023, : 3402 - 3407
[2] Multi-Agent Uncertainty Sharing for Cooperative Multi-Agent Reinforcement Learning
Chen, Hao
Yang, Guangkai
Zhang, Junge
Yin, Qiyue
Huang, Kaiqi
2022 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2022,
[3] Uncertainty modified policy for multi-agent reinforcement learning
Zhao, Xinyu
Liu, Jianxiang
Wu, Faguo
Zhang, Xiao
Wang, Guojian
APPLIED INTELLIGENCE, 2024, 54 (22) : 12020 - 12034
[4] Robust multi-agent reinforcement learning for noisy environments
Chen, Xinning
Liu, Xuan
Luo, Canhui
Yin, Jiangjin
PEER-TO-PEER NETWORKING AND APPLICATIONS, 2022, 15 (02) : 1045 - 1056
[5] Robust multi-agent reinforcement learning for noisy environments
Xinning Chen
Xuan Liu
Canhui Luo
Jiangjin Yin
Peer-to-Peer Networking and Applications, 2022, 15 : 1045 - 1056
[6] Robust Communicative Multi-Agent Reinforcement Learning with Active Defense
Yu, Lebin
Qiu, Yunbo
Yao, Quanming
Shen, Yuan
Zhang, Xudong
Wang, Jian
THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 16, 2024, : 17575 - 17582
[7] Robust experience replay sampling for multi-agent reinforcement learning
Nicholaus, Isack Thomas
Kang, Dae-Ki
PATTERN RECOGNITION LETTERS, 2022, 155 : 135 - 142
[8] DATA-DRIVEN ROBUST MULTI-AGENT REINFORCEMENT LEARNING
Wang, Yudan
Wang, Yue
Zhou, Yi
Velasquez, Alvaro
Zou, Shaofeng
2022 IEEE 32ND INTERNATIONAL WORKSHOP ON MACHINE LEARNING FOR SIGNAL PROCESSING (MLSP), 2022,
[9] Robust Multi-agent Patrolling Strategies Using Reinforcement Learning
Lauri, Fabrice
Koukam, Abderrafiaa
SWARM INTELLIGENCE BASED OPTIMIZATION (ICSIBO 2014), 2014, 8472 : 157 - 165
[10] Multi-Agent Reinforcement Learning
Stankovic, Milos
2016 13TH SYMPOSIUM ON NEURAL NETWORKS AND APPLICATIONS (NEUREL), 2016, : 43 - 43

← 1 2 3 4 5 →