A survey and critique of multiagent deep reinforcement learning

被引：6

作者：

Pablo Hernandez-Leal

Bilal Kartal

Matthew E. Taylor

机构：

[1] Borealis AI,

来源：

Autonomous Agents and Multi-Agent Systems | 2019年 / 33卷

关键词：

Multiagent learning; Multiagent systems; Multiagent reinforcement learning; Deep reinforcement learning; Survey;

D O I：

暂无

中图分类号：

学科分类号：

摘要：

Deep reinforcement learning (RL) has achieved outstanding results in recent years. This has led to a dramatic increase in the number of applications and methods. Recent works have explored learning beyond single-agent scenarios and have considered multiagent learning (MAL) scenarios. Initial results report successes in complex multiagent domains, although there are several challenges to be addressed. The primary goal of this article is to provide a clear overview of current multiagent deep reinforcement learning (MDRL) literature. Additionally, we complement the overview with a broader analysis: (i) we revisit previous key components, originally presented in MAL and RL, and highlight how they have been adapted to multiagent deep reinforcement learning settings. (ii) We provide general guidelines to new practitioners in the area: describing lessons learned from MDRL works, pointing to recent benchmarks, and outlining open avenues of research. (iii) We take a more critical tone raising practical challenges of MDRL (e.g., implementation and computational demands). We expect this article will help unify and motivate future research to take advantage of the abundant literature that exists (e.g., RL and MAL) in a joint effort to promote fruitful research in the multiagent community.

引用

页码：750 / 797

页数：47

共 50 条

[41] Simultaneously Learning and Advising in Multiagent Reinforcement Learning
da Silva, Felipe Leno
Glatt, Ruben
Reali Costa, Anna Helena
AAMAS'17: PROCEEDINGS OF THE 16TH INTERNATIONAL CONFERENCE ON AUTONOMOUS AGENTS AND MULTIAGENT SYSTEMS, 2017, : 1100 - 1108
[42] A Proactive Eavesdropping Game in MIMO Systems Based on Multiagent Deep Reinforcement Learning
Guo, Delin
Ding, Hui
Tang, Lan
Zhang, Xinggan
Yang, Lvxi
Liang, Ying-Chang
IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2022, 21 (11) : 8889 - 8904
[43] ROMAX: Certifiably Robust Deep Multiagent Reinforcement Learning via Convex Relaxation
Sun, Chuangchuang
Kim, Dong-Ki
How, Jonathan P.
Proceedings - IEEE International Conference on Robotics and Automation, 2022, : 5503 - 5510
[44] Distributed Multiagent Deep Reinforcement Learning for Multiline Dynamic Bus Timetable Optimization
Yan, Haoyang
Cui, Zhiyong
Chen, Xinqiang
Ma, Xiaolei
IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2023, 19 (01) : 469 - 479
[45] Adversarial Attacks on Multiagent Deep Reinforcement Learning Models in Continuous Action Space
Zhou, Ziyuan
Liu, Guanjun
Guo, Weiran
Zhou, MengChu
IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2024,
[46] Lateral Transfer Learning for Multiagent Reinforcement Learning
Shi, Haobin
Li, Jingchen
Mao, Jiahui
Hwang, Kao-Shing
IEEE TRANSACTIONS ON CYBERNETICS, 2023, 53 (03) : 1699 - 1711
[47] Learning to Teach in Cooperative Multiagent Reinforcement Learning
Omidshafiei, Shayegan
Kim, Dong-Ki
Liu, Miao
Tesauro, Gerald
Riemer, Matthew
Amato, Christopher
Campbell, Murray
How, Jonathan P.
THIRTY-THIRD AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FIRST INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / NINTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2019, : 6128 - 6136
[48] Measurement of Regional Electric Vehicle Adoption Using Multiagent Deep Reinforcement Learning
Choi, Seung Jun
Jiao, Junfeng
APPLIED SCIENCES-BASEL, 2024, 14 (05):
[49] Interterminal Truck Routing Optimization Using Cooperative Multiagent Deep Reinforcement Learning
Adi, Taufik Nur
Bae, Hyerim
Iskandar, Yelita Anggiane
PROCESSES, 2021, 9 (10)
[50] ROMAX: Certifiably Robust Deep Multiagent Reinforcement Learning via Convex Relaxation
Sun, Chuangchuang
Kim, Dong-Ki
How, Jonathan P.
2022 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2022), 2022, : 5503 - 5510

← 1 2 3 4 5 →