A survey and critique of multiagent deep reinforcement learning

Cited by: 6
Authors
Pablo Hernandez-Leal
Bilal Kartal
Matthew E. Taylor
Affiliation
[1] Borealis AI
Keywords
Multiagent learning; Multiagent systems; Multiagent reinforcement learning; Deep reinforcement learning; Survey
DOI
Not available
Abstract
Deep reinforcement learning (RL) has achieved outstanding results in recent years. This has led to a dramatic increase in the number of applications and methods. Recent works have explored learning beyond single-agent scenarios and have considered multiagent learning (MAL) scenarios. Initial results report successes in complex multiagent domains, although there are several challenges to be addressed. The primary goal of this article is to provide a clear overview of current multiagent deep reinforcement learning (MDRL) literature. Additionally, we complement the overview with a broader analysis: (i) we revisit previous key components, originally presented in MAL and RL, and highlight how they have been adapted to multiagent deep reinforcement learning settings; (ii) we provide general guidelines to new practitioners in the area, describing lessons learned from MDRL works, pointing to recent benchmarks, and outlining open avenues of research; and (iii) we take a more critical tone, raising practical challenges of MDRL (e.g., implementation and computational demands). We expect this article will help unify and motivate future research to take advantage of the abundant literature that exists (e.g., RL and MAL) in a joint effort to promote fruitful research in the multiagent community.
Pages: 750-797
Page count: 47
Related papers
50 in total
  • [41] Simultaneously Learning and Advising in Multiagent Reinforcement Learning
    da Silva, Felipe Leno
    Glatt, Ruben
    Reali Costa, Anna Helena
    AAMAS'17: PROCEEDINGS OF THE 16TH INTERNATIONAL CONFERENCE ON AUTONOMOUS AGENTS AND MULTIAGENT SYSTEMS, 2017, : 1100 - 1108
  • [42] A Proactive Eavesdropping Game in MIMO Systems Based on Multiagent Deep Reinforcement Learning
    Guo, Delin
    Ding, Hui
    Tang, Lan
    Zhang, Xinggan
    Yang, Lvxi
    Liang, Ying-Chang
    IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2022, 21 (11) : 8889 - 8904
  • [43] ROMAX: Certifiably Robust Deep Multiagent Reinforcement Learning via Convex Relaxation
    Sun, Chuangchuang
    Kim, Dong-Ki
    How, Jonathan P.
    Proceedings - IEEE International Conference on Robotics and Automation, 2022, : 5503 - 5510
  • [44] Distributed Multiagent Deep Reinforcement Learning for Multiline Dynamic Bus Timetable Optimization
    Yan, Haoyang
    Cui, Zhiyong
    Chen, Xinqiang
    Ma, Xiaolei
    IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2023, 19 (01) : 469 - 479
  • [45] Adversarial Attacks on Multiagent Deep Reinforcement Learning Models in Continuous Action Space
    Zhou, Ziyuan
    Liu, Guanjun
    Guo, Weiran
    Zhou, MengChu
    IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2024,
  • [46] Lateral Transfer Learning for Multiagent Reinforcement Learning
    Shi, Haobin
    Li, Jingchen
    Mao, Jiahui
    Hwang, Kao-Shing
    IEEE TRANSACTIONS ON CYBERNETICS, 2023, 53 (03) : 1699 - 1711
  • [47] Learning to Teach in Cooperative Multiagent Reinforcement Learning
    Omidshafiei, Shayegan
    Kim, Dong-Ki
    Liu, Miao
    Tesauro, Gerald
    Riemer, Matthew
    Amato, Christopher
    Campbell, Murray
    How, Jonathan P.
    THIRTY-THIRD AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FIRST INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / NINTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2019, : 6128 - 6136
  • [48] Measurement of Regional Electric Vehicle Adoption Using Multiagent Deep Reinforcement Learning
    Choi, Seung Jun
    Jiao, Junfeng
    APPLIED SCIENCES-BASEL, 2024, 14 (05):
  • [49] Interterminal Truck Routing Optimization Using Cooperative Multiagent Deep Reinforcement Learning
    Adi, Taufik Nur
    Bae, Hyerim
    Iskandar, Yelita Anggiane
    PROCESSES, 2021, 9 (10)