VMAS: A Vectorized Multi-agent Simulator for Collective Robot Learning

被引:0
|
作者
Bettini, Matteo [1 ]
Kortvelesy, Ryan [1 ]
Blumenkamp, Jan [1 ]
Prorok, Amanda [1 ]
机构
[1] Univ Cambridge, Dept Comp Sci & Technol, Cambridge, England
基金
欧洲研究理事会; 英国工程与自然科学研究理事会;
关键词
simulator; multi-robot learning; vectorization;
D O I
10.1007/978-3-031-51497-5_4
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
While many multi-robot coordination problems can be solved optimally by exact algorithms, solutions are often not scalable in the number of robots. Multi-Agent Reinforcement Learning (MARL) is gaining increasing attention in the robotics community as a promising solution to tackle such problems. Nevertheless, we still lack the tools that allow us to quickly and efficiently find solutions to largescale collective learning tasks. In this work, we introduce the Vectorized Multi-Agent Simulator (VMAS). VMAS is an open-source framework designed for efficient MARL benchmarking. It is comprised of a vectorized 2D physics engine written in PyTorch and a set of twelve challenging multi-robot scenarios. Additional scenarios can be implemented through a simple and modular interface. We demonstrate how vectorization enables parallel simulation on accelerated hardware without added complexity. When comparing VMAS to OpenAI MPE, we show how MPE's execution time increases linearly in the number of simulations while VMAS is able to execute 30,000 parallel simulations in under 10 s, proving more than 100x faster. Using VMAS's RLlib interface, we benchmark our multi-robot scenarios using various Proximal Policy Optimization (PPO)-based MARL algorithms. VMAS's scenarios prove challenging in orthogonal ways for state-of-the-art MARL algorithms. The VMAS framework is available at: https://github.com/proroklab/ VectorizedMultiAgentSimulator. A video of VMAS scenarios and experiments is available https://youtu.be/aaDRYfiesAY
引用
收藏
页码:42 / 56
页数:15
相关论文
共 50 条
  • [21] Multi-agent learning
    Eduardo Alonso
    [J]. Autonomous Agents and Multi-Agent Systems, 2007, 15 : 3 - 4
  • [22] Multi-agent learning
    Alonso, Eduardo
    [J]. AUTONOMOUS AGENTS AND MULTI-AGENT SYSTEMS, 2007, 15 (01) : 3 - 4
  • [23] Multi-Agent Deep Reinforcement Learning for Multi-Robot Applications: A Survey
    Orr, James
    Dutta, Ayan
    [J]. SENSORS, 2023, 23 (07)
  • [24] Heterogeneous Multi-Robot Cooperation With Asynchronous Multi-Agent Reinforcement Learning
    Zhang, Han
    Zhang, Xiaohui
    Feng, Zhao
    Xiao, Xiaohui
    [J]. IEEE ROBOTICS AND AUTOMATION LETTERS, 2024, 9 (01): : 159 - 166
  • [25] Process mining of a multi-agent business simulator
    Sohei Ito
    Dominik Vymětal
    Roman Šperka
    Michal Halaška
    [J]. Computational and Mathematical Organization Theory, 2018, 24 : 500 - 531
  • [26] People flow prediction by multi-agent simulator
    Sato D.
    Matsubayashi T.
    Adachi T.
    Ooi S.
    Tanaka Y.
    Nagano S.
    Muto Y.
    Shiohara H.
    Miyamoto M.
    Toda H.
    [J]. Transactions of the Japanese Society for Artificial Intelligence, 2020, 35 (02)
  • [27] Process mining of a multi-agent business simulator
    Ito, Sohei
    Vymetal, Dominik
    Sperka, Roman
    Halaska, Michal
    [J]. COMPUTATIONAL AND MATHEMATICAL ORGANIZATION THEORY, 2018, 24 (04) : 500 - 531
  • [28] People Flow Prediction by Multi-Agent Simulator
    Sato, Daisuke
    Matsubayashi, Tatsushi
    Nagano, Shoichi
    Toda, Hiroyuki
    [J]. 2019 IEEE INTERNATIONAL CONFERENCE ON BIG DATA AND SMART COMPUTING (BIGCOMP), 2019, : 436 - 439
  • [29] A multi-agent simulator for generating novelty in monopoly
    Kejriwal, Mayank
    Thomas, Shilpa
    [J]. SIMULATION MODELLING PRACTICE AND THEORY, 2021, 112
  • [30] TEMMAS: The Electricity Market Multi-Agent Simulator
    Trigo, Paulo
    Marques, Paulo
    Coelho, Helder
    [J]. BIO-INSPIRED SYSTEMS: COMPUTATIONAL AND AMBIENT INTELLIGENCE, PT 1, 2009, 5517 : 569 - +