Scalable Multi-Robot Cooperation for Multi-Goal Tasks Using Reinforcement Learning

Authors
An, Tianxu [1 ]
Lee, Joonho [2 ]
Bjelonic, Marko [3 ]
De Vincenti, Flavio [4 ]
Hutter, Marco [1 ]
Affiliations
[1] Robot Syst Lab, CH-8092 Zurich, Switzerland
[2] Neuromeka Co Ltd, Seoul 04782, South Korea
[3] Swiss Mile Robot AG, CH-8092 Zurich, Switzerland
[4] Swiss Fed Inst Technol, Computat Robot Lab, CH-8092 Zurich, Switzerland
Source
IEEE ROBOTICS AND AUTOMATION LETTERS | 2025 / Vol. 10 / No. 02
Funding
Swiss National Science Foundation; European Research Council;
Keywords
Robots; Navigation; Training; Neural networks; Collision avoidance; Mobile robots; Reinforcement learning; Quadrupedal robots; Vectors; Scalability; Legged locomotion; multi-robot systems; reinforcement learning;
DOI
10.1109/LRA.2024.3521183
Chinese Library Classification (CLC) number
TP24 [Robotics];
Discipline classification code
080202 ; 1405 ;
Abstract
Coordinated navigation of an arbitrary number of robots to an arbitrary number of goals is a major challenge in robotics, often hindered by the scalability limitations of existing strategies. This letter introduces a decentralized multi-agent control system using neural network policies trained in simulation. By leveraging permutation-invariant neural network architectures and model-free reinforcement learning, our policy enables robots to prioritize varying numbers of collaborating robots and goals in a zero-shot manner without being biased by ordering or limited by a fixed capacity. We validate the task performance and scalability of our policies through experiments in both simulation and real-world settings. Our approach achieves a 10.3% higher success rate in collaborative navigation tasks compared to a policy without a permutation-invariant encoder. Additionally, it finds near-optimal solutions for multi-robot navigation problems while being two orders of magnitude faster than an optimization-based centralized controller. We deploy our multi-goal navigation policies on two wheeled-legged quadrupedal robots, which successfully complete a series of multi-goal navigation missions.
Pages: 1585 - 1592
Page count: 8
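
Illustrative note: the abstract says the policy handles varying numbers of robots and goals without being biased by ordering or limited by a fixed capacity. The sketch below shows one common way to obtain that property, a shared per-entity embedding followed by mean pooling. It is not the authors' architecture, which this record does not describe. The names `make_weights`, `embed`, and `encode_set`, the random weights, and the 2-D goal coordinates are all hypothetical and chosen only for illustration.

```python
import math
import random
from typing import List, Sequence

Vector = List[float]


def make_weights(n_in: int, n_out: int, seed: int) -> List[Vector]:
    """Hypothetical fixed random weight matrix (n_out rows, n_in columns)."""
    rng = random.Random(seed)
    return [[rng.uniform(-1.0, 1.0) for _ in range(n_in)] for _ in range(n_out)]


def embed(x: Sequence[float], weights: List[Vector]) -> Vector:
    """Shared per-entity embedding tanh(W x), applied identically to every entity."""
    return [math.tanh(sum(w * xi for w, xi in zip(row, x))) for row in weights]


def encode_set(entities: Sequence[Sequence[float]], weights: List[Vector]) -> Vector:
    """Mean-pool the per-entity embeddings into one fixed-size vector.

    The mean does not depend on the order of the entities (up to
    floating-point rounding), and the output length does not depend
    on how many entities are passed in.
    """
    if not entities:
        # Assumption: an empty set is encoded as a zero vector.
        return [0.0] * len(weights)
    embedded = [embed(e, weights) for e in entities]
    return [sum(column) / len(embedded) for column in zip(*embedded)]


# Assumed toy data: three goals given as 2-D positions.
W = make_weights(n_in=2, n_out=4, seed=0)
goals = [[1.0, 2.0], [-0.5, 3.0], [4.0, 0.0]]

code_original = encode_set(goals, W)
code_shuffled = encode_set(list(reversed(goals)), W)
code_one_goal = encode_set(goals[:1], W)
```

Here `code_original` and `code_shuffled` agree up to rounding, and `code_one_goal` has the same length as both, so a downstream policy network could consume the encoding regardless of how many goals or neighboring robots are observed. Mean pooling is only one option; sum pooling, max pooling, or attention-based pooling give the same order invariance.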