Scalable Multi-Robot Cooperation for Multi-Goal Tasks Using Reinforcement Learning

被引：0

作者：

An, Tianxu ^{[1
]}

Lee, Joonho ^{[2
]}

Bjelonic, Marko ^{[3
]}

De Vincenti, Flavio ^{[4
]}

Hutter, Marco ^{[1
]}

机构：

[1] Robot Syst Lab, CH-8092 Zurich, Switzerland

[2] Neuromeka Co Ltd, Seoul 04782, South Korea

[3] Swiss Mile Robot AG, CH-8092 Zurich, Switzerland

[4] Swiss Fed Inst Technol, Computat Robot Lab, CH-8092 Zurich, Switzerland

来源：

IEEE ROBOTICS AND AUTOMATION LETTERS | 2025年 / 10卷 / 02期

基金：

瑞士国家科学基金会; 欧洲研究理事会;

关键词：

Robots; Navigation; Training; Neural networks; Collision avoidance; Mobile robots; Reinforcement learning; Quadrupedal robots; Vectors; Scalability; Legged locomotion; multi-robot systems; reinforcement learning;

D O I：

10.1109/LRA.2024.3521183

中图分类号：

TP24 [机器人技术];

学科分类号：

080202 ; 1405 ;

摘要：

Coordinated navigation of an arbitrary number of robots to an arbitrary number of goals is a big challenge in robotics, often hindered by scalability limitations of existing strategies. This letter introduces a decentralized multi-agent control system using neural network policies trained in simulation. By leveraging permutation invariant neural network architectures and model-free reinforcement learning, our policy enables robots to prioritize varying numbers of collaborating robots and goals in a zero-shot manner without being biased by ordering or limited by a fixed capacity. We validate the task performance and scalability of our policies through experiments in both simulation and real-world settings. Our approach achieves a 10.3% higher success rate in collaborative navigation tasks compared to a policy without a permutation invariant encoder. Additionally, it finds near-optimal solutions for multi-robot navigation problems while being two orders of magnitude faster than an optimization-based centralized controller. We deploy our multi-goal navigation policies on two wheeled-legged quadrupedal robots, which successfully complete a series of multi-goal navigation missions.

引用

页码：1585 / 1592

页数：8

共 50 条

[11] Sequencing of multi-robot behaviors using reinforcement learning
Pierpaoli, Pietro
Doan, Thinh T.
Romberg, Justin
Egerstedt, Magnus
CONTROL THEORY AND TECHNOLOGY, 2021, 19 (04) : 529 - 537
[12] Goal Density-based Hindsight Experience Prioritization for Multi-Goal Robot Manipulation Reinforcement Learning
Kuang, Yingyi
Weinberg, Abraham Itzhak
Vogiatzis, George
Faria, Diego R.
2020 29TH IEEE INTERNATIONAL CONFERENCE ON ROBOT AND HUMAN INTERACTIVE COMMUNICATION (RO-MAN), 2020, : 432 - 437
[13] Reinforcement learning in the multi-robot domain
Mataric, MJ
AUTONOMOUS ROBOTS, 1997, 4 (01) : 73 - 83
[14] Reinforcement Learning in the Multi-Robot Domain
Maja J. Matarić
Autonomous Robots, 1997, 4 : 73 - 83
[15] Maximum Entropy-Regularized Multi-Goal Reinforcement Learning
Zhao, Rui
Sun, Xudong
Tresp, Volker
INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 97, 2019, 97
[16] Scalable Multi-Robot Task Allocation Using Graph Deep Reinforcement Learning with Graph Normalization
Zhang, Zhenqiang
Jiang, Xiangyuan
Yang, Zhenfa
Ma, Sile
Chen, Jiyang
Sun, Wenxu
ELECTRONICS, 2024, 13 (08)
[17] Multi-goal Reinforcement Learning via Exploring Successor Matching
Feng, Xiaoyun
2022 IEEE CONFERENCE ON GAMES, COG, 2022, : 401 - 408
[18] CURIOUS: Intrinsically Motivated Modular Multi-Goal Reinforcement Learning
Colas, Cedric
Fournier, Pierre
Sigaud, Olivier
Chetouani, Mohamed
Oudeyer, Pierre-Yves
INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 97, 2019, 97
[19] A Hierarchical Reinforcement Learning Based Approach for Multi-robot Cooperation in Unknown Environments
Cai, Yifan
Yang, Simon X.
Xu, Xin
Mittal, Gauri S.
PROCEEDINGS OF THE 2011 2ND INTERNATIONAL CONGRESS ON COMPUTER APPLICATIONS AND COMPUTATIONAL SCIENCE, VOL 1, 2012, 144 : 69 - +
[20] Hierarchical Deep Reinforcement Learning for Multi-robot Cooperation in Partially Observable Environment
Liang, Zhixuan
Cao, Jiannong
Lin, Wanyu
Chen, Jinlin
Xu, Huafeng
2021 IEEE THIRD INTERNATIONAL CONFERENCE ON COGNITIVE MACHINE INTELLIGENCE (COGMI 2021), 2021, : 272 - 281

← 1 2 3 4 5 →