Learning to Navigate in Turbulent Flows With Aerial Robot Swarms: A Cooperative Deep Reinforcement Learning Approach

被引：1

作者：

Patino, Diego ^{[1
]}

Mayya, Siddharth ^{[2
]}

Calderon, Juan ^{[3
,4
]}

Daniilidis, Kostas ^{[1
]}

Saldana, David ^{[5
]}

机构：

[1] Univ Penn, GRASP Lab, Philadelphia, PA 19104 USA

[2] Amazon Robot, Cambridge, MA 02141 USA

[3] Univ St Tomas, Bogota 110231, Colombia

[4] Bethune Cookman Univ, Daytona Beach, FL 32114 USA

[5] Lehigh Univ, Autonomous & Intelligent Robot Lab AIRLab, Bethlehem, PA 18015 USA

来源：

IEEE ROBOTICS AND AUTOMATION LETTERS | 2023年 / 8卷 / 07期

关键词：

Robots; Robot kinematics; Robot sensing systems; Wind; Navigation; Force; Drag; Swarm robotics; reinforcement learning; wind turbulence; machine learning for robot control; graph neural networks; NEURAL-NETWORKS; FIELDS;

D O I：

10.1109/LRA.2023.3280806

中图分类号：

TP24 [机器人技术];

学科分类号：

080202 ; 1405 ;

摘要：

Aerial operation in turbulent environments is a challenging problem due to the chaotic behavior of the flow. This problem is made even more complex when a team of aerial robots is trying to achieve coordinated motion in turbulent wind conditions. In this letter, we present a novel multi-robot controller to navigate in turbulent flows, decoupling the trajectory-tracking control from the turbulence compensation via a nested control architecture. Unlike previous works, our method does not learn to compensate for the air-flow at a specific time and space. Instead, our method learns to compensate for the flow based on its effect on the team. This is made possible via a deep reinforcement learning approach, implemented via a Graph Convolutional Neural Network (GCNN)-based architecture, which enables robots to achieve better wind compensation by processing the spatial-temporal correlation of wind flows across the team. Our approach scales well to large robot teams -as each robot only uses information from its nearest neighbors-, and generalizes well to robot teams larger than seen in training. Simulated experiments demonstrate how information sharing improves turbulence compensation in a team of aerial robots and demonstrate the flexibility of our method over different team configurations.

引用

页码：4219 / 4226

页数：8

共 50 条

[31] Hierarchical Human-robot Cooperative Control Based on GPR and Deep Reinforcement Learning
Jin Z.-H.
Liu A.-D.
Yu L.
Zidonghua Xuebao/Acta Automatica Sinica, 2022, 48 (09): : 2352 - 2360
[32] Cooperative behavior of a heterogeneous robot team for planetary exploration using deep reinforcement learning
Barth, Andrew
Ma, Ou
ACTA ASTRONAUTICA, 2024, 214 : 689 - 700
[33] Robot multi-action cooperative grasping strategy based on deep reinforcement learning
He, Huiteng
Zhou, Yong
Hu, Kaixiong
Li, Weidong
Jisuanji Jicheng Zhizao Xitong/Computer Integrated Manufacturing Systems, CIMS, 2024, 30 (05): : 1789 - 1797
[34] Cooperative behavior of a heterogeneous robot team for planetary exploration using deep reinforcement learning
Barth, Andrew
Ma, Ou
Acta Astronautica, 2024, 214 : 689 - 700
[35] Cooperative Multi-Robot Hierarchical Reinforcement Learning
Setyawan, Gembong Edhi
Hartono, Pitoyo
Sawada, Hideyuki
INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2022, 13 (09) : 35 - 44
[36] Reinforcement learning on strategy selection for a cooperative robot system
Hwang, Kao-Shing
Chen, Yu-Jen
Lee, Ching-Huang
Wu, Cheng-Shong
2005 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND BIOMIMETICS, 2006, : 381 - +
[37] Deep Reinforcement Learning for Snake Robot Locomotion
Shi, Junyao
Dear, Tony
Kelly, Scott David
IFAC PAPERSONLINE, 2020, 53 (02): : 9688 - 9695
[38] Deep Reinforcement Learning for Humanoid Robot Dribbling
Muzio, Alexandre F., V
Maximo, Marcos R. O. A.
Yoneyama, Takashi
2020 XVIII LATIN AMERICAN ROBOTICS SYMPOSIUM, 2020 XII BRAZILIAN SYMPOSIUM ON ROBOTICS AND 2020 XI WORKSHOP OF ROBOTICS IN EDUCATION (LARS-SBR-WRE 2020), 2020, : 246 - 251
[39] Deep Reinforcement Learning for Humanoid Robot Behaviors
Muzio, Alexandre F. V.
Maximo, Marcos R. O. A.
Yoneyama, Takashi
Journal of Intelligent and Robotic Systems: Theory and Applications, 2022, 105 (01):
[40] Deep Reinforcement Learning for Mobile Robot Navigation
Gromniak, Martin
Stenzel, Jonas
2019 4TH ASIA-PACIFIC CONFERENCE ON INTELLIGENT ROBOT SYSTEMS (ACIRS 2019), 2019, : 68 - 73

← 1 2 3 4 5 →