Learning to Navigate in Turbulent Flows With Aerial Robot Swarms: A Cooperative Deep Reinforcement Learning Approach

被引:0
|
作者
Patino, Diego [1 ]
Mayya, Siddharth [2 ]
Calderon, Juan [3 ,4 ]
Daniilidis, Kostas [1 ]
Saldana, David [5 ]
机构
[1] Univ Penn, GRASP Lab, Philadelphia, PA 19104 USA
[2] Amazon Robot, Cambridge, MA 02141 USA
[3] Univ St Tomas, Bogota 110231, Colombia
[4] Bethune Cookman Univ, Daytona Beach, FL 32114 USA
[5] Lehigh Univ, Autonomous & Intelligent Robot Lab AIRLab, Bethlehem, PA 18015 USA
关键词
Robots; Robot kinematics; Robot sensing systems; Wind; Navigation; Force; Drag; Swarm robotics; reinforcement learning; wind turbulence; machine learning for robot control; graph neural networks; NEURAL-NETWORKS; FIELDS;
D O I
10.1109/LRA.2023.3280806
中图分类号
TP24 [机器人技术];
学科分类号
080202 ; 1405 ;
摘要
Aerial operation in turbulent environments is a challenging problem due to the chaotic behavior of the flow. This problem is made even more complex when a team of aerial robots is trying to achieve coordinated motion in turbulent wind conditions. In this letter, we present a novel multi-robot controller to navigate in turbulent flows, decoupling the trajectory-tracking control from the turbulence compensation via a nested control architecture. Unlike previous works, our method does not learn to compensate for the air-flow at a specific time and space. Instead, our method learns to compensate for the flow based on its effect on the team. This is made possible via a deep reinforcement learning approach, implemented via a Graph Convolutional Neural Network (GCNN)-based architecture, which enables robots to achieve better wind compensation by processing the spatial-temporal correlation of wind flows across the team. Our approach scales well to large robot teams -as each robot only uses information from its nearest neighbors-, and generalizes well to robot teams larger than seen in training. Simulated experiments demonstrate how information sharing improves turbulence compensation in a team of aerial robots and demonstrate the flexibility of our method over different team configurations.
引用
收藏
页码:4219 / 4226
页数:8
相关论文
共 50 条
  • [31] Cooperative behavior of a heterogeneous robot team for planetary exploration using deep reinforcement learning
    Barth, Andrew
    Ma, Ou
    [J]. ACTA ASTRONAUTICA, 2024, 214 : 689 - 700
  • [32] Robot multi-action cooperative grasping strategy based on deep reinforcement learning
    He, Huiteng
    Zhou, Yong
    Hu, Kaixiong
    Li, Weidong
    [J]. Jisuanji Jicheng Zhizao Xitong/Computer Integrated Manufacturing Systems, CIMS, 2024, 30 (05): : 1789 - 1797
  • [33] Cooperative behavior of a heterogeneous robot team for planetary exploration using deep reinforcement learning
    Barth, Andrew
    Ma, Ou
    [J]. Acta Astronautica, 2024, 214 : 689 - 700
  • [34] Cooperative Multi-Robot Hierarchical Reinforcement Learning
    Setyawan, Gembong Edhi
    Hartono, Pitoyo
    Sawada, Hideyuki
    [J]. INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2022, 13 (09) : 35 - 44
  • [35] Reinforcement learning on strategy selection for a cooperative robot system
    Hwang, Kao-Shing
    Chen, Yu-Jen
    Lee, Ching-Huang
    Wu, Cheng-Shong
    [J]. 2005 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND BIOMIMETICS, 2006, : 381 - +
  • [36] Deep Reinforcement Learning for Humanoid Robot Dribbling
    Muzio, Alexandre F., V
    Maximo, Marcos R. O. A.
    Yoneyama, Takashi
    [J]. 2020 XVIII LATIN AMERICAN ROBOTICS SYMPOSIUM, 2020 XII BRAZILIAN SYMPOSIUM ON ROBOTICS AND 2020 XI WORKSHOP OF ROBOTICS IN EDUCATION (LARS-SBR-WRE 2020), 2020, : 246 - 251
  • [37] Deep Reinforcement Learning for Snake Robot Locomotion
    Shi, Junyao
    Dear, Tony
    Kelly, Scott David
    [J]. IFAC PAPERSONLINE, 2020, 53 (02): : 9688 - 9695
  • [38] Deep Reinforcement Learning for Humanoid Robot Behaviors
    Muzio, Alexandre F. V.
    Maximo, Marcos R. O. A.
    Yoneyama, Takashi
    [J]. Journal of Intelligent and Robotic Systems: Theory and Applications, 2022, 105 (01):
  • [39] Review of Deep Reinforcement Learning for Robot Manipulation
    Hai Nguyen
    Hung Manh La
    [J]. 2019 THIRD IEEE INTERNATIONAL CONFERENCE ON ROBOTIC COMPUTING (IRC 2019), 2019, : 590 - 595
  • [40] Deep Reinforcement Learning for Humanoid Robot Behaviors
    Muzio, Alexandre F. V.
    Maximo, Marcos R. O. A.
    Yoneyama, Takashi
    [J]. JOURNAL OF INTELLIGENT & ROBOTIC SYSTEMS, 2022, 105 (01)