Learning to Navigate in Turbulent Flows With Aerial Robot Swarms: A Cooperative Deep Reinforcement Learning Approach

被引:1
|
作者
Patino, Diego [1 ]
Mayya, Siddharth [2 ]
Calderon, Juan [3 ,4 ]
Daniilidis, Kostas [1 ]
Saldana, David [5 ]
机构
[1] Univ Penn, GRASP Lab, Philadelphia, PA 19104 USA
[2] Amazon Robot, Cambridge, MA 02141 USA
[3] Univ St Tomas, Bogota 110231, Colombia
[4] Bethune Cookman Univ, Daytona Beach, FL 32114 USA
[5] Lehigh Univ, Autonomous & Intelligent Robot Lab AIRLab, Bethlehem, PA 18015 USA
关键词
Robots; Robot kinematics; Robot sensing systems; Wind; Navigation; Force; Drag; Swarm robotics; reinforcement learning; wind turbulence; machine learning for robot control; graph neural networks; NEURAL-NETWORKS; FIELDS;
D O I
10.1109/LRA.2023.3280806
中图分类号
TP24 [机器人技术];
学科分类号
080202 ; 1405 ;
摘要
Aerial operation in turbulent environments is a challenging problem due to the chaotic behavior of the flow. This problem is made even more complex when a team of aerial robots is trying to achieve coordinated motion in turbulent wind conditions. In this letter, we present a novel multi-robot controller to navigate in turbulent flows, decoupling the trajectory-tracking control from the turbulence compensation via a nested control architecture. Unlike previous works, our method does not learn to compensate for the air-flow at a specific time and space. Instead, our method learns to compensate for the flow based on its effect on the team. This is made possible via a deep reinforcement learning approach, implemented via a Graph Convolutional Neural Network (GCNN)-based architecture, which enables robots to achieve better wind compensation by processing the spatial-temporal correlation of wind flows across the team. Our approach scales well to large robot teams -as each robot only uses information from its nearest neighbors-, and generalizes well to robot teams larger than seen in training. Simulated experiments demonstrate how information sharing improves turbulence compensation in a team of aerial robots and demonstrate the flexibility of our method over different team configurations.
引用
收藏
页码:4219 / 4226
页数:8
相关论文
共 50 条
  • [21] Learning to Navigate Through Complex Dynamic Environment With Modular Deep Reinforcement Learning
    Wang, Yuanda
    He, Haibo
    Sun, Changyin
    IEEE TRANSACTIONS ON GAMES, 2018, 10 (04) : 400 - 412
  • [22] A Multiagent Deep Reinforcement Learning Approach for Multi-UAV Cooperative Search in Multilayered Aerial Computing Networks
    Wu, Jiaqi
    Luo, Jingjing
    Jiang, Changkun
    Gao, Lin
    IEEE INTERNET OF THINGS JOURNAL, 2025, 12 (05): : 5807 - 5821
  • [23] Cooperative Multi-Robot Navigation in Dynamic Environment with Deep Reinforcement Learning
    Han, Ruihua
    Chen, Shengduo
    Hao, Qi
    2020 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2020, : 448 - 454
  • [24] Learning Cooperative Visual Dialog Agents with Deep Reinforcement Learning
    Das, Abhishek
    Kottur, Satwik
    Moura, Jose M. F.
    Lee, Stefan
    Batra, Dhruv
    2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, : 2970 - 2979
  • [25] Deep learning and reinforcement learning approach on microgrid
    Chandrasekaran, Kumar
    Kandasamy, Prabaakaran
    Ramanathan, Srividhya
    INTERNATIONAL TRANSACTIONS ON ELECTRICAL ENERGY SYSTEMS, 2020, 30 (10):
  • [26] Learning MAC Protocols in HetNets: A Cooperative Multi-Agent Deep Reinforcement Learning Approach
    Naeem, Faisal
    Adam, Nadir
    Kaddoum, Georges
    Waqar, Omer
    2024 IEEE WIRELESS COMMUNICATIONS AND NETWORKING CONFERENCE, WCNC 2024, 2024,
  • [27] An Improved Deep Reinforcement Learning Approach for Medical Delivery Robot Navigation
    Wang, Minghui
    Zeng, Bi
    Zhao, Rui
    BASIC & CLINICAL PHARMACOLOGY & TOXICOLOGY, 2020, 127 : 23 - 23
  • [28] Task Assignment of UAV Swarms Based on Deep Reinforcement Learning
    Liu, Bo
    Wang, Shulei
    Li, Qinghua
    Zhao, Xinyang
    Pan, Yunqing
    Wang, Changhong
    DRONES, 2023, 7 (05)
  • [29] Deep Decentralized Reinforcement Learning for Cooperative Control
    Koepf, Florian
    Tesfazgi, Samuel
    Flad, Michael
    Hohmann, Soeren
    IFAC PAPERSONLINE, 2020, 53 (02): : 1555 - 1562
  • [30] Reinforcement learning in swarms that learn
    Peters, JF
    Henry, C
    Ramanna, S
    2005 IEEE/WIC/ACM INTERNATIONAL CONFERENCE ON INTELLIGENT AGENT TECHNOLOGY, PROCEEDINGS, 2005, : 400 - 406