Air-M: A Visual Reality Many-agent Reinforcement Learning Platform for Large-Scale Aerial Unmanned System

被引:0
|
作者
Lou, Jiabin [1 ]
Wu, Wenjun [1 ]
Liao, Shuhao [1 ]
Shi, Rongye [1 ]
机构
[1] Beihang Univ, Xueyuan Rd 37, Beijing 100191, Peoples R China
基金
国家重点研发计划;
关键词
D O I
10.1109/IROS55552.2023.10341405
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Reinforcement learning for swarms of flying robots is a challenging task that requires a large number of data samples. Moreover, the problem of sim-to-real transfer has long been a challenge in robotics algorithm deployment. To address these issues, we propose Air-M, a platform that facilitates large-scale drone swarm learning in a distributed docker container environment and deployment in a virtual reality setting. Air-M trains the policy network using physics engines and creates replicas of agents in docker containers, which helps amortize the computational cost. In addition, Air-M establishes an intermediate link between the simulation and the real world, allowing real drones to interact with virtual objects via virtual sensors. This enables the policy network to be trained using virtual agents and seamlessly transferred to real drones. Air-M is highly scalable, accommodating hundreds of agents with dynamic models and virtual sensors. We evaluate the effectiveness of our approach by conducting experiments in three representative virtual scenarios with an increasing number of agents. Our results demonstrate that our method outperforms the state-of-the-art in terms of training efficiency and transferability, making it a promising platform for swarm robotics applications.
引用
收藏
页码:5598 / 5605
页数:8
相关论文
共 50 条
  • [1] MAgent: A Many-Agent Reinforcement Learning Platform for Artificial Collective Intelligence
    Zheng, Lianmin
    Yang, Jiacheng
    Cai, Han
    Zhang, Weinan
    Wang, Jun
    Yu, Yong
    [J]. THIRTY-SECOND AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTIETH INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / EIGHTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2018, : 8222 - 8223
  • [2] Unmanned Aerial Vehicle Path Planning Algorithm Based on Deep Reinforcement Learning in Large-Scale and Dynamic Environments
    Xie, Ronglei
    Meng, Zhijun
    Wang, Lifeng
    Li, Haochen
    Wang, Kaipeng
    Wu, Zhe
    [J]. IEEE Access, 2021, 9 : 24884 - 24900
  • [3] Unmanned Aerial Vehicle Path Planning Algorithm Based on Deep Reinforcement Learning in Large-Scale and Dynamic Environments
    Xie, Ronglei
    Meng, Zhijun
    Wang, Lifeng
    Li, Haochen
    Wang, Kaipeng
    Wu, Zhe
    [J]. IEEE ACCESS, 2021, 9 : 24884 - 24900
  • [4] UAVs (Unmanned Aerial Vehicle system) for generation of digital large-scale orthophotos
    Cui, Hongxia
    Lin, Zongjian
    [J]. AD'07: Proceedings of Asia Display 2007, Vols 1 and 2, 2007, : 2009 - 2012
  • [5] Large-scale machinery monitoring system based on the visual reality
    Zhang, Yusi
    Ruan, Jun
    [J]. PROCEEDINGS OF 2018 IEEE 3RD ADVANCED INFORMATION TECHNOLOGY, ELECTRONIC AND AUTOMATION CONTROL CONFERENCE (IAEAC 2018), 2018, : 863 - 867
  • [6] Concentration Network for Reinforcement Learning of Large-Scale Multi-Agent Systems
    Fu, Qingxu
    Qiu, Tenghai
    Yi, Jianqiang
    Pu, Zhiqiang
    Wu, Shiguang
    [J]. THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / TWELVETH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, : 9341 - 9349
  • [7] Message Communication System Using Unmanned Aerial Vehicles under Large-Scale Disaster Environments
    Mase, Kenichi
    Okada, Hiraku
    [J]. 2015 IEEE 26TH ANNUAL INTERNATIONAL SYMPOSIUM ON PERSONAL, INDOOR, AND MOBILE RADIO COMMUNICATIONS (PIMRC), 2015, : 2171 - 2176
  • [8] Graph-based multi-agent reinforcement learning for large-scale UAVs swarm system control
    Zhao, Bocheng
    Huo, Mingying
    Li, Zheng
    Yu, Ze
    Qi, Naiming
    [J]. AEROSPACE SCIENCE AND TECHNOLOGY, 2024, 150
  • [9] Tactical reward shaping for large-scale combat by multi-agent reinforcement learning
    DUO Nanxun
    WANG Qinzhao
    LYU Qiang
    WANG Wei
    [J]. Journal of Systems Engineering and Electronics, 2024, 35 (06) - 1529
  • [10] Multi-Agent Deep Reinforcement Learning for Large-Scale Traffic Signal Control
    Chu, Tianshu
    Wang, Jie
    Codeca, Lara
    Li, Zhaojian
    [J]. IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2020, 21 (03) : 1086 - 1095