Multi-UAV Path Planning and Following Based on Multi-Agent Reinforcement Learning

被引:1
|
作者
Zhao, Xiaoru [1 ]
Yang, Rennong [1 ]
Zhong, Liangsheng [2 ]
Hou, Zhiwei [2 ]
机构
[1] Air Force Engn Univ, Air Traff Control & Nav Sch, Xian 710051, Peoples R China
[2] Sun Yat Sen Univ, Sch Syst Sci & Engn, Guangzhou 510275, Peoples R China
关键词
path planning; path follow; deep reinforcement learning; multi-UAV; parameter share;
D O I
10.3390/drones8010018
中图分类号
TP7 [遥感技术];
学科分类号
081102 ; 0816 ; 081602 ; 083002 ; 1404 ;
摘要
Dedicated to meeting the growing demand for multi-agent collaboration in complex scenarios, this paper introduces a parameter-sharing off-policy multi-agent path planning and the following approach. Current multi-agent path planning predominantly relies on grid-based maps, whereas our proposed approach utilizes laser scan data as input, providing a closer simulation of real-world applications. In this approach, the unmanned aerial vehicle (UAV) uses the soft actor-critic (SAC) algorithm as a planner and trains its policy to converge. This policy enables end-to-end processing of laser scan data, guiding the UAV to avoid obstacles and reach the goal. At the same time, the planner incorporates paths generated by a sampling-based method as following points. The following points are continuously updated as the UAV progresses. Multi-UAV path planning tasks are facilitated, and policy convergence is accelerated through sharing experiences among agents. To address the challenge of UAVs that are initially stationary and overly cautious near the goal, a reward function is designed to encourage UAV movement. Additionally, a multi-UAV simulation environment is established to simulate real-world UAV scenarios to support training and validation of the proposed approach. The simulation results highlight the effectiveness of the presented approach in both the training process and task performance. The presented algorithm achieves an 80% success rate to guarantee that three UAVs reach the goal points.
引用
收藏
页数:18
相关论文
共 50 条
  • [41] Multi-UAV Cooperative Path Planning Based on Aquila Optimizer
    Huang, Hanqiao
    Li, Haoran
    Wang, Meng
    Wu, Yongliang
    He, Xiang
    [J]. PROCEEDINGS OF 2022 INTERNATIONAL CONFERENCE ON AUTONOMOUS UNMANNED SYSTEMS, ICAUS 2022, 2023, 1010 : 2005 - 2014
  • [42] Collaborative Path Planning of Multiple Carrier-based Aircraft Based on Multi-agent Reinforcement Learning
    Shang, Zhihao
    Mao, Zhiqiang
    Zhang, Huachao
    Xu, Mingliang
    [J]. 2022 23RD IEEE INTERNATIONAL CONFERENCE ON MOBILE DATA MANAGEMENT (MDM 2022), 2022, : 512 - 517
  • [43] Multi-UAV Autonomous Path Planning in Reconnaissance Missions Considering Incomplete Information: A Reinforcement Learning Method
    Chen, Yu
    Dong, Qi
    Shang, Xiaozhou
    Wu, Zhenyu
    Wang, Jinyu
    [J]. DRONES, 2023, 7 (01)
  • [44] UAV Path Planning Based on Multi-Layer Reinforcement Learning Technique
    Cui, Zhengyang
    Wang, Yong
    [J]. IEEE ACCESS, 2021, 9 : 59486 - 59497
  • [45] Multi-UAV simultaneous target assignment and path planning based on deep reinforcement learning in dynamic multiple obstacles environments
    Kong, Xiaoran
    Zhou, Yatong
    Li, Zhe
    Wang, Shaohai
    [J]. FRONTIERS IN NEUROROBOTICS, 2024, 17
  • [46] Multi-Agent Deep Reinforcement Learning for Secure UAV Communications
    Zhang, Yu
    Zhuang, Zirui
    Gao, Feifei
    Wang, Jingyu
    Han, Zhu
    [J]. 2020 IEEE WIRELESS COMMUNICATIONS AND NETWORKING CONFERENCE (WCNC), 2020,
  • [47] Multi-agent reinforcement learning as a rehearsal for decentralized planning
    Kraemer, Landon
    Banerjee, Bikramjit
    [J]. NEUROCOMPUTING, 2016, 190 : 82 - 94
  • [48] Multi-agent Reinforcement Learning for Urban Projects Planning
    Khelifa, Boudjemaa
    Laouar, Mohamed Ridda
    [J]. PROCEEDINGS OF THE 7TH INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING AND NEW TECHNOLOGIES (ICSENT '18), 2018,
  • [49] Trajectory planning of space manipulator based on multi-agent reinforcement learning
    Zhao, Yu
    Guan, Gongshun
    Guo, Jifeng
    Yu, Xiaoqiang
    Yan, Peng
    [J]. Hangkong Xuebao/Acta Aeronautica et Astronautica Sinica, 2021, 42 (01):
  • [50] Multi-Agent Reinforcement Learning-Based Resource Allocation for UAV Networks
    Cui, Jingjing
    Liu, Yuanwei
    Nallanathan, Arumugam
    [J]. IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2020, 19 (02) : 729 - 743