Multi-UAV Path Planning and Following Based on Multi-Agent Reinforcement Learning

Cited by: 1

Authors:

Zhao, Xiaoru [1]

Yang, Rennong [1]

Zhong, Liangsheng [2]

Hou, Zhiwei [2]

Affiliations:

[1] Air Force Engn Univ, Air Traff Control & Nav Sch, Xian 710051, Peoples R China

[2] Sun Yat Sen Univ, Sch Syst Sci & Engn, Guangzhou 510275, Peoples R China

Source:

DRONES | 2024 / Vol. 8 / Issue 01

Keywords:

path planning; path follow; deep reinforcement learning; multi-UAV; parameter share;

DOI:

10.3390/drones8010018

Chinese Library Classification (CLC) number:

TP7 [Remote sensing technology];

Subject classification codes:

081102 ; 0816 ; 081602 ; 083002 ; 1404 ;

Abstract:

To meet the growing demand for multi-agent collaboration in complex scenarios, this paper introduces a parameter-sharing, off-policy multi-agent approach to path planning and following. Current multi-agent path planning predominantly relies on grid-based maps, whereas the proposed approach takes laser scan data as input, which more closely resembles real-world applications. In this approach, each unmanned aerial vehicle (UAV) uses the soft actor-critic (SAC) algorithm as a planner and trains its policy to convergence. This policy processes laser scan data end to end, guiding the UAV to avoid obstacles and reach the goal. At the same time, the planner incorporates paths generated by a sampling-based method as following points, which are continuously updated as the UAV progresses. Sharing experiences among agents facilitates multi-UAV path planning tasks and accelerates policy convergence. To address the problem of UAVs that are initially stationary and overly cautious near the goal, a reward function is designed to encourage UAV movement. Additionally, a multi-UAV simulation environment is established to simulate real-world UAV scenarios and to support training and validation of the proposed approach. The simulation results highlight the effectiveness of the presented approach in both the training process and task performance. The presented algorithm achieves an 80% success rate in guiding three UAVs to their goal points.
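
Two ideas from the abstract, experience sharing among agents and following points that are updated as the UAV progresses, can be illustrated with a minimal sketch. This is not the authors' implementation: the names `SharedReplayBuffer`, `advance_following_point`, and `reach_radius` are hypothetical, and the rule of advancing to the next waypoint once the UAV is within a fixed radius is an assumption made only for illustration.

```python
import math
import random
from collections import deque


class SharedReplayBuffer:
    """Hypothetical buffer that every agent writes to and samples from.

    With parameter sharing, a single off-policy learner (SAC in the paper)
    could draw its training batches from this common pool of transitions.
    """

    def __init__(self, capacity):
        # Oldest transitions are discarded once capacity is reached.
        self.storage = deque(maxlen=capacity)

    def add(self, obs, action, reward, next_obs, done):
        self.storage.append((obs, action, reward, next_obs, done))

    def sample(self, batch_size, rng=random):
        # Sample without replacement, capped at the number stored.
        k = min(batch_size, len(self.storage))
        return rng.sample(list(self.storage), k)

    def __len__(self):
        return len(self.storage)


def advance_following_point(position, path, index, reach_radius):
    """Return the index of the waypoint the UAV should follow next.

    `path` is a list of waypoints, e.g. produced by a sampling-based
    planner. The index moves forward while the UAV is within
    `reach_radius` of the current waypoint (assumed rule), and it never
    moves past the final waypoint, which is the goal.
    """
    while index < len(path) - 1 and math.dist(position, path[index]) <= reach_radius:
        index += 1
    return index
```

In a training loop built on this sketch, each UAV would call `add` on the same buffer after every step, and the following-point index would be refreshed from the UAV's current position before the next observation is assembled.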


Pages: 18

50 entries in total

[41] Multi-UAV Cooperative Path Planning Based on Aquila Optimizer
Huang, Hanqiao
Li, Haoran
Wang, Meng
Wu, Yongliang
He, Xiang
[J]. PROCEEDINGS OF 2022 INTERNATIONAL CONFERENCE ON AUTONOMOUS UNMANNED SYSTEMS, ICAUS 2022, 2023, 1010 : 2005 - 2014
[42] Collaborative Path Planning of Multiple Carrier-based Aircraft Based on Multi-agent Reinforcement Learning
Shang, Zhihao
Mao, Zhiqiang
Zhang, Huachao
Xu, Mingliang
[J]. 2022 23RD IEEE INTERNATIONAL CONFERENCE ON MOBILE DATA MANAGEMENT (MDM 2022), 2022, : 512 - 517
[43] Multi-UAV Autonomous Path Planning in Reconnaissance Missions Considering Incomplete Information: A Reinforcement Learning Method
Chen, Yu
Dong, Qi
Shang, Xiaozhou
Wu, Zhenyu
Wang, Jinyu
[J]. DRONES, 2023, 7 (01)
[44] UAV Path Planning Based on Multi-Layer Reinforcement Learning Technique
Cui, Zhengyang
Wang, Yong
[J]. IEEE ACCESS, 2021, 9 : 59486 - 59497
[45] Multi-UAV simultaneous target assignment and path planning based on deep reinforcement learning in dynamic multiple obstacles environments
Kong, Xiaoran
Zhou, Yatong
Li, Zhe
Wang, Shaohai
[J]. FRONTIERS IN NEUROROBOTICS, 2024, 17
[46] Multi-Agent Deep Reinforcement Learning for Secure UAV Communications
Zhang, Yu
Zhuang, Zirui
Gao, Feifei
Wang, Jingyu
Han, Zhu
[J]. 2020 IEEE WIRELESS COMMUNICATIONS AND NETWORKING CONFERENCE (WCNC), 2020,
[47] Multi-agent reinforcement learning as a rehearsal for decentralized planning
Kraemer, Landon
Banerjee, Bikramjit
[J]. NEUROCOMPUTING, 2016, 190 : 82 - 94
[48] Multi-agent Reinforcement Learning for Urban Projects Planning
Khelifa, Boudjemaa
Laouar, Mohamed Ridda
[J]. PROCEEDINGS OF THE 7TH INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING AND NEW TECHNOLOGIES (ICSENT '18), 2018,
[49] Trajectory planning of space manipulator based on multi-agent reinforcement learning
Zhao, Yu
Guan, Gongshun
Guo, Jifeng
Yu, Xiaoqiang
Yan, Peng
[J]. Hangkong Xuebao/Acta Aeronautica et Astronautica Sinica, 2021, 42 (01):
[50] Multi-Agent Reinforcement Learning-Based Resource Allocation for UAV Networks
Cui, Jingjing
Liu, Yuanwei
Nallanathan, Arumugam
[J]. IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2020, 19 (02) : 729 - 743
