Multi-UAV Path Planning and Following Based on Multi-Agent Reinforcement Learning

被引:1
|
作者
Zhao, Xiaoru [1 ]
Yang, Rennong [1 ]
Zhong, Liangsheng [2 ]
Hou, Zhiwei [2 ]
机构
[1] Air Force Engn Univ, Air Traff Control & Nav Sch, Xian 710051, Peoples R China
[2] Sun Yat Sen Univ, Sch Syst Sci & Engn, Guangzhou 510275, Peoples R China
关键词
path planning; path follow; deep reinforcement learning; multi-UAV; parameter share;
D O I
10.3390/drones8010018
中图分类号
TP7 [遥感技术];
学科分类号
081102 ; 0816 ; 081602 ; 083002 ; 1404 ;
摘要
Dedicated to meeting the growing demand for multi-agent collaboration in complex scenarios, this paper introduces a parameter-sharing off-policy multi-agent path planning and the following approach. Current multi-agent path planning predominantly relies on grid-based maps, whereas our proposed approach utilizes laser scan data as input, providing a closer simulation of real-world applications. In this approach, the unmanned aerial vehicle (UAV) uses the soft actor-critic (SAC) algorithm as a planner and trains its policy to converge. This policy enables end-to-end processing of laser scan data, guiding the UAV to avoid obstacles and reach the goal. At the same time, the planner incorporates paths generated by a sampling-based method as following points. The following points are continuously updated as the UAV progresses. Multi-UAV path planning tasks are facilitated, and policy convergence is accelerated through sharing experiences among agents. To address the challenge of UAVs that are initially stationary and overly cautious near the goal, a reward function is designed to encourage UAV movement. Additionally, a multi-UAV simulation environment is established to simulate real-world UAV scenarios to support training and validation of the proposed approach. The simulation results highlight the effectiveness of the presented approach in both the training process and task performance. The presented algorithm achieves an 80% success rate to guarantee that three UAVs reach the goal points.
引用
收藏
页数:18
相关论文
共 50 条
  • [1] Joint Optimization of Multi-UAV Target Assignment and Path Planning Based on Multi-Agent Reinforcement Learning
    Qie, Han
    Shi, Dianxi
    Shen, Tianlong
    Xu, Xinhai
    Li, Yuan
    Wang, Liujing
    [J]. IEEE ACCESS, 2019, 7 : 146264 - 146272
  • [2] An evolutionary multi-agent reinforcement learning algorithm for multi-UAV air combat
    Wang, Baolai
    Gao, Xianzhong
    Xie, Tao
    [J]. KNOWLEDGE-BASED SYSTEMS, 2024, 299
  • [3] Multi-Agent Deep Reinforcement Learning-Based Trajectory Planning for Multi-UAV Assisted Mobile Edge Computing
    Wang, Liang
    Wang, Kezhi
    Pan, Cunhua
    Xu, Wei
    Aslam, Nauman
    Hanzo, Lajos
    [J]. IEEE TRANSACTIONS ON COGNITIVE COMMUNICATIONS AND NETWORKING, 2021, 7 (01) : 73 - 84
  • [4] Multi-UAV Cooperative Searching and Tracking for Moving Targets Based on Multi-Agent Reinforcement Learning
    Su, Kai
    Qian, Feng
    [J]. APPLIED SCIENCES-BASEL, 2023, 13 (21):
  • [5] Multi-UAV Adaptive Path Planning Using Deep Reinforcement Learning
    Westheider, Jonas
    Rueckin, Julius
    Popovic, Marija
    [J]. 2023 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS, IROS, 2023, : 649 - 656
  • [6] Multi-Agent UAV Path Planning
    Marsh, L.
    Calbert, G.
    Tu, J.
    Gossink, D.
    Kwok, H.
    [J]. MODSIM 2005: INTERNATIONAL CONGRESS ON MODELLING AND SIMULATION: ADVANCES AND APPLICATIONS FOR MANAGEMENT AND DECISION MAKING: ADVANCES AND APPLICATIONS FOR MANAGEMENT AND DECISION MAKING, 2005, : 2188 - 2194
  • [7] Multi-Agent Deep Reinforcement Learning for Full-Duplex Multi-UAV Networks
    Dai, Chen
    Zhu, Kun
    Hossain, Ekram
    [J]. 2022 IEEE WIRELESS COMMUNICATIONS AND NETWORKING CONFERENCE (WCNC), 2022, : 2232 - 2237
  • [8] Multi-UAV Mobile Edge Computing and Path Planning Platform Based on Reinforcement Learning
    Chang, Huan
    Chen, Yicheng
    Zhang, Baochang
    Doermann, David
    [J]. IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTATIONAL INTELLIGENCE, 2022, 6 (03): : 489 - 498
  • [9] Multi-agent reinforcement learning based transmission scheme for IRS-assisted multi-UAV systems
    Mei, Yumo
    Liu, Chen
    Song, Yunchao
    Wang, Ge
    Liang, Huibin
    [J]. IET COMMUNICATIONS, 2023, 17 (17) : 2019 - 2029
  • [10] Multi-agent Coverage Path Planning Based on Security Reinforcement Learning
    Li, Song
    Ma, Zhuangzhuang
    Zhang, Yunlin
    Shao, Jinliang
    [J]. Binggong Xuebao/Acta Armamentarii, 2023, 44 : 101 - 113