Energy Constrained Multi-Agent Reinforcement Learning for Coverage Path Planning

被引:0
|
作者
Zhao, Chenyang [1 ]
Liu, Juan [2 ]
Yoon, Suk-Un [3 ]
Li, Xinde [1 ,4 ]
Li, Heqing [1 ]
Zhang, Zhentong [1 ,4 ]
机构
[1] Southeast Univ, Nanjing 210096, Peoples R China
[2] Samsung Elect China R&D Ctr, Nanjing 210012, Peoples R China
[3] Samsung Elect, Suwon 16677, Gyeonggi Do, South Korea
[4] Nanjing Ctr Appl Math, Nanjing 211135, Peoples R China
基金
中国国家自然科学基金;
关键词
NAVIGATION;
D O I
10.1109/IROS55552.2023.10341412
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
For multi-agent area coverage path planning problem, existing researches regard it as a combination of Traveling Salesman Problem (TSP) and Coverage Path Planning (CPP). However, these approaches have disadvantages of poor observation ability in online phase and high computational cost in offline phase, making it difficult to be applied to energy-constrained Unmanned Aerial Vehicles (UAVs) and adjust strategy dynamically. In this paper, we decompose the task into two sub-problems: multi-agent path planning and sub-region CPP. We model the multi-agent path planning problem as a Collective Markov Decision Process (C-MDP), and design an Energy Constrained Multi-Agent Reinforcement Learning (ECMARL) algorithm based on the centralized training and distributed execution concept. Taking into account energy constraint of UAVs, the UAV propulsion power model is established to measure the energy consumption of UAVs, and load balancing strategy is applied to dynamically allocate target areas for each UAV. If the UAV is under energy-depleted situation, ECMARL can adjust the mission strategy in real time according to environmental information and energy storage conditions of other UAVs. When UAVs reach each sub-region of interest, Back-an-Forth Paths (BFPs) are adopted to solve CPP problem, which can ensure full coverage, optimality and complexity of the sub-problem. Comprehensive theoretical analysis and experiments demonstrate that ECMARL is superior to the traditional offline TSP-CPP strategy in terms of solution quality and computational time, and can effectively deal with the energy-constrained UAVs.
引用
收藏
页码:5590 / 5597
页数:8
相关论文
共 50 条
  • [1] Multi-agent Coverage Path Planning Based on Security Reinforcement Learning
    Li, Song
    Ma, Zhuangzhuang
    Zhang, Yunlin
    Shao, Jinliang
    [J]. Binggong Xuebao/Acta Armamentarii, 2023, 44 : 101 - 113
  • [2] Deep Reinforcement Learning for Image-Based Multi-Agent Coverage Path Planning
    Xu, Meng
    She, Yechao
    Jin, Yang
    Wang, Jianping
    [J]. 2023 IEEE 98TH VEHICULAR TECHNOLOGY CONFERENCE, VTC2023-FALL, 2023,
  • [3] Constrained Multi-agent Path Planning Problem
    Maktabifard, Ali
    Foldes, David
    Bak, Bendeguz Dezso
    [J]. COMPUTATIONAL LOGISTICS, ICCL 2023, 2023, 14239 : 450 - 466
  • [4] Attention-Cooperated Reinforcement Learning for Multi-agent Path Planning
    Ma, Jinchao
    Lian, Defu
    [J]. DATABASE SYSTEMS FOR ADVANCED APPLICATIONS. DASFAA 2022 INTERNATIONAL WORKSHOPS, 2022, 13248 : 272 - 290
  • [5] Multi-UAV Path Planning and Following Based on Multi-Agent Reinforcement Learning
    Zhao, Xiaoru
    Yang, Rennong
    Zhong, Liangsheng
    Hou, Zhiwei
    [J]. DRONES, 2024, 8 (01)
  • [6] Research on Path-planning of Manipulator based on Multi-agent Reinforcement Learning
    Tong, Liang
    [J]. FRONTIERS OF MANUFACTURING AND DESIGN SCIENCE, PTS 1-4, 2011, 44-47 : 2116 - 2120
  • [7] Multi-Agent Path Planning in Unknown Environment with Reinforcement Learning and Neural Network
    Luviano Cruz, David
    Yu, Wen
    [J]. 2014 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN AND CYBERNETICS (SMC), 2014, : 3458 - 3463
  • [8] Planning and Learning in Multi-Agent Path Finding
    K. S. Yakovlev
    A. A. Andreychuk
    A. A. Skrynnik
    A. I. Panov
    [J]. Doklady Mathematics, 2022, 106 : S79 - S84
  • [9] Planning and Learning in Multi-Agent Path Finding
    Yakovlev, K. S.
    Andreychuk, A. A.
    Skrynnik, A. A.
    Panov, A. I.
    [J]. DOKLADY MATHEMATICS, 2022, 106 (SUPPL 1) : S79 - S84
  • [10] Conflict-constrained Multi-agent Reinforcement Learning Method for Parking Trajectory Planning
    Chen, Siyuan
    Wang, Meiling
    Yang, Yi
    Song, Wenjie
    [J]. 2023 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2023), 2023, : 9421 - 9427