Energy Constrained Multi-Agent Reinforcement Learning for Coverage Path Planning

被引：0

作者：

Zhao, Chenyang ^{[1
]}

Liu, Juan ^{[2
]}

Yoon, Suk-Un ^{[3
]}

Li, Xinde ^{[1
,4
]}

Li, Heqing ^{[1
]}

Zhang, Zhentong ^{[1
,4
]}

机构：

[1] Southeast Univ, Nanjing 210096, Peoples R China

[2] Samsung Elect China R&D Ctr, Nanjing 210012, Peoples R China

[3] Samsung Elect, Suwon 16677, Gyeonggi Do, South Korea

[4] Nanjing Ctr Appl Math, Nanjing 211135, Peoples R China

来源：

2023 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS) | 2023年

基金：

中国国家自然科学基金;

关键词：

NAVIGATION;

D O I：

10.1109/IROS55552.2023.10341412

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

For multi-agent area coverage path planning problem, existing researches regard it as a combination of Traveling Salesman Problem (TSP) and Coverage Path Planning (CPP). However, these approaches have disadvantages of poor observation ability in online phase and high computational cost in offline phase, making it difficult to be applied to energy-constrained Unmanned Aerial Vehicles (UAVs) and adjust strategy dynamically. In this paper, we decompose the task into two sub-problems: multi-agent path planning and sub-region CPP. We model the multi-agent path planning problem as a Collective Markov Decision Process (C-MDP), and design an Energy Constrained Multi-Agent Reinforcement Learning (ECMARL) algorithm based on the centralized training and distributed execution concept. Taking into account energy constraint of UAVs, the UAV propulsion power model is established to measure the energy consumption of UAVs, and load balancing strategy is applied to dynamically allocate target areas for each UAV. If the UAV is under energy-depleted situation, ECMARL can adjust the mission strategy in real time according to environmental information and energy storage conditions of other UAVs. When UAVs reach each sub-region of interest, Back-an-Forth Paths (BFPs) are adopted to solve CPP problem, which can ensure full coverage, optimality and complexity of the sub-problem. Comprehensive theoretical analysis and experiments demonstrate that ECMARL is superior to the traditional offline TSP-CPP strategy in terms of solution quality and computational time, and can effectively deal with the energy-constrained UAVs.

引用

页码：5590 / 5597

页数：8

共 50 条

[1] Multi-agent Coverage Path Planning Based on Security Reinforcement Learning
Li S.
Ma Z.
Zhang Y.
Shao J.
Binggong Xuebao/Acta Armamentarii, 2023, 44 : 101 - 113
[2] Deep Reinforcement Learning for Image-Based Multi-Agent Coverage Path Planning
Xu, Meng
She, Yechao
Jin, Yang
Wang, Jianping
2023 IEEE 98TH VEHICULAR TECHNOLOGY CONFERENCE, VTC2023-FALL, 2023,
[3] Constrained Multi-agent Path Planning Problem
Maktabifard, Ali
Foldes, David
Bak, Bendeguz Dezso
COMPUTATIONAL LOGISTICS, ICCL 2023, 2023, 14239 : 450 - 466
[4] Exploration of Multi-Agent Reinforcement Learning for ISR Flight Path Planning
Xie, Lynphone Mark
Conway, Emily
Cheng, Huaining
Amsaad, Fathi
IEEE NATIONAL AEROSPACE AND ELECTRONICS CONFERENCE, NAECON 2024, 2024, : 328 - 333
[5] Attention-Cooperated Reinforcement Learning for Multi-agent Path Planning
Ma, Jinchao
Lian, Defu
DATABASE SYSTEMS FOR ADVANCED APPLICATIONS. DASFAA 2022 INTERNATIONAL WORKSHOPS, 2022, 13248 : 272 - 290
[6] Multi-UAV Path Planning and Following Based on Multi-Agent Reinforcement Learning
Zhao, Xiaoru
Yang, Rennong
Zhong, Liangsheng
Hou, Zhiwei
DRONES, 2024, 8 (01)
[7] Multi-Objective Dynamic Path Planning with Multi-Agent Deep Reinforcement Learning
Tao, Mengxue
Li, Qiang
Yu, Junxi
JOURNAL OF MARINE SCIENCE AND ENGINEERING, 2025, 13 (01)
[8] Research on Path-planning of Manipulator based on Multi-agent Reinforcement Learning
Tong, Liang
FRONTIERS OF MANUFACTURING AND DESIGN SCIENCE, PTS 1-4, 2011, 44-47 : 2116 - 2120
[9] Multi-Agent Path Planning in Unknown Environment with Reinforcement Learning and Neural Network
Luviano Cruz, David
Yu, Wen
2014 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN AND CYBERNETICS (SMC), 2014, : 3458 - 3463
[10] Planning and Learning in Multi-Agent Path Finding
K. S. Yakovlev
A. A. Andreychuk
A. A. Skrynnik
A. I. Panov
Doklady Mathematics, 2022, 106 : S79 - S84

← 1 2 3 4 5 →