Deep Reinforcement Learning Approach for UAV Search Path Planning In Discrete Time and Space

被引：0

作者：

Benalaya, Najoua ^{[1
]}

Amdouni, Ichrak ^{[1
,2
]}

Adjih, Cedric ^{[3
]}

Laouiti, Anis ^{[2
]}

Saidane, Leila Azouz ^{[1
]}

机构：

[1] Univ Manouba, ENSI, Manouba, Tunisia

[2] Telecom SudParis, Paris, France

[3] INRIA Saclay, Palaiseau, France

来源：

20TH INTERNATIONAL WIRELESS COMMUNICATIONS & MOBILE COMPUTING CONFERENCE, IWCMC 2024 | 2024年

关键词：

Deep Reinforcement Learning; PPO; UAVs; Search Path Planning; Reward Design; Optuna; Hyperparameters Search;

D O I：

10.1109/IWCMC61514.2024.10592510

中图分类号：

TP301 [理论、方法];

学科分类号：

081202 ;

摘要：

Path planning for search missions carried out by Unmanned Aerial Vehicles (UAVs) is a challenging problem. This is due to UAV limited energy budget and the importance of time for search operations. The objective of this study is to come up with an approach to minimize the total search time required to locate a specific target. To achieve this, we deployed a deep reinforcement learning (DRL) model based on the Proximal Policy Optimization (PPO) algorithm to solve the combinatorial optimization problem of UAV search path planning within a minimized search time. A smart reward formulation is designed to achieve the learning goal, fulfill the search requirement, and encourage the agent to select search paths that minimize search time. In addition, we employed Optuna hyperparameter optimization framework to systematically select optimal parameters for the PPO model. Most importantly, thanks to the state representation we considered, the model is generalized and adaptable to various search environments. The PPO model succeeds to compute an accurate search path to be followed by the UAV searcher. Results of the model are compared with results previously obtained with a linear program. We found that the PPO achieves almost the same expected search time, which proves the great relevance of the reward design and the hyperparameters selection we made.

引用

页码：1437 / 1442

页数：6

共 50 条

[1] UAV Path Planning for Wireless Data Harvesting: A Deep Reinforcement Learning Approach
Bayerlein, Harald
Theile, Mirco
Caccamo, Marco
Gesbert, David
2020 IEEE GLOBAL COMMUNICATIONS CONFERENCE (GLOBECOM), 2020,
[2] Reinforcement Learning Combined with Heuristic Search for Solving Discrete Space Path Planning Problems
Zhang, Xiuling
Kang, Xuenan
Wei, Kailun
Li, Jinxiang
Ma, Kai
PROCEEDINGS OF THE 33RD CHINESE CONTROL AND DECISION CONFERENCE (CCDC 2021), 2021, : 2142 - 2147
[3] Explainable Deep Reinforcement Learning for UAV autonomous path planning
He, Lei
Aouf, Nabil
Song, Bifeng
AEROSPACE SCIENCE AND TECHNOLOGY, 2021, 118
[4] A UAV Path Planning Method Based on Deep Reinforcement Learning
Li, Yibing
Zhang, Sitong
Ye, Fang
Jiang, Tao
Li, Yingsong
2020 IEEE USNC-CNC-URSI NORTH AMERICAN RADIO SCIENCE MEETING (JOINT WITH AP-S SYMPOSIUM), 2020, : 93 - 94
[5] UAV online path planning technology based on deep reinforcement learning
Fan, Jiaxuan
Wang, Zhenya
Ren, Jinlei
Lu, Ying
Liu, Yiheng
2020 CHINESE AUTOMATION CONGRESS (CAC 2020), 2020, : 5382 - 5386
[6] A Deep Reinforcement Learning Approach for UAV Path Planning Incorporating Vehicle Dynamics with Acceleration Control
Sabzekar, Sina
Samadzad, Mahdi
Mehditabrizi, Asal
Tak, Ala Nekouvaght
UNMANNED SYSTEMS, 2024, 12 (03) : 477 - 498
[7] Towards Real-Time Path Planning through Deep Reinforcement Learning for a UAV in Dynamic Environments
Yan, Chao
Xiang, Xiaojia
Wang, Chang
JOURNAL OF INTELLIGENT & ROBOTIC SYSTEMS, 2020, 98 (02) : 297 - 309
[8] Towards Real-Time Path Planning through Deep Reinforcement Learning for a UAV in Dynamic Environments
Chao Yan
Xiaojia Xiang
Chang Wang
Journal of Intelligent & Robotic Systems, 2020, 98 : 297 - 309
[9] Research on Path Planning of Agricultural UAV Based on Improved Deep Reinforcement Learning
Fu, Haitao
Li, Zheng
Zhang, Weijian
Feng, Yuxuan
Zhu, Li
Fang, Xu
Li, Jian
AGRONOMY-BASEL, 2024, 14 (11):
[10] Path Planning for UAV Ground Target Tracking via Deep Reinforcement Learning
Li, Bohao
Wu, Yunjie
IEEE ACCESS, 2020, 8 (29064-29074) : 29064 - 29074

← 1 2 3 4 5 →