Deep Reinforcement Learning Approach for UAV Search Path Planning In Discrete Time and Space

Cited by: 0

Authors:

Benalaya, Najoua [1]

Amdouni, Ichrak [1,2]

Adjih, Cedric [3]

Laouiti, Anis [2]

Saidane, Leila Azouz [1]

Affiliations:

[1] Univ Manouba, ENSI, Manouba, Tunisia

[2] Telecom SudParis, Paris, France

[3] INRIA Saclay, Palaiseau, France

Source:

20TH INTERNATIONAL WIRELESS COMMUNICATIONS & MOBILE COMPUTING CONFERENCE, IWCMC 2024 | 2024

Keywords:

Deep Reinforcement Learning; PPO; UAVs; Search Path Planning; Reward Design; Optuna; Hyperparameters Search

DOI:

10.1109/IWCMC61514.2024.10592510

Chinese Library Classification (CLC) number:

TP301 [Theory, Methods]

Discipline classification code:

081202

Abstract:

Path planning for search missions carried out by Unmanned Aerial Vehicles (UAVs) is a challenging problem because of the UAV's limited energy budget and the importance of time in search operations. The objective of this study is to minimize the total search time required to locate a specific target. To achieve this, we deployed a deep reinforcement learning (DRL) model based on the Proximal Policy Optimization (PPO) algorithm to solve the combinatorial optimization problem of UAV search path planning with minimal search time. A dedicated reward formulation is designed to achieve the learning goal, fulfill the search requirement, and encourage the agent to select search paths that minimize search time. In addition, we employed the Optuna hyperparameter optimization framework to systematically select the parameters of the PPO model. Most importantly, thanks to the state representation we considered, the model generalizes and adapts to various search environments. The PPO model succeeds in computing an accurate search path for the UAV searcher to follow. The results of the model are compared with results previously obtained with a linear program. We found that PPO achieves almost the same expected search time, which supports the relevance of our reward design and hyperparameter selection.
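The "expected search time" that the abstract uses as its comparison metric can be made concrete with a small sketch. The abstract does not give the paper's formulation, so everything here is an assumption for illustration: a stationary target, a prior probability for each grid cell, one cell searched per time step, a fixed detection probability, and undetected probability mass charged with the full path length. The function name and parameters are hypothetical.

```python
from typing import Dict, List, Tuple

Cell = Tuple[int, int]  # (row, column) of a grid cell


def expected_search_time(
    path: List[Cell],
    prior: Dict[Cell, float],
    detect_prob: float = 1.0,
) -> float:
    """Expected detection time of a stationary target along a search path.

    Assumed model (not taken from the paper): the cell path[t-1] is searched
    at time step t, and a search of the target's cell succeeds with
    probability detect_prob. Probability mass still undetected at the end
    of the path is charged with the path length.
    """
    # Joint probability that the target is in a cell and not yet detected.
    remaining = dict(prior)
    p_undetected = 1.0
    expected = 0.0
    for t, cell in enumerate(path, start=1):
        p_detect_now = detect_prob * remaining.get(cell, 0.0)
        expected += t * p_detect_now
        remaining[cell] = remaining.get(cell, 0.0) - p_detect_now
        p_undetected -= p_detect_now
    expected += len(path) * p_undetected
    return expected


# Hypothetical 3-cell prior: visiting likely cells first lowers the metric.
prior = {(0, 0): 0.5, (0, 1): 0.3, (1, 1): 0.2}
likely_first = expected_search_time([(0, 0), (0, 1), (1, 1)], prior)
likely_last = expected_search_time([(1, 1), (0, 1), (0, 0)], prior)
```

Under these assumptions, a path that visits high-probability cells early scores a lower expected search time. A reward that penalizes each elapsed step before detection would push an agent toward such paths, though the paper's actual reward design may differ.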

Pages: 1437-1442

Page count: 6
