Deep Reinforcement Learning Approach for UAV Search Path Planning In Discrete Time and Space

Cited by: 0

Authors:

Benalaya, Najoua [1]

Amdouni, Ichrak [1,2]

Adjih, Cedric [3]

Laouiti, Anis [2]

Saidane, Leila Azouz [1]

Affiliations:

[1] Univ Manouba, ENSI, Manouba, Tunisia

[2] Telecom SudParis, Paris, France

[3] INRIA Saclay, Palaiseau, France

Source:

20TH INTERNATIONAL WIRELESS COMMUNICATIONS & MOBILE COMPUTING CONFERENCE, IWCMC 2024 | 2024

Keywords:

Deep Reinforcement Learning; PPO; UAVs; Search Path Planning; Reward Design; Optuna; Hyperparameters Search;

DOI:

10.1109/IWCMC61514.2024.10592510

Chinese Library Classification (CLC) number:

TP301 [Theory, Methods]

Discipline classification code:

081202

Abstract:

Path planning for search missions carried out by Unmanned Aerial Vehicles (UAVs) is a challenging problem because of the limited energy budget of UAVs and the importance of time in search operations. The objective of this study is to develop an approach that minimizes the total search time required to locate a specific target. To achieve this, we deployed a deep reinforcement learning (DRL) model based on the Proximal Policy Optimization (PPO) algorithm to solve the combinatorial optimization problem of UAV search path planning with minimized search time. A reward formulation is designed to achieve the learning goal, fulfill the search requirement, and encourage the agent to select search paths that minimize search time. In addition, we employed the Optuna hyperparameter optimization framework to systematically select parameters for the PPO model. Most importantly, thanks to the state representation we considered, the model generalizes and adapts to various search environments. The PPO model succeeds in computing an accurate search path to be followed by the UAV searcher. Results of the model are compared with results previously obtained with a linear program. We found that the PPO model achieves almost the same expected search time, which demonstrates the relevance of the reward design and the hyperparameter selection we made.
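
Note: the abstract evaluates paths by their expected search time in discrete time and space. The following is a minimal sketch of how such a quantity could be computed for a fixed path, not the authors' model. It assumes a stationary target, a prior probability per grid cell, one cell visited per time step, and a per-visit detection probability `detect_prob`; probability mass left undetected at the end of the path is charged the path length. The names `expected_search_time`, `prior`, and `detect_prob` are hypothetical and do not come from the paper.

```python
from typing import Dict, List, Tuple

Cell = Tuple[int, int]


def expected_search_time(
    path: List[Cell],
    prior: Dict[Cell, float],
    detect_prob: float = 1.0,
) -> float:
    """Expected detection time E[min(T_detect, len(path))] for a fixed path.

    Assumptions (illustrative only): the target does not move, the searcher
    inspects exactly one cell per time step, and each visit to the target's
    cell detects it with probability `detect_prob`.
    """
    remaining = dict(prior)  # undetected probability mass per cell
    expected = 0.0
    detected = 0.0
    for step, cell in enumerate(path, start=1):
        p_here = remaining.get(cell, 0.0) * detect_prob
        expected += step * p_here          # detection happens at this step
        detected += p_here
        remaining[cell] = remaining.get(cell, 0.0) * (1.0 - detect_prob)
    # Mass never detected is charged the full path length (truncated horizon).
    expected += len(path) * max(0.0, 1.0 - detected)
    return expected


# Toy 2x2 grid: most of the prior mass sits in cell (0, 1).
prior = {(0, 0): 0.1, (0, 1): 0.6, (1, 0): 0.1, (1, 1): 0.2}
path_a = [(0, 0), (0, 1), (1, 1), (1, 0)]  # starts in a low-probability cell
path_b = [(0, 1), (1, 1), (1, 0), (0, 0)]  # visits likely cells first

time_a = expected_search_time(path_a, prior)
time_b = expected_search_time(path_b, prior)
```

In this toy case, the path that visits high-probability cells first has the lower expected search time. That is the kind of preference a reward aimed at minimizing search time would be expected to encourage.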

Pages: 1437-1442

Page count: 6

