Deep Reinforcement Learning Approach for UAV Search Path Planning In Discrete Time and Space

被引:0
|
作者
Benalaya, Najoua [1 ]
Amdouni, Ichrak [1 ,2 ]
Adjih, Cedric [3 ]
Laouiti, Anis [2 ]
Saidane, Leila Azouz [1 ]
机构
[1] Univ Manouba, ENSI, Manouba, Tunisia
[2] Telecom SudParis, Paris, France
[3] INRIA Saclay, Palaiseau, France
关键词
Deep Reinforcement Learning; PPO; UAVs; Search Path Planning; Reward Design; Optuna; Hyperparameters Search;
D O I
10.1109/IWCMC61514.2024.10592510
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Path planning for search missions carried out by Unmanned Aerial Vehicles (UAVs) is a challenging problem. This is due to UAV limited energy budget and the importance of time for search operations. The objective of this study is to come up with an approach to minimize the total search time required to locate a specific target. To achieve this, we deployed a deep reinforcement learning (DRL) model based on the Proximal Policy Optimization (PPO) algorithm to solve the combinatorial optimization problem of UAV search path planning within a minimized search time. A smart reward formulation is designed to achieve the learning goal, fulfill the search requirement, and encourage the agent to select search paths that minimize search time. In addition, we employed Optuna hyperparameter optimization framework to systematically select optimal parameters for the PPO model. Most importantly, thanks to the state representation we considered, the model is generalized and adaptable to various search environments. The PPO model succeeds to compute an accurate search path to be followed by the UAV searcher. Results of the model are compared with results previously obtained with a linear program. We found that the PPO achieves almost the same expected search time, which proves the great relevance of the reward design and the hyperparameters selection we made.
引用
收藏
页码:1437 / 1442
页数:6
相关论文
共 50 条
  • [31] AoI-Aware Deep Reinforcement Learning Based UAV Path Planning for Defence Applications
    Kumari, Shilpi
    Sodhi, Eshaan
    Gupta, Dev
    Pratap, Ajay
    2024 IEEE SPACE, AEROSPACE AND DEFENCE CONFERENCE, SPACE 2024, 2024, : 230 - 234
  • [32] Improve exploration in deep reinforcement learning for UAV path planning using state and action entropy
    Lv, Hui
    Chen, Yadong
    Li, Shibo
    Zhu, Baolong
    Li, Min
    MEASUREMENT SCIENCE AND TECHNOLOGY, 2024, 35 (05)
  • [33] Deep Reinforcement Learning based Path Planning for UAV-assisted Edge Computing Networks
    Peng, Yingsheng
    Liu, Yong
    Zhang, Han
    2021 IEEE WIRELESS COMMUNICATIONS AND NETWORKING CONFERENCE (WCNC), 2021,
  • [34] UAV swarm path planning with reinforcement learning for field prospecting
    Puente-Castro, Alejandro
    Rivero, Daniel
    Pazos, Alejandro
    Fernandez-Blanco, Enrique
    APPLIED INTELLIGENCE, 2022, 52 (12) : 14101 - 14118
  • [35] UAV swarm path planning with reinforcement learning for field prospecting
    Alejandro Puente-Castro
    Daniel Rivero
    Alejandro Pazos
    Enrique Fernandez-Blanco
    Applied Intelligence, 2022, 52 : 14101 - 14118
  • [36] Underwater Multi-Target Node Path Planning in Hybrid Action Space: A Deep Reinforcement Learning Approach
    Han, Guangjie
    Feng, Zixiao
    Wang, Hao
    Hou, Yun
    Zhang, Fan
    IEEE TRANSACTIONS ON MOBILE COMPUTING, 2024, 23 (12) : 13033 - 13047
  • [37] A Mapless Local Path Planning Approach Using Deep Reinforcement Learning Framework
    Yin, Yan
    Chen, Zhiyu
    Liu, Gang
    Guo, Jianwei
    SENSORS, 2023, 23 (04)
  • [38] A novel path planning approach for unmanned ships based on deep reinforcement learning
    Chen, Chen
    Ma, Feng
    Liu, Jia-Lun
    Yan, Xin-Ping
    Chen, Xian-Qiao
    DATA SCIENCE AND KNOWLEDGE ENGINEERING FOR SENSING DECISION SUPPORT, 2018, 11 : 626 - 633
  • [39] A deep reinforcement learning approach incorporating genetic algorithm for missile path planning
    Xu, Shuangfei
    Bi, Wenhao
    Zhang, An
    Wang, Yunong
    INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2024, 15 (05) : 1795 - 1814
  • [40] A deep reinforcement learning approach incorporating genetic algorithm for missile path planning
    Shuangfei Xu
    Wenhao Bi
    An Zhang
    Yunong Wang
    International Journal of Machine Learning and Cybernetics, 2024, 15 : 1795 - 1814