Deep Reinforcement Learning Approach for UAV Search Path Planning In Discrete Time and Space

被引:0
|
作者
Benalaya, Najoua [1 ]
Amdouni, Ichrak [1 ,2 ]
Adjih, Cedric [3 ]
Laouiti, Anis [2 ]
Saidane, Leila Azouz [1 ]
机构
[1] Univ Manouba, ENSI, Manouba, Tunisia
[2] Telecom SudParis, Paris, France
[3] INRIA Saclay, Palaiseau, France
关键词
Deep Reinforcement Learning; PPO; UAVs; Search Path Planning; Reward Design; Optuna; Hyperparameters Search;
D O I
10.1109/IWCMC61514.2024.10592510
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Path planning for search missions carried out by Unmanned Aerial Vehicles (UAVs) is a challenging problem. This is due to UAV limited energy budget and the importance of time for search operations. The objective of this study is to come up with an approach to minimize the total search time required to locate a specific target. To achieve this, we deployed a deep reinforcement learning (DRL) model based on the Proximal Policy Optimization (PPO) algorithm to solve the combinatorial optimization problem of UAV search path planning within a minimized search time. A smart reward formulation is designed to achieve the learning goal, fulfill the search requirement, and encourage the agent to select search paths that minimize search time. In addition, we employed Optuna hyperparameter optimization framework to systematically select optimal parameters for the PPO model. Most importantly, thanks to the state representation we considered, the model is generalized and adaptable to various search environments. The PPO model succeeds to compute an accurate search path to be followed by the UAV searcher. Results of the model are compared with results previously obtained with a linear program. We found that the PPO achieves almost the same expected search time, which proves the great relevance of the reward design and the hyperparameters selection we made.
引用
收藏
页码:1437 / 1442
页数:6
相关论文
共 50 条
  • [21] Path Planning for UAV-Mounted Mobile Edge Computing With Deep Reinforcement Learning
    Liu, Qian
    Shi, Long
    Sun, Linlin
    Li, Jun
    Ding, Ming
    Shu, Feng
    IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2020, 69 (05) : 5723 - 5728
  • [22] Robot Search Path Planning Method Based on Prioritized Deep Reinforcement Learning
    Yanglong Liu
    Zuguo Chen
    Yonggang Li
    Ming Lu
    Chaoyang Chen
    Xuzhuo Zhang
    International Journal of Control, Automation and Systems, 2022, 20 : 2669 - 2680
  • [23] Robot Search Path Planning Method Based on Prioritized Deep Reinforcement Learning
    Liu, Yanglong
    Chen, Zuguo
    Li, Yonggang
    Lu, Ming
    Chen, Chaoyang
    Zhang, Xuzhuo
    INTERNATIONAL JOURNAL OF CONTROL AUTOMATION AND SYSTEMS, 2022, 20 (08) : 2669 - 2680
  • [24] Real Time Path Planning of Robot using Deep Reinforcement Learning
    Raajan, Jeevan
    Srihari, P., V
    Satya, Jayadev P.
    Bhikkaji, B.
    Pasumarthy, Ramkrishna
    IFAC PAPERSONLINE, 2020, 53 (02): : 15602 - 15607
  • [25] AoI optimal UAV trajectory planning: A Deep Recurrent Reinforcement Learning Approach
    Wu, Mengjie
    Chi, Huijia
    Gan, Shuying
    Wang, Xijun
    Xu, Chao
    2021 IEEE 32ND ANNUAL INTERNATIONAL SYMPOSIUM ON PERSONAL, INDOOR AND MOBILE RADIO COMMUNICATIONS (PIMRC), 2021,
  • [26] A novel modified search and rescue optimization algorithm based on reinforcement learning for UAV path planning
    Zhou W.-J.
    Zhang C.-Q.
    Tang W.-D.
    Yi Y.-H.
    Liu W.-W.
    Qin W.-D.
    Kongzhi yu Juece/Control and Decision, 2024, 39 (04): : 1203 - 1211
  • [27] Deep Reinforcement Learning for Real-Time Trajectory Planning in UAV Networks
    Li, Kai
    Ni, Wei
    Tovar, Eduardo
    Guizani, Mohsen
    2020 16TH INTERNATIONAL WIRELESS COMMUNICATIONS & MOBILE COMPUTING CONFERENCE, IWCMC, 2020, : 958 - 963
  • [28] Deep Reinforcement Learning-Based UAV Path Planning Algorithm in Agricultural Time- Constrained Data Collection
    Cai, Mingcheng
    Fan, Shoucheng
    Xiao, GuoQiang
    Hu, Ke
    ADVANCES IN ELECTRICAL AND COMPUTER ENGINEERING, 2023, 23 (02) : 101 - 108
  • [29] Deep Reinforcement Learning for UAV Intelligent Mission Planning
    Yue, Longfei
    Yang, Rennong
    Zhang, Ying
    Yu, Lixin
    Wang, Zhuangzhuang
    COMPLEXITY, 2022, 2022
  • [30] UAV Coverage Path Planning under Varying Power Constraints using Deep Reinforcement Learning
    Theile, Mirco
    Bayerlein, Harald
    Nai, Richard
    Gesbert, David
    Caccamo, Marco
    2020 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2020, : 1444 - 1449