Deep Reinforcement Learning Approach for UAV Search Path Planning In Discrete Time and Space

被引:0
|
作者
Benalaya, Najoua [1 ]
Amdouni, Ichrak [1 ,2 ]
Adjih, Cedric [3 ]
Laouiti, Anis [2 ]
Saidane, Leila Azouz [1 ]
机构
[1] Univ Manouba, ENSI, Manouba, Tunisia
[2] Telecom SudParis, Paris, France
[3] INRIA Saclay, Palaiseau, France
关键词
Deep Reinforcement Learning; PPO; UAVs; Search Path Planning; Reward Design; Optuna; Hyperparameters Search;
D O I
10.1109/IWCMC61514.2024.10592510
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Path planning for search missions carried out by Unmanned Aerial Vehicles (UAVs) is a challenging problem. This is due to UAV limited energy budget and the importance of time for search operations. The objective of this study is to come up with an approach to minimize the total search time required to locate a specific target. To achieve this, we deployed a deep reinforcement learning (DRL) model based on the Proximal Policy Optimization (PPO) algorithm to solve the combinatorial optimization problem of UAV search path planning within a minimized search time. A smart reward formulation is designed to achieve the learning goal, fulfill the search requirement, and encourage the agent to select search paths that minimize search time. In addition, we employed Optuna hyperparameter optimization framework to systematically select optimal parameters for the PPO model. Most importantly, thanks to the state representation we considered, the model is generalized and adaptable to various search environments. The PPO model succeeds to compute an accurate search path to be followed by the UAV searcher. Results of the model are compared with results previously obtained with a linear program. We found that the PPO achieves almost the same expected search time, which proves the great relevance of the reward design and the hyperparameters selection we made.
引用
收藏
页码:1437 / 1442
页数:6
相关论文
共 50 条
  • [1] UAV Path Planning for Wireless Data Harvesting: A Deep Reinforcement Learning Approach
    Bayerlein, Harald
    Theile, Mirco
    Caccamo, Marco
    Gesbert, David
    2020 IEEE GLOBAL COMMUNICATIONS CONFERENCE (GLOBECOM), 2020,
  • [2] Reinforcement Learning Combined with Heuristic Search for Solving Discrete Space Path Planning Problems
    Zhang, Xiuling
    Kang, Xuenan
    Wei, Kailun
    Li, Jinxiang
    Ma, Kai
    PROCEEDINGS OF THE 33RD CHINESE CONTROL AND DECISION CONFERENCE (CCDC 2021), 2021, : 2142 - 2147
  • [3] Explainable Deep Reinforcement Learning for UAV autonomous path planning
    He, Lei
    Aouf, Nabil
    Song, Bifeng
    AEROSPACE SCIENCE AND TECHNOLOGY, 2021, 118
  • [4] A UAV Path Planning Method Based on Deep Reinforcement Learning
    Li, Yibing
    Zhang, Sitong
    Ye, Fang
    Jiang, Tao
    Li, Yingsong
    2020 IEEE USNC-CNC-URSI NORTH AMERICAN RADIO SCIENCE MEETING (JOINT WITH AP-S SYMPOSIUM), 2020, : 93 - 94
  • [5] UAV online path planning technology based on deep reinforcement learning
    Fan, Jiaxuan
    Wang, Zhenya
    Ren, Jinlei
    Lu, Ying
    Liu, Yiheng
    2020 CHINESE AUTOMATION CONGRESS (CAC 2020), 2020, : 5382 - 5386
  • [6] A Deep Reinforcement Learning Approach for UAV Path Planning Incorporating Vehicle Dynamics with Acceleration Control
    Sabzekar, Sina
    Samadzad, Mahdi
    Mehditabrizi, Asal
    Tak, Ala Nekouvaght
    UNMANNED SYSTEMS, 2024, 12 (03) : 477 - 498
  • [7] Towards Real-Time Path Planning through Deep Reinforcement Learning for a UAV in Dynamic Environments
    Yan, Chao
    Xiang, Xiaojia
    Wang, Chang
    JOURNAL OF INTELLIGENT & ROBOTIC SYSTEMS, 2020, 98 (02) : 297 - 309
  • [8] Towards Real-Time Path Planning through Deep Reinforcement Learning for a UAV in Dynamic Environments
    Chao Yan
    Xiaojia Xiang
    Chang Wang
    Journal of Intelligent & Robotic Systems, 2020, 98 : 297 - 309
  • [9] Research on Path Planning of Agricultural UAV Based on Improved Deep Reinforcement Learning
    Fu, Haitao
    Li, Zheng
    Zhang, Weijian
    Feng, Yuxuan
    Zhu, Li
    Fang, Xu
    Li, Jian
    AGRONOMY-BASEL, 2024, 14 (11):
  • [10] Path Planning for UAV Ground Target Tracking via Deep Reinforcement Learning
    Li, Bohao
    Wu, Yunjie
    IEEE ACCESS, 2020, 8 (29064-29074) : 29064 - 29074