A reinforcement learning-based evolutionary algorithm for the unmanned aerial vehicles maritime search and rescue path planning problem considering multiple rescue centers

Cited by: 2
Authors
Zhan, Haowen [1]
Zhang, Yue [2]
Huang, Jingbo [1]
Song, Yanjie [3]
Xing, Lining [4]
Wu, Jie [5]
Gao, Zengyun [6]
Affiliations
[1] Natl Univ Def Technol, Coll Syst Engn, Changsha 410073, Peoples R China
[2] Beihang Univ, Sch Reliabil & Syst Engn, Beijing 100191, Peoples R China
[3] Wuyi Intelligent Mfg Inst Ind Technol, Jinhua 321017, Peoples R China
[4] Xidian Univ, Key Lab Collaborat Intelligence Syst, Xian 710071, Peoples R China
[5] Nanjing Univ, Sch Geog & Ocean Sci, Nanjing 210023, Peoples R China
[6] China Maritime Serv Ctr, Beijing 100029, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
UAV; Maritime search and rescue; Path planning; Reinforcement learning; Evolutionary algorithm; Genetic;
DOI
10.1007/s12293-024-00420-8
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification code
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
In the realm of maritime emergencies, unmanned aerial vehicles (UAVs) play a crucial role in enhancing search and rescue (SAR) operations. They help in efficiently rescuing distressed crews, strengthening maritime surveillance, and maintaining national security due to their cost-effectiveness, versatility, and effectiveness. However, the vast expanse of sea territories and the rapid changes in maritime conditions make a single SAR center insufficient for handling complex emergencies. Thus, it is vital to develop strategies for quickly deploying UAV resources from multiple SAR centers for area reconnaissance and supporting maritime rescue operations. This study introduces a graph-structured planning model for the maritime SAR path planning problem, considering multiple rescue centers (MSARPPP-MRC). It incorporates workload distribution among SAR centers and UAV operational constraints. We propose a reinforcement learning-based genetic algorithm (GA-RL) to tackle the MSARPPP-MRC problem. GA-RL uses heuristic rules to initialize the population and employs the Q-learning method to manage the progeny during each generation, including their retention, storage, or disposal. When the elite repository's capacity is reached, a decision is made on the utilization of these members to refresh the population. Additionally, adaptive crossover and perturbation strategies are applied to develop a more effective SAR scheme. Extensive testing proves that GA-RL surpasses other algorithms in optimization efficacy and efficiency, highlighting the benefits of reinforcement learning in population management.
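The abstract says that GA-RL uses Q-learning to decide whether each offspring is retained, stored, or disposed of, but this record does not give the state definition, reward, or hyperparameters. The following is only a minimal sketch of a tabular Q-learning decision over those three actions. The state labels (`improved`, `stagnant`), the reward value, `alpha`, `gamma`, and all function names are assumptions made for illustration and are not taken from the paper.

```python
import random

# The three offspring-handling actions named in the abstract.
ACTIONS = ("retain", "store", "discard")


def make_q_table(states):
    """Build a zero-initialized Q-table: one row per state, one entry per action."""
    return {s: {a: 0.0 for a in ACTIONS} for s in states}


def choose_action(q_table, state, epsilon, rng):
    """Epsilon-greedy selection: explore with probability epsilon, else exploit."""
    if rng.random() < epsilon:
        return rng.choice(ACTIONS)
    row = q_table[state]
    return max(ACTIONS, key=lambda a: row[a])


def q_update(q_table, state, action, reward, next_state, alpha=0.1, gamma=0.9):
    """Standard tabular Q-learning update; alpha and gamma are assumed values."""
    best_next = max(q_table[next_state].values())
    td_target = reward + gamma * best_next
    q_table[state][action] += alpha * (td_target - q_table[state][action])
    return q_table[state][action]


# Hypothetical usage: states describe whether the offspring improved on its parent.
q = make_q_table(["improved", "stagnant"])
rng = random.Random(0)
action = choose_action(q, "improved", epsilon=0.1, rng=rng)
q_update(q, "improved", action, reward=1.0, next_state="stagnant")
```

In a full genetic algorithm, the reward would presumably reflect the change in population fitness after the chosen action, and offspring assigned `store` would go to the elite repository the abstract mentions; both details are left open here because the record does not specify them.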
Pages: 373-386
Page count: 14
Related papers
50 in total
  • [1] Coverage path planning for multiple unmanned aerial vehicles in maritime search and rescue operations
    Cho, Sung Won
    Park, Hyun Ji
    Lee, Hanseob
    Shim, David Hyunchul
    Kim, Sun-Young
    COMPUTERS & INDUSTRIAL ENGINEERING, 2021, 161
  • [2] Coverage path planning for maritime search and rescue using reinforcement learning
    Ai, Bo
    Jia, Maoxin
    Xu, Hanwen
    Xu, Jiangling
    Wen, Zhen
    Li, Benshuai
    Zhang, Dan
    OCEAN ENGINEERING, 2021, 241
  • [3] Maritime Search and Rescue Based on Group Mobile Computing for Unmanned Aerial Vehicles and Unmanned Surface Vehicles
    Yang, Tingting
    Jiang, Zhi
    Sun, Ruijin
    Cheng, Nan
    Feng, Hailong
    IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2020, 16 (12) : 7700 - 7708
  • [4] Robust Cellular Communications for Unmanned Aerial Vehicles in Maritime Search and Rescue
    Gorczak, Philipp
    Bektas, Caner
    Kurtz, Fabian
    Luebcke, Thomas
    Wietfeld, Christian
    2019 IEEE INTERNATIONAL SYMPOSIUM ON SAFETY, SECURITY, AND RESCUE ROBOTICS (SSRR), 2019, : 229 - 234
  • [5] An autonomous coverage path planning algorithm for maritime search and rescue of persons-in-water based on deep reinforcement learning
    Wu, Jie
    Cheng, Liang
    Chu, Sensen
    Song, Yanjie
    OCEAN ENGINEERING, 2024, 291
  • [6] Construction of a virtual dataset of maritime search and rescue targets for unmanned aerial vehicles
    Zhao, Zhenqiang
    Shen, Helong
    Liang, Xiao
    Wang, Lucai
    Han, Bing
    OCEAN ENGINEERING, 2025, 328
  • [7] A novel modified search and rescue optimization algorithm based on reinforcement learning for UAV path planning
    Zhou W.-J.
    Zhang C.-Q.
    Tang W.-D.
    Yi Y.-H.
    Liu W.-W.
    Qin W.-D.
    Kongzhi yu Juece/Control and Decision, 2024, 39 (04) : 1203 - 1211
  • [8] Rescue path planning for urban flood: A deep reinforcement learning-based approach
    Li, Xiao-Yan
    Wang, Xia
    RISK ANALYSIS, 2024
  • [9] Parallel Algorithm for the Path Planning of Multiple Unmanned Aerial Vehicles
    Roberge, Vincent
    Tarbouchi, Mohammed
    2020 FOURTH INTERNATIONAL CONFERENCE ON INTELLIGENT COMPUTING IN DATA SCIENCES (ICDS), 2020
  • [10] Fish-Inspired Task Allocation Algorithm for Multiple Unmanned Aerial Vehicles in Search and Rescue Missions
    Alhaqbani, Amjaad
    Kurdi, Heba
    Youcef-Toumi, Kamal
    REMOTE SENSING, 2021, 13 (01) : 1 - 17