A reinforcement learning-based evolutionary algorithm for the unmanned aerial vehicles maritime search and rescue path planning problem considering multiple rescue centers

被引:2
|
作者
Zhan, Haowen [1 ]
Zhang, Yue [2 ]
Huang, Jingbo [1 ]
Song, Yanjie [3 ]
Xing, Lining [4 ]
Wu, Jie [5 ]
Gao, Zengyun [6 ]
机构
[1] Natl Univ Def Technol, Coll Syst Engn, Changsha 410073, Peoples R China
[2] Beihang Univ, Sch Reliabil & Syst Engn, Beijing 100191, Peoples R China
[3] Wuyi Intelligent Mfg Inst Ind Technol, Jinhua 321017, Peoples R China
[4] Xidian Univ, Key Lab Collaborat Intelligence Syst, Xian 710071, Peoples R China
[5] Nanjing Univ, Sch Geog & Ocean Sci, Nanjing 210023, Peoples R China
[6] China Maritime Serv Ctr, Beijing 100029, Peoples R China
基金
中国国家自然科学基金;
关键词
UAV; Maritime search and rescue; Path planning; Reinforcement learning; Evolutionary algorithm; Genetic;
D O I
10.1007/s12293-024-00420-8
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In the realm of maritime emergencies, unmanned aerial vehicles (UAVs) play a crucial role in enhancing search and rescue (SAR) operations. They help in efficiently rescuing distressed crews, strengthening maritime surveillance, and maintaining national security due to their cost-effectiveness, versatility, and effectiveness. However, the vast expanse of sea territories and the rapid changes in maritime conditions make a single SAR center insufficient for handling complex emergencies. Thus, it is vital to develop strategies for quickly deploying UAV resources from multiple SAR centers for area reconnaissance and supporting maritime rescue operations. This study introduces a graph-structured planning model for the maritime SAR path planning problem, considering multiple rescue centers (MSARPPP-MRC). It incorporates workload distribution among SAR centers and UAV operational constraints. We propose a reinforcement learning-based genetic algorithm (GA-RL) to tackle the MSARPPP-MRC problem. GA-RL uses heuristic rules to initialize the population and employs the Q-learning method to manage the progeny during each generation, including their retention, storage, or disposal. When the elite repository's capacity is reached, a decision is made on the utilization of these members to refresh the population. Additionally, adaptive crossover and perturbation strategies are applied to develop a more effective SAR scheme. Extensive testing proves that GA-RL surpasses other algorithms in optimization efficacy and efficiency, highlighting the benefits of reinforcement learning in population management.
引用
收藏
页码:373 / 386
页数:14
相关论文
共 50 条
  • [31] Path planning of unmanned aerial vehicle based on improved gravitational search algorithm
    Pei Li
    HaiBin Duan
    Science China Technological Sciences, 2012, 55 : 2712 - 2719
  • [32] A Multi-Robot Coverage Path Planning Method for Maritime Search and Rescue Using Multiple AUVs
    Cai, Chang
    Chen, Jianfeng
    Yan, Qingli
    Liu, Fen
    REMOTE SENSING, 2023, 15 (01)
  • [33] Cooperatively pursuing a target unmanned aerial vehicle by multiple unmanned aerial vehicles based on multiagent reinforcement learning
    Wang X.
    Xuan S.
    Ke L.
    Advanced Control for Applications: Engineering and Industrial Systems, 2020, 2 (02):
  • [34] Determining Position of Target Subjects in Maritime Search and Rescue (MSAR) Operations Using Rotary Wing Unmanned Aerial Vehicles (UAVs)
    Ghazali, Siti Nur Alidda Mohd
    Anuar, Hardy Azmir
    Zakaria, Syed Nasir Alsagoff Syed
    Yusoff, Zaharin
    2016 PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON INFORMATION AND COMMUNICATION TECHNOLOGY (ICICTM), 2016, : 1 - 4
  • [35] An enhanced African Vulture Optimization Algorithm for solving the Unmanned Aerial Vehicles path planning problem✩
    Ait-Saadi, Amylia
    Meraihi, Yassine
    Soukane, Assia
    Yahia, Selma
    Ramdane-Cherif, Amar
    Gabis, Asma Benmessaoud
    COMPUTERS & ELECTRICAL ENGINEERING, 2023, 110
  • [36] Multiple Task Planning Based on TS Algorithm for Multiple Heterogeneous Unmanned Aerial Vehicles
    Wang Zheng
    Liu Qiaoqiao
    Tao Hongtao
    Li Jianxun
    2014 IEEE CHINESE GUIDANCE, NAVIGATION AND CONTROL CONFERENCE (CGNCC), 2014, : 630 - 635
  • [37] A Reinforcement Learning-Based Fire Warning and Suppression System Using Unmanned Aerial Vehicles
    Panahi, Fereidoun H.
    Panahi, Farzad H.
    Ohtsuki, Tomoaki
    IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2023, 72
  • [39] A better path planning algorithm based on Clothoid curves for unmanned aerial vehicles (UAVs)
    Wang, Y., 1600, Northwestern Polytechnical University (30):
  • [40] Learning-based Preferential Surveillance Algorithm for Persistent Surveillance by Unmanned Aerial Vehicles
    Ramasamy, Manickam
    Ghose, Debasish
    2016 INTERNATIONAL CONFERENCE ON UNMANNED AIRCRAFT SYSTEMS (ICUAS), 2016, : 1032 - 1040