A reinforcement learning-based evolutionary algorithm for the unmanned aerial vehicles maritime search and rescue path planning problem considering multiple rescue centers

被引:2
|
作者
Zhan, Haowen [1 ]
Zhang, Yue [2 ]
Huang, Jingbo [1 ]
Song, Yanjie [3 ]
Xing, Lining [4 ]
Wu, Jie [5 ]
Gao, Zengyun [6 ]
机构
[1] Natl Univ Def Technol, Coll Syst Engn, Changsha 410073, Peoples R China
[2] Beihang Univ, Sch Reliabil & Syst Engn, Beijing 100191, Peoples R China
[3] Wuyi Intelligent Mfg Inst Ind Technol, Jinhua 321017, Peoples R China
[4] Xidian Univ, Key Lab Collaborat Intelligence Syst, Xian 710071, Peoples R China
[5] Nanjing Univ, Sch Geog & Ocean Sci, Nanjing 210023, Peoples R China
[6] China Maritime Serv Ctr, Beijing 100029, Peoples R China
基金
中国国家自然科学基金;
关键词
UAV; Maritime search and rescue; Path planning; Reinforcement learning; Evolutionary algorithm; Genetic;
D O I
10.1007/s12293-024-00420-8
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In the realm of maritime emergencies, unmanned aerial vehicles (UAVs) play a crucial role in enhancing search and rescue (SAR) operations. They help in efficiently rescuing distressed crews, strengthening maritime surveillance, and maintaining national security due to their cost-effectiveness, versatility, and effectiveness. However, the vast expanse of sea territories and the rapid changes in maritime conditions make a single SAR center insufficient for handling complex emergencies. Thus, it is vital to develop strategies for quickly deploying UAV resources from multiple SAR centers for area reconnaissance and supporting maritime rescue operations. This study introduces a graph-structured planning model for the maritime SAR path planning problem, considering multiple rescue centers (MSARPPP-MRC). It incorporates workload distribution among SAR centers and UAV operational constraints. We propose a reinforcement learning-based genetic algorithm (GA-RL) to tackle the MSARPPP-MRC problem. GA-RL uses heuristic rules to initialize the population and employs the Q-learning method to manage the progeny during each generation, including their retention, storage, or disposal. When the elite repository's capacity is reached, a decision is made on the utilization of these members to refresh the population. Additionally, adaptive crossover and perturbation strategies are applied to develop a more effective SAR scheme. Extensive testing proves that GA-RL surpasses other algorithms in optimization efficacy and efficiency, highlighting the benefits of reinforcement learning in population management.
引用
收藏
页码:373 / 386
页数:14
相关论文
共 50 条
  • [41] Evolution-algorithm-based unmanned aerial vehicles path planning in complex environment
    Liu, Xiaolei
    Du, Xiaojiang
    Zhang, Xiaosong
    Zhu, Qingxin
    Guizani, Mohsen
    COMPUTERS & ELECTRICAL ENGINEERING, 2019, 80
  • [42] A Lightweight Reinforcement-Learning-Based Real-Time Path-Planning Method for Unmanned Aerial Vehicles
    Xi, Meng
    Dai, Huiao
    He, Jingyi
    Li, Wenjie
    Wen, Jiabao
    Xiao, Shuai
    Yang, Jiachen
    IEEE INTERNET OF THINGS JOURNAL, 2024, 11 (12): : 21061 - 21071
  • [43] An immune plasma algorithm with Q-learning based pandemic management for path planning of unmanned aerial vehicles
    Aslan, Selcuk
    Demirci, Sercan
    EGYPTIAN INFORMATICS JOURNAL, 2024, 26
  • [44] Iterative learning-based formation control for multiple quadrotor unmanned aerial vehicles
    Zhao, Zhihui
    Wang, Jing
    Chen, Yangquan
    Ju, Shuang
    INTERNATIONAL JOURNAL OF ADVANCED ROBOTIC SYSTEMS, 2020, 17 (02)
  • [45] Autonomous Obstacle Avoidance Algorithm for Unmanned Aerial Vehicles Based on Deep Reinforcement Learning
    Gao, Yuan
    Ren, Ling
    Shi, Tianwei
    Xu, Teng
    Ding, Jianbang
    ENGINEERING LETTERS, 2024, 32 (03) : 650 - 660
  • [46] Multi-UAV Coverage Path Planning Based on Hexagonal Grid Decomposition in Maritime Search and Rescue
    Cho, Sung-Won
    Park, Jin-Hyoung
    Park, Hyun-Ji
    Kim, Seongmin
    MATHEMATICS, 2022, 10 (01)
  • [47] Task Allocation of Multiple Unmanned Aerial Vehicles Based on Deep Transfer Reinforcement Learning
    Yin, Yongfeng
    Guo, Yang
    Su, Qingran
    Wang, Zhetao
    DRONES, 2022, 6 (08)
  • [48] Global path planning for amphibious unmanned vehicles with multiple constraints via deep reinforcement learning
    Wu, Ting
    Wang, Ronghao
    Zhang, Yan
    Meng, Yuhang
    Xiang, Yuzhu
    Xiang, Zhengrong
    2024 14TH ASIAN CONTROL CONFERENCE, ASCC 2024, 2024, : 1296 - 1301
  • [49] Research on Subareas Division Problem of Maritime Search and Rescue Based on Improved Monte Carlo Algorithm
    Jie, Li
    Yao, Sun
    Zhang, Zhenxing
    2013 IEEE INTERNATIONAL CONFERENCE ON MECHATRONICS AND AUTOMATION (ICMA), 2013, : 1464 - 1468
  • [50] Collaborative Route Planning of Multiple Unmanned Aerial Vehicles Considering Task Threats Based on Improved Grasshopper Optimization Algorithm
    Guo Z.
    Lou W.
    Li T.
    Zhang M.
    Bai Z.
    Qiao H.
    Binggong Xuebao/Acta Armamentarii, 2023, 44 : 52 - 60