A Novel Searching Method Using Reinforcement Learning Scheme for Multi-UAVs in Unknown Environments

被引:25
|
作者
Yue, Wei [1 ,2 ]
Guan, Xianhe [1 ]
Wang, Liyuan [2 ]
机构
[1] Dalian Maritime Univ, Sch Marine Elect Engn, Dalian 116000, Peoples R China
[2] Dalian Minzu Univ, Key Lab Intelligent Percept & Adv Control State E, Dalian 116600, Peoples R China
来源
APPLIED SCIENCES-BASEL | 2019年 / 9卷 / 22期
关键词
multi-UAV; cooperative search; reinforcement learning; dynamic target; TARGET;
D O I
10.3390/app9224964
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
In this paper, the important topic of cooperative searches for multi-dynamic targets in unknown sea areas by unmanned aerial vehicles (UAVs) is studied based on a reinforcement learning (RL) algorithm. A novel multi-UAV sea area search map is established, in which models of the environment, UAV dynamics, target dynamics, and sensor detection are involved. Then, the search map is updated and extended using the concept of the territory awareness information map. Finally, according to the search efficiency function, a reward and punishment function is designed, and an RL method is used to generate a multi-UAV cooperative search path online. The simulation results show that the proposed algorithm could effectively perform the search task in the sea area with no prior information.
引用
收藏
页数:15
相关论文
共 50 条
  • [41] A Multi-UAVs Communication Network Simulation Platform using OPNET Modeler
    Liu, Qiang
    Wang, Honggang
    Sun, Yantao
    Han, Tingting
    ICC 2020 - 2020 IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS (ICC), 2020,
  • [42] Downlink Throughput Maximization in Multi-UAVs Networks Using Discrete Optimization
    Kalwar, Saadullah
    Chin, Kwan-Wu
    Yuan, Zhenhui
    JOURNAL OF NETWORK AND SYSTEMS MANAGEMENT, 2020, 28 (02) : 247 - 270
  • [43] Autonomous Conflict Resolution Method for multi-UAVs Based on Preorder Flight Information
    Wang, Weiye
    Wu, Xibao
    Tian, Tian
    Wang, Jiayuan
    2018 INTERNATIONAL CONFERENCE ON ALGORITHMS, COMPUTING AND ARTIFICIAL INTELLIGENCE (ACAI 2018), 2018,
  • [44] Downlink Throughput Maximization in Multi-UAVs Networks Using Discrete Optimization
    Saadullah Kalwar
    Kwan-Wu Chin
    Zhenhui Yuan
    Journal of Network and Systems Management, 2020, 28 : 247 - 270
  • [45] Multi-UAVs task allocation method based on MPSO-SA-DQN
    Pengfei, Peng
    Xue, Gong
    Yalian, Zheng
    MEASUREMENT & CONTROL, 2024,
  • [46] Coordinated variable-based guidance method and experimental verification for multi-UAVs
    Tang Z.-N.
    Xin H.-B.
    Wang Y.-J.
    Chen Q.-Y.
    Wang P.
    Yang X.-X.
    Gongcheng Kexue Xuebao/Chinese Journal of Engineering, 2022, 44 (08): : 1396 - 1405
  • [47] A Multi-UAVs Cooperative Spectrum Sensing Method Based on Improved IDW Algorithm
    Shi, Jie
    Chong, Jingzheng
    Huang, Zejiang
    Yang, Zhihua
    SPACE INFORMATION NETWORKS, SINC 2023, 2024, 2057 : 150 - 163
  • [48] A Hierarchical Reinforcement Learning Based Approach for Multi-robot Cooperation in Unknown Environments
    Cai, Yifan
    Yang, Simon X.
    Xu, Xin
    Mittal, Gauri S.
    PROCEEDINGS OF THE 2011 2ND INTERNATIONAL CONGRESS ON COMPUTER APPLICATIONS AND COMPUTATIONAL SCIENCE, VOL 1, 2012, 144 : 69 - +
  • [49] Enhanced Relative Localization Based on Persistent Excitation for Multi-UAVs in GPS-Denied Environments
    She, Fujiang
    Zhang, Yongjun
    Shi, Dianxi
    Zhou, Hao
    Ren, Xiaoguang
    Xu, Tianqi
    IEEE ACCESS, 2020, 8 : 148136 - 148148
  • [50] Robot Navigation of Environments with Unknown Rough Terrain Using Deep Reinforcement Learning
    Zhang, Kaicheng
    Niroui, Farzad
    Ficocelli, Maurizio
    Nejat, Goldie
    2018 IEEE INTERNATIONAL SYMPOSIUM ON SAFETY, SECURITY, AND RESCUE ROBOTICS (SSRR), 2018,