An apollonius circle based game theory and Q-learning for cooperative hunting in unmanned aerial vehicle cluster

被引:4
|
作者
Hua, Xiang [1 ]
Liu, Jing [1 ]
Zhang, Jinjin [1 ]
Shi, Chenglong [1 ]
机构
[1] Xian Technol Univ, Xian 710000, Peoples R China
关键词
UAV cluster; Cooperative hunting; Apollonius circle; Game theory; Q-learning;
D O I
10.1016/j.compeleceng.2023.108876
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Cooperative hunting has attracted great research interests with both pursuer and evader behavior strategies. Existing approaches typically utilize computing power to improve the accuracy of hunting. However, these methods ignore the real-time characteristic of unmanned aerial vehicle (UAV) cluster and timeliness of hunting process, directly using them into UAV cluster application would lose efficacy. To solve the problem of cooperative hunting of UAV cluster (pursuers) for one intelligent UAV (evader), we propose an apollonius circle based game theory and Q-learning for cooperative hunting (ACGQ-CH). Specifically, it constructs the behavior strategies and payment matrix of the pursuers and the evader by using game theory and apollonius circle. Then, a state-action matrix is built and a dynamically adjusting the greedy factor is designed based on Qlearning algorithm and reward mean, respectively. Finally, it derives Nash equilibrium solution by solving the state-action matrix, and guides the pursuers to achieve cooperative hunting. The simulation results demonstrate our approach reduces the number of learning steps by 50.8% compared to traditional Q-learning and reduces the hunting time by 16.83, 27.35 and 12.56% respectively compared to baseline methods.
引用
收藏
页数:14
相关论文
共 50 条
  • [31] Intelligent event-based output feedback control with Q-learning for unmanned marine vehicle systems
    Zhang, Dan
    Ye, ZeHua
    Chen, PengCheng
    Wang, Qing-Guo
    CONTROL ENGINEERING PRACTICE, 2020, 105
  • [32] Federated learning based on Stackelberg game in unmanned-aerial-vehicle-enabled mobile edge computing
    Li, Chunlin
    Song, Mingyang
    Luo, Youlong
    EXPERT SYSTEMS WITH APPLICATIONS, 2024, 235
  • [33] Markov decision evolutionary game theoretic learning for cooperative sensing of unmanned aerial vehicles
    SUN ChangHao
    DUAN HaiBin
    Science China(Technological Sciences), 2015, 58 (08) : 1392 - 1400
  • [34] Markov decision evolutionary game theoretic learning for cooperative sensing of unmanned aerial vehicles
    SUN ChangHao
    DUAN HaiBin
    Science China(Technological Sciences), 2015, (08) : 1392 - 1400
  • [35] Markov decision evolutionary game theoretic learning for cooperative sensing of unmanned aerial vehicles
    ChangHao Sun
    HaiBin Duan
    Science China Technological Sciences, 2015, 58 : 1392 - 1400
  • [36] Markov decision evolutionary game theoretic learning for cooperative sensing of unmanned aerial vehicles
    Sun ChangHao
    Duan HaiBin
    SCIENCE CHINA-TECHNOLOGICAL SCIENCES, 2015, 58 (08) : 1392 - 1400
  • [37] Learning Automata Based Q-Learning for Content Placement in Cooperative Caching
    Yang, Zhong
    Liu, Yuanwei
    Chen, Yue
    Jiao, Lei
    IEEE TRANSACTIONS ON COMMUNICATIONS, 2020, 68 (06) : 3667 - 3680
  • [38] Fuzzing for Unmanned Aerial Vehicle System Based on Reinforcement Learning
    Yu, Zhenhua
    Yang, Wenjian
    Li, Xiteng
    Cong, Xuya
    Computer Engineering and Applications, 2024, 60 (21) : 89 - 98
  • [39] Multi-criteria expertness based cooperative Q-learning
    Esmat Pakizeh
    Maziar Palhang
    Mir Mohsen Pedram
    Applied Intelligence, 2013, 39 : 28 - 40
  • [40] Multi-criteria expertness based cooperative Q-learning
    Pakizeh, Esmat
    Palhang, Maziar
    Pedram, Mir Mohsen
    APPLIED INTELLIGENCE, 2013, 39 (01) : 28 - 40