An apollonius circle based game theory and Q-learning for cooperative hunting in unmanned aerial vehicle cluster

被引:4
|
作者
Hua, Xiang [1 ]
Liu, Jing [1 ]
Zhang, Jinjin [1 ]
Shi, Chenglong [1 ]
机构
[1] Xian Technol Univ, Xian 710000, Peoples R China
关键词
UAV cluster; Cooperative hunting; Apollonius circle; Game theory; Q-learning;
D O I
10.1016/j.compeleceng.2023.108876
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Cooperative hunting has attracted great research interests with both pursuer and evader behavior strategies. Existing approaches typically utilize computing power to improve the accuracy of hunting. However, these methods ignore the real-time characteristic of unmanned aerial vehicle (UAV) cluster and timeliness of hunting process, directly using them into UAV cluster application would lose efficacy. To solve the problem of cooperative hunting of UAV cluster (pursuers) for one intelligent UAV (evader), we propose an apollonius circle based game theory and Q-learning for cooperative hunting (ACGQ-CH). Specifically, it constructs the behavior strategies and payment matrix of the pursuers and the evader by using game theory and apollonius circle. Then, a state-action matrix is built and a dynamically adjusting the greedy factor is designed based on Qlearning algorithm and reward mean, respectively. Finally, it derives Nash equilibrium solution by solving the state-action matrix, and guides the pursuers to achieve cooperative hunting. The simulation results demonstrate our approach reduces the number of learning steps by 50.8% compared to traditional Q-learning and reduces the hunting time by 16.83, 27.35 and 12.56% respectively compared to baseline methods.
引用
收藏
页数:14
相关论文
共 50 条
  • [21] Cooperative navigation of unmanned aerial vehicle swarm based on cooperative dilution of precision
    Chen, Mingxing
    Xiong, Zhi
    Liu, Jianye
    Wang, Rong
    Xiong, Jun
    INTERNATIONAL JOURNAL OF ADVANCED ROBOTIC SYSTEMS, 2020, 17 (03)
  • [22] Cooperative Hunting Method of Unmanned Surface Vehicle based on Attention Mechanism
    Sang, Guorong
    Zhu, Xiaoqing
    Ruan, Xiaogang
    Zheng, Xinyi
    Chen, Lu
    Li, Chunyang
    2023 35TH CHINESE CONTROL AND DECISION CONFERENCE, CCDC, 2023, : 4865 - 4869
  • [23] Behavior Control of Cooperative Vehicle Infrastructure System in Container Terminals Based on Q-learning
    Wu, Maopu
    Gao, Jian
    Li, Le
    Wang, Yue
    ARTIFICIAL INTELLIGENCE AND ROBOTICS, ISAIR 2022, PT II, 2022, 1701 : 240 - 246
  • [24] The Research on the Mass Incidents Law Based on Evolutionary Game Theory and Q-Learning
    Zhang D.
    Cao Y.
    Zhao C.
    Yao Z.
    Cao Q.
    Applied Mathematics and Nonlinear Sciences, 2024, 9 (01)
  • [25] An immune plasma algorithm with Q-learning based pandemic management for path planning of unmanned aerial vehicles
    Aslan, Selcuk
    Demirci, Sercan
    EGYPTIAN INFORMATICS JOURNAL, 2024, 26
  • [26] Employing Unmanned Aerial Vehicles for Improving Handoff Using Cooperative Game Theory
    Goudarzi, Shidrokh
    Anisi, Mohammad Hossein
    Ciuonzo, Domenico
    Soleymani, Seyed Ahmad
    Pescape, Antonio
    IEEE TRANSACTIONS ON AEROSPACE AND ELECTRONIC SYSTEMS, 2021, 57 (02) : 776 - 794
  • [27] Research on cooperative hunting method of unmanned surface vehicle based on multi-agent reinforcement learning
    Xia J.-W.
    Zhu X.-F.
    Zhang J.-Q.
    Luo Y.-S.
    Liu Z.
    Kongzhi yu Juece/Control and Decision, 2023, 38 (05): : 1438 - 1447
  • [28] Q-Learning Applied To Soft-Kill Countermeasures For Unmanned Aerial Vehicles (UAVs)
    da Silva, Douglas L.
    Antreich, Felix
    Coutinho, Olympio L.
    Machado, Renato
    2020 IEEE/ION POSITION, LOCATION AND NAVIGATION SYMPOSIUM (PLANS), 2020, : 91 - 99
  • [29] Q-learning-based unmanned aerial vehicle path planning with dynamic obstacle avoidance
    Sonny, Amala
    Yeduri, Sreenivasa Reddy
    Cenkeramaddi, Linga Reddy
    APPLIED SOFT COMPUTING, 2023, 147
  • [30] Cooperative Planning for an Unmanned Combat Aerial Vehicle Fleet Using Reinforcement Learning
    Yuksek, Burak
    Demirezen, Mustafa Umut
    Inalhan, Gokhan
    Tsourdos, Antonios
    JOURNAL OF AEROSPACE INFORMATION SYSTEMS, 2021, 18 (10): : 739 - 750