An Interrelated Imitation Learning Method for Heterogeneous Drone Swarm Coordination

被引:2
|
作者
Yang, Bo [1 ]
Ma, Chaofan [2 ]
Xia, Xiaofang [3 ]
机构
[1] Northwest A&F Univ, Coll Informat Engn, Yangling 712100, Shaanxi, Peoples R China
[2] Zhongyuan Univ Technol, Software Coll, Zhengzhou 450007, Henan, Peoples R China
[3] Xidian Univ, Sch Comp Sci & Technol, Xian 710071, Shaanxi, Peoples R China
基金
中国国家自然科学基金;
关键词
Unmanned aerial vehicles; formation control; multi-agent imitation learning; latent belief representation;
D O I
10.1109/TETC.2022.3202297
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The proliferation of small drones has boosted diverse intelligent services, in which the effective swarm coordination plays a vital role in enhancing execution efficiency. However, owing to unreliable air communication and heterogeneous computation capabilities, it is difficult to achieve coordinated actions particularly in distributed scenarios with incomplete observations. In this article, we utilize the generative adversarial imitation learning (GAIL) model to coordinate the drones' maneuvers by imitating the peer's demonstrations. However, incomplete observations will lead to inaccurate imitation policies. In order to recover true environment states, we encode historical observation-action trajectories into latent belief representations, which are trained in correlation to imitation policies. Moreover, by merging the trace of historical contexts, the prediction of future states and the action-assisted guidance information, we gain robust belief representations, which lead to more accurate imitation policies. We evaluate the algorithm performance via the drones' formation control task. Experiment results display the superiorities on imitation accuracy, execution time and energy cost.
引用
收藏
页码:1704 / 1716
页数:13
相关论文
共 50 条
  • [41] Identifying critical factors in systems with interrelated components: A method considering heterogeneous influence and strength attenuation
    Wang, Qun
    Jia, Guozhu
    Song, Wenyan
    EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 2022, 303 (01) : 456 - 470
  • [42] Deep learning-based anomaly detection for individual drone vehicles performing swarm missions
    Ahn, Hyojung
    Chung, Sonia
    EXPERT SYSTEMS WITH APPLICATIONS, 2024, 244
  • [43] A syntactic method for robot imitation learning of complex sequence task
    Du, Yu
    Jian, Jipan
    Zhu, Zhiming
    Pan, Dehua
    Liu, Dong
    Tian, Xiaojing
    ROBOTIC INTELLIGENCE AND AUTOMATION, 2023, 43 (02): : 132 - 143
  • [44] A biologically inspired method for conceptual imitation using reinforcement learning
    Mobahi, Hossein
    Ahmadabadi, Majid Nili
    Araabi, Babak Nadjar
    APPLIED ARTIFICIAL INTELLIGENCE, 2007, 21 (03) : 155 - 183
  • [45] Multi-Modal Imitation Learning Method with Cosine Similarity
    Hao S.
    Liu Q.
    Xu P.
    Zhang L.
    Huang Z.
    Jisuanji Yanjiu yu Fazhan/Computer Research and Development, 2023, 60 (06): : 1358 - 1372
  • [46] Heterogeneous comprehensive learning and dynamic multi- swarm particle swarm optimizer with two mutation operators
    Wang, Shengliang
    Liu, Genyou
    Gao, Ming
    Cao, Shilong
    Guo, Aizhi
    Wang, Jiachen
    INFORMATION SCIENCES, 2020, 540 (540) : 175 - 201
  • [47] A Discrete Particle Swarm Optimization Approach to Compose Heterogeneous Learning Groups
    Zheng, Zhilin
    Pinkwart, Niels
    2014 14TH IEEE INTERNATIONAL CONFERENCE ON ADVANCED LEARNING TECHNOLOGIES (ICALT), 2014, : 49 - 51
  • [48] Heterogeneous comprehensive learning particle swarm optimization with enhanced exploration and exploitation
    Lynn, Nandar
    Suganthan, Pormuthurai Nagaratnam
    SWARM AND EVOLUTIONARY COMPUTATION, 2015, 24 : 11 - 24
  • [49] Heterogeneous pbest-guided comprehensive learning particle swarm optimization
    Meng, Xiaoding
    Li, Hecheng
    APPLIED SOFT COMPUTING, 2024, 162
  • [50] Heterogeneous unmanned swarm formation containment control based on reinforcement learning
    Yang, Jiaxiu
    Zhang, Hongli
    Wang, Hao
    Li, Xinkai
    Wang, Cong
    AEROSPACE SCIENCE AND TECHNOLOGY, 2024, 150