An Interrelated Imitation Learning Method for Heterogeneous Drone Swarm Coordination

被引:2
|
作者
Yang, Bo [1 ]
Ma, Chaofan [2 ]
Xia, Xiaofang [3 ]
机构
[1] Northwest A&F Univ, Coll Informat Engn, Yangling 712100, Shaanxi, Peoples R China
[2] Zhongyuan Univ Technol, Software Coll, Zhengzhou 450007, Henan, Peoples R China
[3] Xidian Univ, Sch Comp Sci & Technol, Xian 710071, Shaanxi, Peoples R China
基金
中国国家自然科学基金;
关键词
Unmanned aerial vehicles; formation control; multi-agent imitation learning; latent belief representation;
D O I
10.1109/TETC.2022.3202297
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The proliferation of small drones has boosted diverse intelligent services, in which the effective swarm coordination plays a vital role in enhancing execution efficiency. However, owing to unreliable air communication and heterogeneous computation capabilities, it is difficult to achieve coordinated actions particularly in distributed scenarios with incomplete observations. In this article, we utilize the generative adversarial imitation learning (GAIL) model to coordinate the drones' maneuvers by imitating the peer's demonstrations. However, incomplete observations will lead to inaccurate imitation policies. In order to recover true environment states, we encode historical observation-action trajectories into latent belief representations, which are trained in correlation to imitation policies. Moreover, by merging the trace of historical contexts, the prediction of future states and the action-assisted guidance information, we gain robust belief representations, which lead to more accurate imitation policies. We evaluate the algorithm performance via the drones' formation control task. Experiment results display the superiorities on imitation accuracy, execution time and energy cost.
引用
收藏
页码:1704 / 1716
页数:13
相关论文
共 50 条
  • [11] Research on a Judging Method for the Attack Intent of Drone Swarm
    Sun, Haiwen
    Han, Xiao
    Chen, Ting
    Li, Dan
    Li, Ye
    Jin, Zirong
    Binggong Xuebao/Acta Armamentarii, 2024, 45 : 25 - 35
  • [12] Enhanced multi agent coordination algorithm for drone swarm patrolling in durian orchards
    Tang, Ruipeng
    Tang, Jianrui
    Talip, Mohamad Sofian Abu
    Aridas, Narendra Kumar
    Xu, Xifeng
    SCIENTIFIC REPORTS, 2025, 15 (01):
  • [13] Adversarial imitation learning with deep attention network for swarm systems
    Wu, Yapei
    Wang, Tao
    Liu, Tong
    Zheng, Zhicheng
    Xu, Demin
    Peng, Xingguang
    COMPLEX & INTELLIGENT SYSTEMS, 2025, 11 (01)
  • [14] A Novel Heterogeneous Swarm Reinforcement Learning Method for Sequential Decision Making Problems
    Akbari, Zohreh
    Unland, Rainer
    MACHINE LEARNING AND KNOWLEDGE EXTRACTION, 2019, 1 (02): : 590 - 610
  • [15] Collective Transport Behavior in a Robotic Swarm with Hierarchical Imitation Learning
    Han, Ziyao
    Yi, Fan
    Ohkura, Kazuhiro
    JOURNAL OF ROBOTICS AND MECHATRONICS, 2024, 36 (03) : 538 - 545
  • [16] Decentralized swarm control based on graph convolutional imitation learning
    Guo, Ce
    Zeng, Zhi-Wen
    Zhu, Peng-Ming
    Zhou, Zhi-Qian
    Lu, Hui-Min
    Zhejiang Daxue Xuebao (Gongxue Ban)/Journal of Zhejiang University (Engineering Science), 2022, 56 (06): : 1055 - 1061
  • [17] Deep Reinforcement Learning Based on Curriculum Learning for Drone Swarm Area Defense
    Sun, Miaoping
    Yang, Zequan
    Dai, Xunhua
    Nian, Xiaohong
    Wang, Haibo
    Xiong, Hongyun
    PROCEEDINGS OF 2022 INTERNATIONAL CONFERENCE ON AUTONOMOUS UNMANNED SYSTEMS, ICAUS 2022, 2023, 1010 : 1119 - 1128
  • [18] Review of Reliability Assessment Methods of Drone Swarm (Fleet) and a New Importance Evaluation Based Method of Drone Swarm Structure Analysis
    Zaitseva, Elena
    Levashenko, Vitaly
    Mukhamediev, Ravil
    Brinzei, Nicolae
    Kovalenko, Andriy
    Symagulov, Adilkhan
    MATHEMATICS, 2023, 11 (11)
  • [19] Learn by Observation: Imitation Learning for Drone Patrolling from Videos of A Human Navigator
    Fan, Yue
    Chu, Shilei
    Zhang, Wei
    Song, Ran
    Li, Yibin
    2020 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2020, : 5209 - 5216
  • [20] Cooperative and Competitive Reinforcement and Imitation Learning for a Mixture of Heterogeneous Learning Modules
    Uchibe, Eiji
    FRONTIERS IN NEUROROBOTICS, 2018, 12