An Interrelated Imitation Learning Method for Heterogeneous Drone Swarm Coordination

被引:2
|
作者
Yang, Bo [1 ]
Ma, Chaofan [2 ]
Xia, Xiaofang [3 ]
机构
[1] Northwest A&F Univ, Coll Informat Engn, Yangling 712100, Shaanxi, Peoples R China
[2] Zhongyuan Univ Technol, Software Coll, Zhengzhou 450007, Henan, Peoples R China
[3] Xidian Univ, Sch Comp Sci & Technol, Xian 710071, Shaanxi, Peoples R China
基金
中国国家自然科学基金;
关键词
Unmanned aerial vehicles; formation control; multi-agent imitation learning; latent belief representation;
D O I
10.1109/TETC.2022.3202297
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The proliferation of small drones has boosted diverse intelligent services, in which the effective swarm coordination plays a vital role in enhancing execution efficiency. However, owing to unreliable air communication and heterogeneous computation capabilities, it is difficult to achieve coordinated actions particularly in distributed scenarios with incomplete observations. In this article, we utilize the generative adversarial imitation learning (GAIL) model to coordinate the drones' maneuvers by imitating the peer's demonstrations. However, incomplete observations will lead to inaccurate imitation policies. In order to recover true environment states, we encode historical observation-action trajectories into latent belief representations, which are trained in correlation to imitation policies. Moreover, by merging the trace of historical contexts, the prediction of future states and the action-assisted guidance information, we gain robust belief representations, which lead to more accurate imitation policies. We evaluate the algorithm performance via the drones' formation control task. Experiment results display the superiorities on imitation accuracy, execution time and energy cost.
引用
收藏
页码:1704 / 1716
页数:13
相关论文
共 50 条
  • [31] Dynamic Resource Management of Heterogeneous Mobile Platforms via Imitation Learning
    Mandal, Sumit K.
    Bhat, Ganapati
    Patil, Chetan Arvind
    Doppa, Janardhan Rao
    Pande, Partha Pratim
    Ogras, Umit Y.
    IEEE TRANSACTIONS ON VERY LARGE SCALE INTEGRATION (VLSI) SYSTEMS, 2019, 27 (12) : 2842 - 2854
  • [32] Hector: A Reinforcement Learning-based Scheduler for Minimizing Casualties of a Military Drone Swarm
    Jin, Heng
    Liu, Qingyu
    Li, Chengzhang
    Hou, Y. Thomas
    Lou, Wenjing
    Kompella, Sastry
    2022 IEEE MILITARY COMMUNICATIONS CONFERENCE (MILCOM), 2022,
  • [33] Robot imitation learning method based on structural grammar
    Cong M.
    Jian J.
    Zou Q.
    Liu D.
    Huazhong Keji Daxue Xuebao (Ziran Kexue Ban)/Journal of Huazhong University of Science and Technology (Natural Science Edition), 2021, 49 (10): : 97 - 102
  • [34] Imitation Learning with Graph Neural Networks for Improving Swarm Robustness under Restricted Communications
    Guo, Ce
    Zhu, Pengming
    Zhou, Zhiqian
    Lang, Lin
    Zeng, Zhiwen
    Lu, Huimin
    APPLIED SCIENCES-BASEL, 2021, 11 (19):
  • [35] Adaptive heterogeneous particle swarm optimization with comprehensive learning strategy
    Liu, Ziang
    Nishi, Tatsushi
    JOURNAL OF ADVANCED MECHANICAL DESIGN SYSTEMS AND MANUFACTURING, 2022, 16 (04):
  • [36] Learning and equilibrium selection in a coordination game with heterogeneous agents
    Fogale, Alberto
    Pellizzari, Paolo
    Warglien, Massimo
    PHYSICA A-STATISTICAL MECHANICS AND ITS APPLICATIONS, 2007, 380 : 519 - 527
  • [37] SON Coordination in Heterogeneous Networks: A Reinforcement Learning Framework
    Iacoboaiea, Ovidiu-Constantin
    Sayrac, Berna
    Ben Jemaa, Sana
    Bianchi, Pascal
    IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2016, 15 (09) : 5835 - 5847
  • [38] Multi-Target Tracking Method for Drone Swarm Based on Interaction Feature Extraction
    Li, Qi
    Yang, Xiaogang
    Xi, Jianxaing
    Lu, Ruitao
    2024 3RD INTERNATIONAL CONFERENCE ON ROBOTICS, ARTIFICIAL INTELLIGENCE AND INTELLIGENT CONTROL, RAIIC 2024, 2024, : 235 - 240
  • [39] SwarmGear: Heterogeneous Swarm of Drones with Morphogenetic Leader Drone and Virtual Impedance Links for Multi-Agent Inspection
    Darush, Zhanibek
    Martynov, Mikhail
    Fedoseev, Aleksey
    Shcherbak, Aleksei
    Tsetserukou, Dzmitry
    2023 INTERNATIONAL CONFERENCE ON UNMANNED AIRCRAFT SYSTEMS, ICUAS, 2023, : 557 - 563
  • [40] Coordination control design of heterogeneous swarm robots by means of task-oriented optimization
    Nishikawa, Naoki
    Suzuki, Reiji
    Arita, Takaya
    ARTIFICIAL LIFE AND ROBOTICS, 2016, 21 (01) : 57 - 68