Graph Reinforcement Learning for Multi-Aircraft Conflict Resolution

被引:0
|
作者
Li, Yumeng [1 ]
Zhang, Yunhe [1 ]
Guo, Tong [1 ]
Liu, Yu [2 ,3 ]
Lv, Yisheng [4 ]
Du, Wenbo [1 ]
机构
[1] Beihang Univ, Sch Elect & Informat Engn, Beijing 100191, Peoples R China
[2] Naval Aviat Univ, Inst Informat Fus, Yantai 264001, Peoples R China
[3] Tsinghua Univ, Dept Elect Engn, Beijing 100083, Peoples R China
[4] Chinese Acad Sci, Inst Automat, Beijing 100045, Peoples R China
来源
基金
国家重点研发计划;
关键词
Aircraft; Air traffic control; Decision making; Atmospheric modeling; Intelligent vehicles; Scalability; Reinforcement learning; Conflict resolution; graph reinforcement learning; air traffic management; DYNAMIC ENVIRONMENTS; AVOIDANCE; VEHICLE;
D O I
10.1109/TIV.2024.3364652
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The escalating density of airspace has led to sharply increased conflicts between aircraft. Efficient and scalable conflict resolution methods are crucial to mitigate collision risks. Existing learning-based methods become less effective as the scale of aircraft increases due to their redundant information representations. In this paper, to accommodate the increased airspace density, a novel graph reinforcement learning (GRL) method is presented to efficiently learn deconfliction strategies. A time-evolving conflict graph is exploited to represent the local state of individual aircraft and the global spatiotemporal relationships between them. Equipped with the conflict graph, GRL can efficiently learn deconfliction strategies by selectively aggregating aircraft state information through a multi-head attention-boosted graph neural network. Furthermore, a temporal regularization mechanism is proposed to enhance learning stability in highly dynamic environments. Comprehensive experimental studies have been conducted on an OpenAI Gym-based flight simulator. Compared with the existing state-of-the-art learning-based methods, the results demonstrate that GRL can save much training time while achieving significantly better deconfliction strategies in terms of safety and efficiency metrics. In addition, GRL has a strong power of scalability and robustness with increasing aircraft scale.
引用
收藏
页码:4529 / 4540
页数:12
相关论文
共 50 条
  • [21] A multi-aircraft conflict detection and resolution method for 4-dimensional trajectory-based operation
    Hao, Siqi
    Cheng, Shaowu
    Zhang, Yaping
    [J]. CHINESE JOURNAL OF AERONAUTICS, 2018, 31 (07) : 1579 - 1593
  • [22] A multi-aircraft conflict detection and resolution method for 4-dimensional trajectory-based operation
    [J]. ZHANG, Yaping (zxlt0905@163.com), 1600, Chinese Journal of Aeronautics (31):
  • [23] A multi-aircraft conflict detection and resolution method for 4-dimensional trajectory-based operation
    Siqi HAO
    Shaowu CHENG
    Yaping ZHANG
    [J]. Chinese Journal of Aeronautics, 2018, 31 (07) : 1579 - 1593
  • [24] A hybrid distributed-centralized conflict resolution approach for multi-aircraft based on cooperative co-evolutionary
    ZHANG XueJun
    GUAN XiangMin
    HWANG Inseok
    CAI KaiQuan
    [J]. Science China(Information Sciences), 2013, 56 (12) : 281 - 296
  • [25] Study on the resolution of multi-aircraft flight conflicts based on an IDQN
    Sui, Dong
    Xu, Weiping
    Zhang, Kai
    [J]. CHINESE JOURNAL OF AERONAUTICS, 2022, 35 (02) : 195 - 213
  • [26] Study on the resolution of multi-aircraft flight conflicts based on an IDQN
    Dong SUI
    Weiping XU
    Kai ZHANG
    [J]. Chinese Journal of Aeronautics, 2022, 35 (02) : 195 - 213
  • [27] A hybrid distributed-centralized conflict resolution approach for multi-aircraft based on cooperative co-evolutionary
    Zhang XueJun
    Guan XiangMin
    Hwang, Inseok
    Cai KaiQuan
    [J]. SCIENCE CHINA-INFORMATION SCIENCES, 2013, 56 (12) : 1 - 16
  • [28] A hybrid distributed-centralized conflict resolution approach for multi-aircraft based on cooperative co-evolutionary
    XueJun Zhang
    XiangMin Guan
    Inseok Hwang
    KaiQuan Cai
    [J]. Science China Information Sciences, 2013, 56 : 1 - 16
  • [29] Probabilistic multi-aircraft conflict detection approach for trajectory-based operation
    Hao, Siqi
    Zhang, Yaping
    Cheng, Shaowu
    Liu, Ruiwei
    Xing, Zhiwei
    [J]. TRANSPORTATION RESEARCH PART C-EMERGING TECHNOLOGIES, 2018, 95 : 698 - 712
  • [30] Multi-aircraft Attack Guidance Law Based on Supervised Learning
    Lyu, Zhongshi
    Guo, Xiaoyun
    Xu, Cheng
    Hao, Mingrui
    Liang, Xinkai
    [J]. PROCEEDINGS OF 2022 INTERNATIONAL CONFERENCE ON AUTONOMOUS UNMANNED SYSTEMS, ICAUS 2022, 2023, 1010 : 2745 - 2751