A Multi-Agent Reinforcement Learning Approach for Conflict Resolution in Dense Traffic Scenarios

被引:7
|
作者
Lai, Jiajian [1 ]
Cai, Kaiquan [1 ]
Liu, Zhaoxuan [1 ]
Yang, Yang [2 ]
机构
[1] Beihang Univ, Sch Elect & Informat Engn, Beijing, Peoples R China
[2] Beihang Univ, Natl Key Lab CNS ATM, Beijing, Peoples R China
基金
中国国家自然科学基金;
关键词
reinforcement learning; conflict detection and resolution; multi-agent deep deterministic policy gradient; air traffic control;
D O I
10.1109/DASC52595.2021.9594437
中图分类号
V [航空、航天];
学科分类号
08 ; 0825 ;
摘要
A multi-agent reinforcement learning (MARL) based conflict resolution method is proposed. The motivation is to reduce the workloads of air traffic controllers (ATCOs) and pilots in operation over the dense airspace. First, a intermediate waypoints generation method is presented to avoid the frequent fine-tuning in the resolution process. This method enables the controllers and pilots to resolve conflicts in one-step decision making. Next, the multi-agent reinforcement learning method is used to search for the optimal intermediate waypoints. Several numerical examples are presented to illustrate the proposed methodology. A detailed discussion of the sample efficiency with respect to various number of agents is given. Both the benchmark and practical examples are used for validation. The proposed method is able to handle the mulit-conflict scenarios without recourse to frequent disturbance of the pilots and controllers.
引用
收藏
页数:9
相关论文
共 50 条
  • [41] Microscopic Traffic Simulation by Cooperative Multi-agent Deep Reinforcement Learning
    Bacchiani, Giulio
    Molinari, Daniele
    Patander, Marco
    AAMAS '19: PROCEEDINGS OF THE 18TH INTERNATIONAL CONFERENCE ON AUTONOMOUS AGENTS AND MULTIAGENT SYSTEMS, 2019, : 1547 - 1555
  • [42] Online optimization of traffic policy through multi-agent reinforcement learning
    Sasaki, Y
    Flann, NS
    PROCEEDINGS OF THE 7TH JOINT CONFERENCE ON INFORMATION SCIENCES, 2003, : 1211 - 1214
  • [43] Learning without Gradients: Multi-Agent Reinforcement Learning approach to optimization
    Morcos, Amir
    Man, Hong
    West, Aaron
    Maguire, Brian
    ARTIFICIAL INTELLIGENCE AND MACHINE LEARNING IN DEFENSE APPLICATIONS IV, 2022, 12276
  • [44] Micro Junction Agent: A Scalable Multi-agent Reinforcement Learning Method for Traffic Control
    Choi, BumKyu
    Choe, Jean Seong Bjorn
    Kim, Jong-kook
    ICAART: PROCEEDINGS OF THE 14TH INTERNATIONAL CONFERENCE ON AGENTS AND ARTIFICIAL INTELLIGENCE - VOL 3, 2022, : 509 - 515
  • [45] Reinforcement Learning Approach for Cooperative Control of Multi-Agent Systems
    Javalera-Rincon, Valeria
    Puig Cayuela, Vicenc
    Morcego Seix, Bernardo
    Orduna-Cabrera, Fernando
    PROCEEDINGS OF THE 11TH INTERNATIONAL CONFERENCE ON AGENTS AND ARTIFICIAL INTELLIGENCE (ICAART), VOL 2, 2019, : 80 - 91
  • [46] A Multi-Agent Reinforcement Learning Approach for Stock Portfolio Allocation
    Koratamaddi, Prahlad
    Wadhwani, Karan
    Gupta, Mridul
    Sanjeevi, Sriram G.
    CODS-COMAD 2021: PROCEEDINGS OF THE 3RD ACM INDIA JOINT INTERNATIONAL CONFERENCE ON DATA SCIENCE & MANAGEMENT OF DATA (8TH ACM IKDD CODS & 26TH COMAD), 2021, : 410 - 410
  • [47] Multi-agent reinforcement learning approach for hedging portfolio problem
    Pham, Uyen
    Luu, Quoc
    Tran, Hien
    SOFT COMPUTING, 2021, 25 (12) : 7877 - 7885
  • [48] A Tensor Factorization Approach to Generalization in Multi-Agent Reinforcement Learning
    Bromuri, Stefano
    2012 IEEE/WIC/ACM INTERNATIONAL CONFERENCE ON WEB INTELLIGENCE AND INTELLIGENT AGENT TECHNOLOGY (WI-IAT 2012), VOL 2, 2012, : 274 - 281
  • [49] A Sample Efficient Multi-Agent Approach to Continuous Reinforcement Learning
    Corcoran, Diarmuid
    Kreuger, Per
    Boman, Magnus
    2022 18TH INTERNATIONAL CONFERENCE ON NETWORK AND SERVICE MANAGEMENT (CNSM 2022): INTELLIGENT MANAGEMENT OF DISRUPTIVE NETWORK TECHNOLOGIES AND SERVICES, 2022, : 338 - 344
  • [50] A multi-agent reinforcement learning approach to dynamic service composition
    Wang, Hongbing
    Wang, Xiaojun
    Hu, Xingguo
    Zhang, Xingzhi
    Gu, Mingzhu
    INFORMATION SCIENCES, 2016, 363 : 96 - 119