Hierarchical multiagent reinforcement learning schemes for air traffic management

被引:0
|
作者
Christos Spatharis
Alevizos Bastas
Theocharis Kravaris
Konstantinos Blekas
George A. Vouros
Jose Manuel Cordero
机构
[1] University of Ioannina,Department of Computer Science and Engineering
[2] University of Piraeus,Department of Digital Systems
[3] CRIDA,undefined
来源
关键词
Multiagent reinforcement learning; Hierarchical learning; State abstraction; Congestion problems; Air traffic management;
D O I
暂无
中图分类号
学科分类号
摘要
In this work we investigate the use of hierarchical multiagent reinforcement learning methods for the computation of policies to resolve congestion problems in the air traffic management domain. To address cases where the demand of airspace use exceeds capacity, we consider agents representing flights, who need to decide on ground delays at the pre-tactical stage of operations, towards executing their trajectories while adhering to airspace capacity constraints. Hierarchical reinforcement learning manages to handle real-world problems with high complexity, by partitioning the task into hierarchies of states and/or actions. This provides an efficient way of exploring the state–action space and constructing an advantageous decision-making mechanism. We first establish a general framework of hierarchical multiagent reinforcement learning, and then, we further formulate four alternative schemes of abstractions, on states, actions, or both. To quantitatively assess the quality of solutions of the proposed approaches and show the potential of the hierarchical methods in resolving the demand–capacity balance problem, we provide experimental results on real-world evaluation cases, where we measure the average delay per flight and the number of flights with delays.
引用
收藏
页码:147 / 159
页数:12
相关论文
共 50 条
  • [1] Hierarchical multiagent reinforcement learning schemes for air traffic management
    Spatharis, Christos
    Bastas, Alevizos
    Kravaris, Theocharis
    Blekas, Konstantinos
    Vouros, George A.
    Manuel Cordero, Jose
    NEURAL COMPUTING & APPLICATIONS, 2023, 35 (01): : 147 - 159
  • [2] Collaborative multiagent reinforcement learning schemes for air traffic management
    Spatharis, Chistos
    Blekas, Konstantinos
    Bastas, Alevizos
    Kravaris, Theocharis
    Vouros, George A.
    2019 10TH INTERNATIONAL CONFERENCE ON INFORMATION, INTELLIGENCE, SYSTEMS AND APPLICATIONS (IISA), 2019, : 357 - 364
  • [3] Improving Air Traffic Management with a Learning Multiagent System
    Tumer, Kagan
    Agogino, Adrian
    IEEE INTELLIGENT SYSTEMS, 2009, 24 (01) : 18 - 21
  • [4] Multiagent Reinforcement Learning in Traffic and Transportation
    Bazzan, Ana
    2014 IEEE SYMPOSIUM ON COMPUTATIONAL INTELLIGENCE IN VEHICLES AND TRANSPORTATION SYSTEMS (CIVTS), 2014, : VII - VII
  • [5] Opportunities for multiagent systems and multiagent reinforcement learning in traffic control
    Bazzan, Ana L. C.
    AUTONOMOUS AGENTS AND MULTI-AGENT SYSTEMS, 2009, 18 (03) : 342 - 375
  • [6] Opportunities for multiagent systems and multiagent reinforcement learning in traffic control
    Ana L. C. Bazzan
    Autonomous Agents and Multi-Agent Systems, 2009, 18 : 342 - 375
  • [7] Multiagent traffic management: Opportunities for multiagent learning
    Dresner, Kurt
    Stone, Peter
    LEARNING AND ADAPTION IN MULTI-AGENT SYSTEMS, 2006, 3898 : 129 - 138
  • [8] Explaining deep reinforcement learning decisions in complex multiagent settings: towards enabling automation in air traffic flow management
    Kravaris, Theocharis
    Lentzos, Konstantinos
    Santipantakis, Georgios
    Vouros, George A.
    Andrienko, Gennady
    Andrienko, Natalia
    Crook, Ian
    Cordero Garcia, Jose Manuel
    Iglesias Martinez, Enrique
    APPLIED INTELLIGENCE, 2023, 53 (04) : 4063 - 4098
  • [9] Explaining deep reinforcement learning decisions in complex multiagent settings: towards enabling automation in air traffic flow management
    Theocharis Kravaris
    Konstantinos Lentzos
    Georgios Santipantakis
    George A. Vouros
    Gennady Andrienko
    Natalia Andrienko
    Ian Crook
    Jose Manuel Cordero Garcia
    Enrique Iglesias Martinez
    Applied Intelligence, 2023, 53 : 4063 - 4098
  • [10] Hierarchical Multiagent Reinforcement Learning for Allocating Guaranteed Display Ads
    Wang, Lu
    Han, Lei
    Chen, Xinru
    Li, Chengchang
    Huang, Junzhou
    Zhang, Weinan
    Zhang, Wei
    He, Xiaofeng
    Luo, Dijun
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022, 33 (10) : 5361 - 5373