Scheduling in Multiagent Systems Using Reinforcement Learning

Cited by: 0
Authors
Minashina, I. K. [1]
Gorbachev, R. A. [1]
Zakharova, E. M. [1]
Affiliations
[1] Natl Res Univ, Moscow Inst Phys & Technol, Dolgoprudnyi 141700, Moscow oblast, Russia
Keywords
reinforcement learning; multiagent systems; railroads; Flatland; reward function structuring; curriculum learning; centralized critic
DOI
10.1134/S1064562422060175
Chinese Library Classification number
O1 [Mathematics]
Subject classification code
0701; 070101
Abstract
The paper is devoted to scheduling in multiagent systems in the framework of the Flatland 3 competition. The main aim of this competition is to develop an algorithm for the effective control of dense traffic in complex railroad networks according to a given schedule. The proposed solution is based on reinforcement learning. To adapt this method to the particular scheduling problem, a novel approach based on structuring the reward function that stimulates an agent to adhere to its schedule was developed. The architecture of the proposed model is based on a multiagent version of a centralized critic with proximal policy optimization (PPO) learning. In addition, a curriculum learning strategy was developed and implemented. This allowed the agent to cope with each level of complexity on time and train the model in more difficult conditions. The proposed solution won first place in the Flatland 3 competition in the reinforcement learning track.
Pages: S70 - S78
Number of pages: 9
Related papers
50 in total
  • [1] Scheduling in Multiagent Systems Using Reinforcement Learning
    I. K. Minashina
    R. A. Gorbachev
    E. M. Zakharova
    [J]. Doklady Mathematics, 2022, 106 : S70 - S78
  • [2] An Adaptive Charging Scheduling for Electric Vehicles Using Multiagent Reinforcement Learning
    Lee, Xian-Long
    Yang, Hong-Tzer
    Tang, Wenjun
    Toosi, Adel N.
    Lam, Edward
    [J]. SERVICE-ORIENTED COMPUTING (ICSOC 2021), 2021, 13121 : 273 - 286
  • [3] Coordination in multiagent reinforcement learning systems
    Kamal, MAS
    Murata, J
    [J]. KNOWLEDGE-BASED INTELLIGENT INFORMATION AND ENGINEERING SYSTEMS, PT 1, PROCEEDINGS, 2004, 3213 : 1197 - 1204
  • [4] Multiairport Departure Scheduling via Multiagent Reinforcement Learning
    Cai, Kaiquan
    Li, Ziqi
    Guo, Tong
    Du, Wenbo
    [J]. IEEE INTELLIGENT TRANSPORTATION SYSTEMS MAGAZINE, 2024, 16 (02) : 102 - 116
  • [5] Opportunities for multiagent systems and multiagent reinforcement learning in traffic control
    Bazzan, Ana L. C.
    [J]. AUTONOMOUS AGENTS AND MULTI-AGENT SYSTEMS, 2009, 18 (03) : 342 - 375
  • [6] Opportunities for multiagent systems and multiagent reinforcement learning in traffic control
    Ana L. C. Bazzan
    [J]. Autonomous Agents and Multi-Agent Systems, 2009, 18 : 342 - 375
  • [7] A survey on transfer learning for multiagent reinforcement learning systems
    Da Silva, Felipe Leno
    Reali Costa, Anna Helena
    [J]. Journal of Artificial Intelligence Research, 2019, 64 : 645 - 703
  • [8] A Survey on Transfer Learning for Multiagent Reinforcement Learning Systems
    Da Silva, Felipe Leno
    Reali Costa, Anna Helena
    [J]. JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 2019, 64 : 645 - 703
  • [9] Multiagent Collaboration for Emergency Evacuation Using Reinforcement Learning for Transportation Systems
    Yang Y.
    Yu J.
    Liu D.
    Lee S.-A.
    Namilae S.
    Islam S.
    Gou H.
    Park H.
    Song H.
    [J]. IEEE Journal on Miniaturization for Air and Space Systems, 2022, 3 (04) : 232 - 241
  • [10] An Advising Framework for Multiagent Reinforcement Learning Systems
    da Silva, Felipe Leno
    Glatt, Ruben
    Reali Costa, Anna Helena
    [J]. THIRTY-FIRST AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2017 : 4913 - 4914