Mitigating Bus Bunching via Hierarchical Multi-Agent Reinforcement Learning

被引:0
|
作者
Yu, Mengdi [1 ,2 ]
Yang, Tao [3 ]
Li, Chunxiao [4 ,5 ]
Jin, Yaohui [1 ,2 ]
Xu, Yanyan [1 ,2 ]
机构
[1] Shanghai Jiao Tong Univ, AI Inst, MoE Key Lab Artificial Intelligence, Shanghai 200240, Peoples R China
[2] Shanghai Jiao Tong Univ, Data Driven Management Decis Making Lab, Shanghai 200240, Peoples R China
[3] Shanghai Urban Rural Construct & Transportat Dev, Shanghai Transportat Informat Ctr, Shanghai 200032, Peoples R China
[4] Univ Sci & Technol China, Sch SciTech Business, Hefei 230026, Peoples R China
[5] Univ Sci & Technol China, Sch Management, Hefei 230026, Peoples R China
基金
美国国家科学基金会;
关键词
Velocity control; Reinforcement learning; Vehicle dynamics; Reliability; Stability analysis; Roads; Mathematical models; Bus bunching; multiple strategies; hierarchical multi-agent reinforcement learning; TIME; IMPROVE;
D O I
10.1109/TITS.2024.3362813
中图分类号
TU [建筑科学];
学科分类号
0813 ;
摘要
Bus bunching is harmful to the efficiency and stability of bus transit systems, consequently delaying the arrival time of passengers and lowering the public transportation's adoption rate. Traditional solutions adjust the additional holding time of buses at certain stations to mitigate this phenomenon. These methods sacrifice the system efficiency in exchange for even headway between neighboring buses. Little work focuses on optimizing multiple strategies when a single bus line not only has a bus bay to increase bus dwell time but also owns several dedicated bus lanes to accelerate. In this work, we develop a hierarchical multi-agent reinforcement learning (HMARL) framework to combine these two strategies. Speeding up certain buses via dedicated lanes can counteract the negative influence of additional holding time. Next, to support the two strategies, we devise a two-layer policy scheme, one for high-level policy deciding holding or accelerating and the other for low-level policy determining the specific dwell time or increase of speed. Besides, to handle the issue that the controlling actions of agents are asynchronous and temporally extended, we establish a duration-critic module based on the Recurrent Neural Networks (RNN) mechanism to model other agents' impact during the period between two consecutive control. We evaluate the proposed framework on a simulated bus line with a quasi-real-world pattern to compare the performance of both traditional headway-based control methods and existing MARL methods. Results show that our method outperforms other baselines, not only stabilizing a strongly unstable bus line but also shortening the traveling times of passengers.
引用
收藏
页码:9675 / 9692
页数:18
相关论文
共 50 条
  • [1] Reducing Bus Bunching with Asynchronous Multi-Agent Reinforcement Learning
    Wang, Jiawei
    Sun, Lijun
    [J]. PROCEEDINGS OF THE THIRTIETH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2021, 2021, : 426 - 433
  • [2] Hierarchical multi-agent reinforcement learning
    Mohammad Ghavamzadeh
    Sridhar Mahadevan
    Rajbala Makar
    [J]. Autonomous Agents and Multi-Agent Systems, 2006, 13 : 197 - 229
  • [3] Hierarchical multi-agent reinforcement learning
    Ghavamzadeh, Mohammad
    Mahadevan, Sridhar
    Makar, Rajbala
    [J]. AUTONOMOUS AGENTS AND MULTI-AGENT SYSTEMS, 2006, 13 (02) : 197 - 229
  • [4] Dynamic holding control to avoid bus bunching: A multi-agent deep reinforcement learning framework
    Wang, Jiawei
    Sun, Lijun
    [J]. TRANSPORTATION RESEARCH PART C-EMERGING TECHNOLOGIES, 2020, 116 (116)
  • [5] Multi-objective multi-agent deep reinforcement learning to reduce bus bunching for multiline services with a shared corridor
    Wang, Jiawei
    Sun, Lijun
    [J]. TRANSPORTATION RESEARCH PART C-EMERGING TECHNOLOGIES, 2023, 155
  • [6] Hierarchical reinforcement learning via dynamic subspace search for multi-agent planning
    Ma, Aaron
    Ouimet, Michael
    Cortes, Jorge
    [J]. AUTONOMOUS ROBOTS, 2020, 44 (3-4) : 485 - 503
  • [7] Hierarchical reinforcement learning via dynamic subspace search for multi-agent planning
    Aaron Ma
    Michael Ouimet
    Jorge Cortés
    [J]. Autonomous Robots, 2020, 44 : 485 - 503
  • [8] Studies on hierarchical reinforcement learning in multi-agent environment
    Yu Lasheng
    Marin, Alonso
    Hong Fei
    Lin Jian
    [J]. PROCEEDINGS OF 2008 IEEE INTERNATIONAL CONFERENCE ON NETWORKING, SENSING AND CONTROL, VOLS 1 AND 2, 2008, : 1714 - 1720
  • [9] Multi-Agent Hierarchical Reinforcement Learning with Dynamic Termination
    Han, Dongge
    Boehmer, Wendelin
    Wooldridge, Michael
    Rogers, Alex
    [J]. AAMAS '19: PROCEEDINGS OF THE 18TH INTERNATIONAL CONFERENCE ON AUTONOMOUS AGENTS AND MULTIAGENT SYSTEMS, 2019, : 2006 - 2008
  • [10] Multi-agent hierarchical reinforcement learning for energy management
    Jendoubi, Imen
    Bouffard, Francois
    [J]. APPLIED ENERGY, 2023, 332