Mitigating Bus Bunching via Hierarchical Multi-Agent Reinforcement Learning

被引：0

作者：

Yu, Mengdi ^{[1
,2
]}

Yang, Tao ^{[3
]}

Li, Chunxiao ^{[4
,5
]}

Jin, Yaohui ^{[1
,2
]}

Xu, Yanyan ^{[1
,2
]}

机构：

[1] Shanghai Jiao Tong Univ, AI Inst, MoE Key Lab Artificial Intelligence, Shanghai 200240, Peoples R China

[2] Shanghai Jiao Tong Univ, Data Driven Management Decis Making Lab, Shanghai 200240, Peoples R China

[3] Shanghai Urban Rural Construct & Transportat Dev, Shanghai Transportat Informat Ctr, Shanghai 200032, Peoples R China

[4] Univ Sci & Technol China, Sch SciTech Business, Hefei 230026, Peoples R China

[5] Univ Sci & Technol China, Sch Management, Hefei 230026, Peoples R China

来源：

IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS | 2024年 / 25卷 / 08期

基金：

美国国家科学基金会;

关键词：

Velocity control; Reinforcement learning; Vehicle dynamics; Reliability; Stability analysis; Roads; Mathematical models; Bus bunching; multiple strategies; hierarchical multi-agent reinforcement learning; TIME; IMPROVE;

D O I：

10.1109/TITS.2024.3362813

中图分类号：

TU [建筑科学];

学科分类号：

0813 ;

摘要：

Bus bunching is harmful to the efficiency and stability of bus transit systems, consequently delaying the arrival time of passengers and lowering the public transportation's adoption rate. Traditional solutions adjust the additional holding time of buses at certain stations to mitigate this phenomenon. These methods sacrifice the system efficiency in exchange for even headway between neighboring buses. Little work focuses on optimizing multiple strategies when a single bus line not only has a bus bay to increase bus dwell time but also owns several dedicated bus lanes to accelerate. In this work, we develop a hierarchical multi-agent reinforcement learning (HMARL) framework to combine these two strategies. Speeding up certain buses via dedicated lanes can counteract the negative influence of additional holding time. Next, to support the two strategies, we devise a two-layer policy scheme, one for high-level policy deciding holding or accelerating and the other for low-level policy determining the specific dwell time or increase of speed. Besides, to handle the issue that the controlling actions of agents are asynchronous and temporally extended, we establish a duration-critic module based on the Recurrent Neural Networks (RNN) mechanism to model other agents' impact during the period between two consecutive control. We evaluate the proposed framework on a simulated bus line with a quasi-real-world pattern to compare the performance of both traditional headway-based control methods and existing MARL methods. Results show that our method outperforms other baselines, not only stabilizing a strongly unstable bus line but also shortening the traveling times of passengers.

引用

页码：9675 / 9692

页数：18

共 50 条

[1] Reducing Bus Bunching with Asynchronous Multi-Agent Reinforcement Learning
Wang, Jiawei
Sun, Lijun
[J]. PROCEEDINGS OF THE THIRTIETH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2021, 2021, : 426 - 433
[2] Hierarchical multi-agent reinforcement learning
Mohammad Ghavamzadeh
Sridhar Mahadevan
Rajbala Makar
[J]. Autonomous Agents and Multi-Agent Systems, 2006, 13 : 197 - 229
[3] Hierarchical multi-agent reinforcement learning
Ghavamzadeh, Mohammad
Mahadevan, Sridhar
Makar, Rajbala
[J]. AUTONOMOUS AGENTS AND MULTI-AGENT SYSTEMS, 2006, 13 (02) : 197 - 229
[4] Dynamic holding control to avoid bus bunching: A multi-agent deep reinforcement learning framework
Wang, Jiawei
Sun, Lijun
[J]. TRANSPORTATION RESEARCH PART C-EMERGING TECHNOLOGIES, 2020, 116 (116)
[5] Multi-objective multi-agent deep reinforcement learning to reduce bus bunching for multiline services with a shared corridor
Wang, Jiawei
Sun, Lijun
[J]. TRANSPORTATION RESEARCH PART C-EMERGING TECHNOLOGIES, 2023, 155
[6] Hierarchical reinforcement learning via dynamic subspace search for multi-agent planning
Ma, Aaron
Ouimet, Michael
Cortes, Jorge
[J]. AUTONOMOUS ROBOTS, 2020, 44 (3-4) : 485 - 503
[7] Hierarchical reinforcement learning via dynamic subspace search for multi-agent planning
Aaron Ma
Michael Ouimet
Jorge Cortés
[J]. Autonomous Robots, 2020, 44 : 485 - 503
[8] Studies on hierarchical reinforcement learning in multi-agent environment
Yu Lasheng
Marin, Alonso
Hong Fei
Lin Jian
[J]. PROCEEDINGS OF 2008 IEEE INTERNATIONAL CONFERENCE ON NETWORKING, SENSING AND CONTROL, VOLS 1 AND 2, 2008, : 1714 - 1720
[9] Multi-Agent Hierarchical Reinforcement Learning with Dynamic Termination
Han, Dongge
Boehmer, Wendelin
Wooldridge, Michael
Rogers, Alex
[J]. AAMAS '19: PROCEEDINGS OF THE 18TH INTERNATIONAL CONFERENCE ON AUTONOMOUS AGENTS AND MULTIAGENT SYSTEMS, 2019, : 2006 - 2008
[10] Multi-agent hierarchical reinforcement learning for energy management
Jendoubi, Imen
Bouffard, Francois
[J]. APPLIED ENERGY, 2023, 332

← 1 2 3 4 5 →