Demonstration-guided deep reinforcement learning for coordinated ramp metering and perimeter control in large scale networks

被引：4

作者：

Hu, Zijian ^{[1
]}

Ma, Wei ^{[1
,2
]}

机构：

[1] Hong Kong Polytech Univ, Civil & Environm Engn, Kowloon, Hong Kong, Peoples R China

[2] Hong Kong Polytech Univ, Res Inst Sustainable Urban Dev, Hung Hom, Hong Kong 999077, Peoples R China

来源：

TRANSPORTATION RESEARCH PART C-EMERGING TECHNOLOGIES | 2024年 / 159卷

关键词：

Intelligent transportation systems; Dynamic network models; Coordinated traffic control; Deep reinforcement learning; Large-scale networks; MODEL-PREDICTIVE CONTROL; CELL TRANSMISSION MODEL; FUNDAMENTAL DIAGRAM; MIXED NETWORK; URBAN; LEVEL;

D O I：

10.1016/j.trc.2023.104461

中图分类号：

U [交通运输];

学科分类号：

08 ; 0823 ;

摘要：

Effective traffic control methods have great potential in alleviating network congestion. Particularly, in an urban network consisting of heterogeneous roads (e.g., freeways and urban roads), how to integrate and coordinate control policies on different roads is a critical issue in largescale networks. This study addresses this question from two aspects: modeling and control. From the modeling aspect, we formulate the hybrid traffic modeling in heterogeneous networks with the Asymmetric Cell Transmission Model (ACTM) for freeways and the generalized bathtub model for urban roads. For the control aspect, this study considers two representative control approaches: ramp metering for freeways and perimeter control for urban roads, and we aim to develop a deep reinforcement learning (DRL)-based coordinated control framework for largescale networks. However, there are two significant challenges in the coordinated control in large-scale networks with DRL methods: non -stationary environment and large search space. To address both issues, we incorporate the demonstration to guide the DRL method for better convergence by introducing the concept of "teacher"and "student"models. The teacher models are traditional controllers that provide control demonstrations. For instance, ALINEA and Gating are two representative feedback controllers for ramp metering and perimeter control which can be "teacher"models. The student models are DRL methods, which learn from teachers and aim to surpass the teachers' performance. Additionally, we develop a parallel training scheme to accelerate the proposed DRL method. To validate the proposed framework, we conduct two case studies in a small-scale network and a real -world large-scale traffic network in Hong Kong. Numerical results show that the proposed DRL method outperforms demonstrators as well as DRL methods, and the coordinated control is more effective than just controlling ramps or perimeters respectively. The research outcome reveals the great potential of combining traditional controllers with DRL for coordinated control in large-scale networks.

引用

页数：30

共 50 条

[31] Cooperative Deep Reinforcement Learning for Large-Scale Traffic Grid Signal Control
Tan, Tian
Bao, Feng
Deng, Yue
Jin, Alex
Dai, Qionghai
Wang, Jie
IEEE TRANSACTIONS ON CYBERNETICS, 2020, 50 (06) : 2687 - 2700
[32] Distributed Hierarchical Deep Reinforcement Learning for Large-Scale Grid Emergency Control
Chen, Yixi
Zhu, Jizhong
Liu, Yun
Zhang, Le
Zhou, Jialin
IEEE TRANSACTIONS ON POWER SYSTEMS, 2024, 39 (02) : 4446 - 4458
[33] A dynamic self-improving ramp metering algorithm based on multi-agent deep reinforcement learning
Deng, Fuwen
Jin, Jiandong
Shen, Yu
Du, Yuchuan
TRANSPORTATION LETTERS-THE INTERNATIONAL JOURNAL OF TRANSPORTATION RESEARCH, 2024, 16 (07): : 649 - 657
[34] Advanced Self-Improving Ramp Metering Algorithm based on Multi-Agent Deep Reinforcement Learning
Deng, Fuwen
Jin, Jiandong
Shen, Yu
Du, Yuchuan
2019 IEEE INTELLIGENT TRANSPORTATION SYSTEMS CONFERENCE (ITSC), 2019, : 3804 - 3809
[35] Adaptive Coordinated Variable Speed Limit between Highway Mainline and On-Ramp with Deep Reinforcement Learning
Cheng, Ming
Zhang, Chenghao
Jin, Hui
Wang, Ziming
Yang, Xiaoguang
JOURNAL OF ADVANCED TRANSPORTATION, 2022, 2022
[36] Adaptive Coordinated Variable Speed Limit between Highway Mainline and On-Ramp with Deep Reinforcement Learning
Cheng, Ming
Zhang, Chenghao
Jin, Hui
Wang, Ziming
Yang, Xiaoguang
Journal of Advanced Transportation, 2022, 2022
[37] Grid-area coordinated load frequency control strategy using large-scale multi-agent deep reinforcement learning
Li, Jiawen
Geng, Jian
Yu, Tao
ENERGY REPORTS, 2022, 8 : 255 - 274
[38] Traffic Signal Control for Large-Scale Road Networks Based on Deep Reinforcement with PSR
Zhou, Zhicheng
Zhang, Hui
Zhang, Ya
2024 3RD CONFERENCE ON FULLY ACTUATED SYSTEM THEORY AND APPLICATIONS, FASTA 2024, 2024, : 848 - 853
[39] Large Scale Deep Reinforcement Learning in War-games
Wang, Hanchao
Tang, Hongyao
Hao, Jianye
Hao, Xiaotian
Fu, Yue
Ma, Yi
2020 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE, 2020, : 1693 - 1699
[40] Multi-Agent Deep Reinforcement Learning for Coordinated Multipoint in Mobile Networks
Schneider, Stefan
Karl, Holger
Khalili, Ramin
Hecker, Artur
IEEE TRANSACTIONS ON NETWORK AND SERVICE MANAGEMENT, 2024, 21 (01): : 908 - 924

← 1 2 3 4 5 →