Demonstration-guided deep reinforcement learning for coordinated ramp metering and perimeter control in large scale networks

被引:4
|
作者
Hu, Zijian [1 ]
Ma, Wei [1 ,2 ]
机构
[1] Hong Kong Polytech Univ, Civil & Environm Engn, Kowloon, Hong Kong, Peoples R China
[2] Hong Kong Polytech Univ, Res Inst Sustainable Urban Dev, Hung Hom, Hong Kong 999077, Peoples R China
关键词
Intelligent transportation systems; Dynamic network models; Coordinated traffic control; Deep reinforcement learning; Large-scale networks; MODEL-PREDICTIVE CONTROL; CELL TRANSMISSION MODEL; FUNDAMENTAL DIAGRAM; MIXED NETWORK; URBAN; LEVEL;
D O I
10.1016/j.trc.2023.104461
中图分类号
U [交通运输];
学科分类号
08 ; 0823 ;
摘要
Effective traffic control methods have great potential in alleviating network congestion. Particularly, in an urban network consisting of heterogeneous roads (e.g., freeways and urban roads), how to integrate and coordinate control policies on different roads is a critical issue in largescale networks. This study addresses this question from two aspects: modeling and control. From the modeling aspect, we formulate the hybrid traffic modeling in heterogeneous networks with the Asymmetric Cell Transmission Model (ACTM) for freeways and the generalized bathtub model for urban roads. For the control aspect, this study considers two representative control approaches: ramp metering for freeways and perimeter control for urban roads, and we aim to develop a deep reinforcement learning (DRL)-based coordinated control framework for largescale networks. However, there are two significant challenges in the coordinated control in large-scale networks with DRL methods: non -stationary environment and large search space. To address both issues, we incorporate the demonstration to guide the DRL method for better convergence by introducing the concept of "teacher"and "student"models. The teacher models are traditional controllers that provide control demonstrations. For instance, ALINEA and Gating are two representative feedback controllers for ramp metering and perimeter control which can be "teacher"models. The student models are DRL methods, which learn from teachers and aim to surpass the teachers' performance. Additionally, we develop a parallel training scheme to accelerate the proposed DRL method. To validate the proposed framework, we conduct two case studies in a small-scale network and a real -world large-scale traffic network in Hong Kong. Numerical results show that the proposed DRL method outperforms demonstrators as well as DRL methods, and the coordinated control is more effective than just controlling ramps or perimeters respectively. The research outcome reveals the great potential of combining traditional controllers with DRL for coordinated control in large-scale networks.
引用
收藏
页数:30
相关论文
共 50 条
  • [31] Cooperative Deep Reinforcement Learning for Large-Scale Traffic Grid Signal Control
    Tan, Tian
    Bao, Feng
    Deng, Yue
    Jin, Alex
    Dai, Qionghai
    Wang, Jie
    IEEE TRANSACTIONS ON CYBERNETICS, 2020, 50 (06) : 2687 - 2700
  • [32] Distributed Hierarchical Deep Reinforcement Learning for Large-Scale Grid Emergency Control
    Chen, Yixi
    Zhu, Jizhong
    Liu, Yun
    Zhang, Le
    Zhou, Jialin
    IEEE TRANSACTIONS ON POWER SYSTEMS, 2024, 39 (02) : 4446 - 4458
  • [33] A dynamic self-improving ramp metering algorithm based on multi-agent deep reinforcement learning
    Deng, Fuwen
    Jin, Jiandong
    Shen, Yu
    Du, Yuchuan
    TRANSPORTATION LETTERS-THE INTERNATIONAL JOURNAL OF TRANSPORTATION RESEARCH, 2024, 16 (07): : 649 - 657
  • [34] Advanced Self-Improving Ramp Metering Algorithm based on Multi-Agent Deep Reinforcement Learning
    Deng, Fuwen
    Jin, Jiandong
    Shen, Yu
    Du, Yuchuan
    2019 IEEE INTELLIGENT TRANSPORTATION SYSTEMS CONFERENCE (ITSC), 2019, : 3804 - 3809
  • [35] Adaptive Coordinated Variable Speed Limit between Highway Mainline and On-Ramp with Deep Reinforcement Learning
    Cheng, Ming
    Zhang, Chenghao
    Jin, Hui
    Wang, Ziming
    Yang, Xiaoguang
    JOURNAL OF ADVANCED TRANSPORTATION, 2022, 2022
  • [36] Adaptive Coordinated Variable Speed Limit between Highway Mainline and On-Ramp with Deep Reinforcement Learning
    Cheng, Ming
    Zhang, Chenghao
    Jin, Hui
    Wang, Ziming
    Yang, Xiaoguang
    Journal of Advanced Transportation, 2022, 2022
  • [37] Grid-area coordinated load frequency control strategy using large-scale multi-agent deep reinforcement learning
    Li, Jiawen
    Geng, Jian
    Yu, Tao
    ENERGY REPORTS, 2022, 8 : 255 - 274
  • [38] Traffic Signal Control for Large-Scale Road Networks Based on Deep Reinforcement with PSR
    Zhou, Zhicheng
    Zhang, Hui
    Zhang, Ya
    2024 3RD CONFERENCE ON FULLY ACTUATED SYSTEM THEORY AND APPLICATIONS, FASTA 2024, 2024, : 848 - 853
  • [39] Large Scale Deep Reinforcement Learning in War-games
    Wang, Hanchao
    Tang, Hongyao
    Hao, Jianye
    Hao, Xiaotian
    Fu, Yue
    Ma, Yi
    2020 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE, 2020, : 1693 - 1699
  • [40] Multi-Agent Deep Reinforcement Learning for Coordinated Multipoint in Mobile Networks
    Schneider, Stefan
    Karl, Holger
    Khalili, Ramin
    Hecker, Artur
    IEEE TRANSACTIONS ON NETWORK AND SERVICE MANAGEMENT, 2024, 21 (01): : 908 - 924