Signal Priority Control for Trams Using Deep Reinforcement Learning

被引:0
|
作者
Wang Y.-P. [1 ]
Guo G. [2 ,3 ]
机构
[1] School of Control Science and Engineering, Dalian University of Technology, Dalian
[2] State Key Laboratory of Synthetical Automation for Process Industries, Northeastern University, Shenyang
[3] School of Control Engineering, Northeastern University at Qinhuangdao, Qinhuangdao
来源
基金
中国国家自然科学基金;
关键词
Deep reinforcement learning; Markov decision process; Signal priority; Trams;
D O I
10.16383/j.aas.c190164
中图分类号
学科分类号
摘要
Current trams-priority signal control systems have many problems, such as low adaptability to real-time traffic changes and high complexity in optimization solutions, etc. In this paper, an active signal priority control model is proposed for the trams based on deep reinforcement learning. Considering the traffic demands from tram and general vehicles, it can reduce the traffic delay of general vehicles while minimizing the need for trams to stop at the intersection. Real-time traffic information is used to dynamically adjust the sequence of traffic signals throughout the whole passing process of the tram, without relying on the complex traffic modeling. We use deep Q-network algorithm for problem-solving, and adopt dueling network, double Q network, and prioritized experience replay to improve the learning performance. Experiments based on SUMO have demonstrated that the proposed model can excellently improve the efficiency of trams and general vehicles simultaneously. Copyright © 2019 Acta Automatica Sinica. All rights reserved.
引用
收藏
页码:2366 / 2377
页数:11
相关论文
共 25 条
  • [1] Statistical bulletin on transportation industry development in 2018. [Online], (2019)
  • [2] Shi J.G., Sun Y.S., Schonfeld P., Qi J., Joint optimization of tram timetables and signal timing adjustments at intersections, Transportation Research Part C: Emerging Technologies, 83, 6, pp. 104-119, (2017)
  • [3] Ji Y.X., Tang Y., Du Y.C., Zhang X., Coordinated optimization of tram trajectories with arterial signal timing resynchronization, Transportation Research Part C: Emerging Technologies, 99, 4, pp. 53-66, (2019)
  • [4] Little J.D.C., Kelson M.D., Gartner N.M., Maxband: a program for setting signals on arteries and triangular networks, Proceedings of the 60th Annual Meeting of the Transportation Research Board, pp. 40-46, (1981)
  • [5] Jeong Y.J., Kim Y.C., Tram passive signal priority strategy based on the maxband model, KSCE Journal of Civil Engineering, 18, 5, pp. 1518-1527, (2014)
  • [6] Ma W., Zou L., An K., Gartner N.H., Wang M., A partition-enabled multi-mode band approach to arterial traffic signal optimization, IEEE Transactions on Intelligent Transportation Systems, 20, 1, pp. 313-322, (2019)
  • [7] Kim H., Cheng Y., Chang G., Variable signal progression bands for transit vehicles under dwell time uncertainty and traffic queues, IEEE Transactions on Intelligent Transportation Systems, 20, 1, pp. 109-122, (2019)
  • [8] Ji Y.X., Tang Y., Wang W., Du Y.C., Tram-oriented traffic signal timing resynchronization, Journal of Advanced Transportation, 2018, 1, pp. 1-13, (2018)
  • [9] Jacobson J., Sheffi Y., Analytical model of traffic delays under bus signal preemption: theory and application, Transportation Research Part B: Methodological, 15, 2, pp. 127-138, (1981)
  • [10] Yang M., Ding J., Wang W., Ma Y.Y., A coordinated signal priority strategy for modern trams on arterial streets by predicting the tram dwell time, KSCE Journal of Civil Engineering, 22, 2, pp. 823-836, (2018)