Distributed Signal Control of Multi-agent Reinforcement Learning Based on Game

被引:0
|
作者
Qu Z.-W. [1 ]
Pan Z.-T. [1 ]
Chen Y.-H. [1 ]
Li H.-T. [1 ]
Wang X. [1 ]
机构
[1] College of Transportation, Jilin University, Changchun
来源
Chen, Yong-Heng (cyh@jlu.edu.cn) | 1600年 / Science Press卷 / 20期
基金
中国国家自然科学基金;
关键词
Distributed traffic signal control; Game theory; Intelligent transportation; Multi-agent reinforcement learning; Numerical simulation; Urban road network under unbalanced demand;
D O I
10.16097/j.cnki.1009-6744.2020.02.012
中图分类号
学科分类号
摘要
The difficulty of distributed signal control is increasing due to the unbalance and fluctuation of traffic demand. Since the decision-making of existing independent action multi-agent reinforcement learning (IA-MARL) is based on its own historical experience, the distributed signal control based on IA-MARL is difficult to timely alleviate the impact of unbalanced and fluctuating traffic demand. In this paper, the framework of multi-agent reinforcement learning based on the game (G- MARL) was proposed by improving the decision-making of IA-MARL with integrating the mixed strategy Nash-equilibrium, which is a concept in game theory. In the grid network with the Poisson arrival rate, the distributed control methods based on IA-MARL and G-MARL were simulated to obtain the unit travel time and the unit vehicle delay curves. The results show that, the unit travel time and the unit vehicle average delay obtained by G-MARL are reduced by 59.94% and 81.45% compared with IA-MARL respectively. It is proved that G-MARL is suitable for distributed signal control when there are unbalances and fluctuations in traffic demand with the unsaturated state. Copyright © 2020 by Science Press.
引用
收藏
页码:76 / 82and100
相关论文
共 7 条
  • [1] Yang W.C., Zhang L., Zhu F., Multi-agent reinforcement learning based traffic signal control for integrated urban network: survey of state of art, Application Research of Computers, 35, 6, pp. 1613-1618, (2018)
  • [2] Thorpe T., Vehicle traffic light control using SARSA, (1997)
  • [3] Abdulhai B., Karakoulas G.J., Pringle R., Reinforcement learning for true adaptive traffic signal control, Journal of Transportation Engineering, 129, 3, pp. 278-285, (2003)
  • [4] Balaji P.G., German X., Srinivasan D., Urban traffic signal control using reinforcement learning agents, IET Intelligent Transport Systems, 4, 3, pp. 177-188, (2010)
  • [5] Zhu F., Aziz H.M.A., Qian X., Et al., A junction-tree based learning algorithm to optimize network wide traffic control: A coordinated multi-agent framework, Transportation Research Part C: Emerging Technologies, 58, pp. 487-501, (2015)
  • [6] Fudenberg D., Tirole J., Game Theory, (1991)
  • [7] Cai Y., Yang X.G., Wang H., A Flexible on-line transition structure of traffic signal phases, Urban Transport of China, 7, 3, pp. 80-85, (2009)