Traffic signal phase control at urban isolated intersections: an adaptive strategy utilizing the improved D3QN algorithm

被引:1
|
作者
Fu, Zhumu [1 ,2 ]
Zhang, Jie [1 ]
Tao, Fazhan [1 ,3 ]
Ji, Baofeng [1 ,2 ]
机构
[1] Henan Univ Sci & Technol, Coll Informat Engn, Luoyang, Peoples R China
[2] Henan Univ Sci & Technol, Henan Key Lab Robot & Intelligent Syst, Luoyang, Peoples R China
[3] Longmen Lab, Luoyang, Peoples R China
基金
中国国家自然科学基金;
关键词
traffic signal phase control; adaptive real-time control; deep reinforcement learning; double dueling deep Q network; attenuation action selection strategy;
D O I
10.1088/1361-6501/ad8212
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
The intelligent control of traffic signals at urban single intersections has emerged as an effective approach to mitigating urban traffic congestion. However, the existing fixed phase control strategy of traffic signal lights lacks capability to dynamically adjust signal phase switching based on real-time traffic conditions leading to traffic congestion. In this paper, an adaptive real-time control method employed by the traffic signal phase at a single intersection is considered based on the improved double dueling deep Q network (I-D3QN) algorithm. Firstly, the traffic signal phase control problem is modeled as a Markov decision process, with its state, action, and reward defined. Subsequently, to enhance the convergence speed and learning performance of the D3QN algorithm, attenuation action selection strategy and priority experience playback technology based on tree summation structure are introduced. Then, traffic flow data from various traffic scenarios are utilized to train the traffic signal control model based on the I-D3QN to obtain the optimal signal phase switch strategy. Finally, the effectiveness and optimal performance of the I-D3QN-based traffic signal control strategy are validated across diverse traffic scenarios. The simulation results show that, compared with the control strategy based on actuated control, deep Q network, double deep Q network, D3QN, and C-D3QN algorithms, the cumulative reward of the proposed I-D3QN strategy is increased by at least 6.57%, and the average queue length and average waiting time are reduced by at least 9.64% and 7.61%, which can effectively reduce the congestion at isolated intersections and significantly improve traffic efficiency.
引用
收藏
页数:13
相关论文
共 21 条
  • [21] DPF-Bi-RRT*: An Improved Path Planning Algorithm for Complex 3D Environments With Adaptive Sampling and Dual Potential Field Strategy
    Ge, Lin
    Phang, Swee King
    Sariff, Nohaidda
    IEEE ACCESS, 2025, 13 : 35958 - 35972