Regional Multi-Agent Cooperative Reinforcement Learning for City-Level Traffic Grid Signal Control

被引:2
|
作者
Li, Yisha [1 ,2 ]
Zhang, Ya [1 ,2 ]
Li, Xinde [1 ,2 ]
Sun, Changyin [1 ,2 ]
机构
[1] Southeast Univ, Sch Automat, Nanjing 210096, Peoples R China
[2] Southeast Univ, Key Lab Measurement & Control Complex Syst Engn, Minist Educ, Nanjing 210096, Peoples R China
关键词
Q-learning; Human-machine systems; Heuristic algorithms; Feature extraction; Real-time systems; Human-machine cooperation; mixed domain attention mechanism; multi-agent reinforcement learning; spatio-temporal feature; traffic signal control;
D O I
10.1109/JAS.2024.124365
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This article studies the effective traffic signal control problem of multiple intersections in a city-level traffic system. A novel regional multi-agent cooperative reinforcement learning algorithm called RegionSTLight is proposed to improve the traffic efficiency. Firstly a regional multi-agent Q-learning framework is proposed, which can equivalently decompose the global Q value of the traffic system into the local values of several regions. Based on the framework and the idea of human-machine cooperation, a dynamic zoning method is designed to divide the traffic network into several strong-coupled regions according to real-time traffic flow densities. In order to achieve better cooperation inside each region, a lightweight spatio-temporal fusion feature extraction network is designed. The experiments in synthetic, real-world and city-level scenarios show that the proposed RegionSTLight converges more quickly, is more stable, and obtains better asymptotic performance compared to state-of-the-art models.
引用
收藏
页码:1987 / 1998
页数:12
相关论文
empty
未找到相关数据