Regional Multi-Agent Cooperative Reinforcement Learning for City-Level Traffic Grid Signal Control

被引：2

作者：

Li, Yisha ^{[1
,2
]}

Zhang, Ya ^{[1
,2
]}

Li, Xinde ^{[1
,2
]}

Sun, Changyin ^{[1
,2
]}

机构：

[1] Southeast Univ, Sch Automat, Nanjing 210096, Peoples R China

[2] Southeast Univ, Key Lab Measurement & Control Complex Syst Engn, Minist Educ, Nanjing 210096, Peoples R China

来源：

IEEE-CAA JOURNAL OF AUTOMATICA SINICA | 2024年 / 11卷 / 09期

关键词：

Q-learning; Human-machine systems; Heuristic algorithms; Feature extraction; Real-time systems; Human-machine cooperation; mixed domain attention mechanism; multi-agent reinforcement learning; spatio-temporal feature; traffic signal control;

D O I：

10.1109/JAS.2024.124365

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

This article studies the effective traffic signal control problem of multiple intersections in a city-level traffic system. A novel regional multi-agent cooperative reinforcement learning algorithm called RegionSTLight is proposed to improve the traffic efficiency. Firstly a regional multi-agent Q-learning framework is proposed, which can equivalently decompose the global Q value of the traffic system into the local values of several regions. Based on the framework and the idea of human-machine cooperation, a dynamic zoning method is designed to divide the traffic network into several strong-coupled regions according to real-time traffic flow densities. In order to achieve better cooperation inside each region, a lightweight spatio-temporal fusion feature extraction network is designed. The experiments in synthetic, real-world and city-level scenarios show that the proposed RegionSTLight converges more quickly, is more stable, and obtains better asymptotic performance compared to state-of-the-art models.

引用

页码：1987 / 1998

页数：12