Spatial-Temporal Traffic Flow Control on Motorways Using Distributed Multi-Agent Reinforcement Learning

Cited by: 17

Authors:

Kusic, Kresimir [1]

Ivanjko, Edouard [1]

Vrbanic, Filip [1]

Greguric, Martin [1]

Dusparic, Ivana [2]

Affiliations:

[1] Univ Zagreb, Fac Transport & Traff Sci, Vukeliceva St 4, HR-10000 Zagreb, Croatia

[2] Trinity Coll Dublin, Sch Comp Sci & Stat, Dublin 2, Ireland

Source:

MATHEMATICS | 2021 / Vol. 9 / Issue 23

Keywords:

intelligent transport systems; traffic control; spatial-temporal variable speed limit; multi-agent systems; reinforcement learning; distributed W-learning; urban motorways; VARIABLE-SPEED LIMIT; CONGESTION; OPTIMIZATION;

DOI:

10.3390/math9233081

Chinese Library Classification (CLC) number:

O1 [Mathematics];

Discipline classification code:

0701 ; 070101 ;

Abstract:

Prevailing variable speed limit (VSL) systems, although an effective strategy for traffic control on motorways, have the disadvantage that they work only with static VSL zones. Under changing traffic conditions, VSL systems with static VSL zones may perform suboptimally. Therefore, the adaptive design of VSL zones is required in traffic scenarios where congestion characteristics vary widely over space and time. To address this problem, we propose a novel distributed spatial-temporal multi-agent VSL (DWL-ST-VSL) approach capable of dynamically adjusting the length and position of VSL zones to complement the adjustment of speed limits in current VSL control systems. DWL-ST-VSL is modelled with distributed W-learning (DWL), a reinforcement learning (RL)-based algorithm for collaborative agent-based self-optimization toward multiple policies. Each agent uses RL to learn local policies, thereby maximizing travel speed and eliminating congestion. In addition to local policies, through the concept of remote policies, agents learn how their actions affect their immediate neighbours and which policy or action is preferred in a given situation. To assess the impact of deploying additional agents in the control loop and of different cooperation levels on the control process, DWL-ST-VSL is evaluated in a four-agent configuration (DWL4-ST-VSL). The evaluation uses SUMO microscopic simulations in which collaborative agents control four segments upstream of the congestion in traffic scenarios with medium and high traffic loads. DWL also allows for heterogeneity in agents' policies; cooperating agents in DWL4-ST-VSL implement two speed limit sets with different granularity. DWL4-ST-VSL outperforms all baselines (W-learning-based VSL and simple proportional speed control), which use static VSL zones. Finally, our experiments yield insights into the new concept of VSL control. This may trigger further research on using advanced learning-based technology to design a new generation of adaptive traffic control systems that can operate in a nonstationary environment and alongside emerging connected and autonomous vehicles.
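
The interplay of local and remote policies described in the abstract can be illustrated with a minimal tabular sketch. It is not the authors' implementation: the W-value update follows the commonly cited W-learning rule (a policy whose nominated action was not executed moves its W-value toward the difference between what it expected and what it obtained), and the scaling of remote W-values by a cooperation coefficient is assumed here as the usual DWL arbitration scheme. The names `Policy`, `select_action`, `SPEED_LIMITS`, and the state labels are hypothetical.

```python
from collections import defaultdict

# Hypothetical speed-limit action set (km/h); the paper's actual sets are not given here.
SPEED_LIMITS = (60, 80, 100)


class Policy:
    """One tabular policy holding Q-values and per-state W-values (illustrative only)."""

    def __init__(self, actions, alpha=0.1, gamma=0.9):
        self.actions = list(actions)
        self.alpha = alpha            # learning rate (assumed value)
        self.gamma = gamma            # discount factor (assumed value)
        self.q = defaultdict(float)   # (state, action) -> Q-value
        self.w = defaultdict(float)   # state -> W-value (importance of being obeyed)

    def _best_next(self, next_state):
        return max(self.q[(next_state, a)] for a in self.actions)

    def nominate(self, state):
        """Return the greedy action this policy would like to see executed."""
        return max(self.actions, key=lambda a: self.q[(state, a)])

    def update_q(self, state, action, reward, next_state):
        """Standard Q-learning update for the action that was actually executed."""
        target = reward + self.gamma * self._best_next(next_state)
        self.q[(state, action)] += self.alpha * (target - self.q[(state, action)])

    def update_w(self, state, reward, next_state):
        """W-learning update, applied when this policy's nomination was NOT executed."""
        expected = self.q[(state, self.nominate(state))]
        obtained = reward + self.gamma * self._best_next(next_state)
        self.w[state] = (1 - self.alpha) * self.w[state] + self.alpha * (expected - obtained)


def select_action(candidates):
    """Pick the policy with the highest weighted W-value and return (policy, action).

    candidates: list of (policy, state, weight); weight is 1.0 for local policies
    and the cooperation coefficient (0..1) for remote policies (assumption).
    """
    policy, state, _ = max(candidates, key=lambda c: c[2] * c[0].w[c[1]])
    return policy, policy.nominate(state)
```

With a cooperation coefficient of 0 an agent ignores its neighbours entirely, while a coefficient of 1 treats a neighbour's preferences as equal to its own; intermediate values correspond to the different cooperation levels mentioned in the abstract.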

Pages: 28
