Spatial-Temporal Traffic Flow Control on Motorways Using Distributed Multi-Agent Reinforcement Learning

被引:17
|
作者
Kusic, Kresimir [1 ]
Ivanjko, Edouard [1 ]
Vrbanic, Filip [1 ]
Greguric, Martin [1 ]
Dusparic, Ivana [2 ]
机构
[1] Univ Zagreb, Fac Transport & Traff Sci, Vukeliceva St 4, HR-10000 Zagreb, Croatia
[2] Trinity Coll Dublin, Sch Comp Sci & Stat, Dublin 2, Ireland
关键词
intelligent transport systems; traffic control; spatial-temporal variable speed limit; multi-agent systems; reinforcement learning; distributed W-learning; urban motorways; VARIABLE-SPEED LIMIT; CONGESTION; OPTIMIZATION;
D O I
10.3390/math9233081
中图分类号
O1 [数学];
学科分类号
0701 ; 070101 ;
摘要
The prevailing variable speed limit (VSL) systems as an effective strategy for traffic control on motorways have the disadvantage that they only work with static VSL zones. Under changing traffic conditions, VSL systems with static VSL zones may perform suboptimally. Therefore, the adaptive design of VSL zones is required in traffic scenarios where congestion characteristics vary widely over space and time. To address this problem, we propose a novel distributed spatial-temporal multi-agent VSL (DWL-ST-VSL) approach capable of dynamically adjusting the length and position of VSL zones to complement the adjustment of speed limits in current VSL control systems. To model DWL-ST-VSL, distributed W-learning (DWL), a reinforcement learning (RL)-based algorithm for collaborative agent-based self-optimization toward multiple policies, is used. Each agent uses RL to learn local policies, thereby maximizing travel speed and eliminating congestion. In addition to local policies, through the concept of remote policies, agents learn how their actions affect their immediate neighbours and which policy or action is preferred in a given situation. To assess the impact of deploying additional agents in the control loop and the different cooperation levels on the control process, DWL-ST-VSL is evaluated in a four-agent configuration (DWL4-ST-VSL). This evaluation is done via SUMO microscopic simulations using collaborative agents controlling four segments upstream of the congestion in traffic scenarios with medium and high traffic loads. DWL also allows for heterogeneity in agents' policies; cooperating agents in DWL4-ST-VSL implement two speed limit sets with different granularity. DWL4-ST-VSL outperforms all baselines (W-learning-based VSL and simple proportional speed control), which use static VSL zones. Finally, our experiments yield insights into the new concept of VSL control. This may trigger further research on using advanced learning-based technology to design a new generation of adaptive traffic control systems to meet the requirements of operating in a nonstationary environment and at the leading edge of emerging connected and autonomous vehicles in general.
引用
收藏
页数:28
相关论文
共 50 条
  • [1] Traffic flow control using multi-agent reinforcement learning
    Zeynivand, A.
    Javadpour, A.
    Bolouki, S.
    Sangaiah, A. K.
    Jafari, F.
    Pinto, P.
    Zhang, W.
    [J]. JOURNAL OF NETWORK AND COMPUTER APPLICATIONS, 2022, 207
  • [2] Urban Traffic Control Using Distributed Multi-agent Deep Reinforcement Learning
    Kitagawa, Shunya
    Moustafa, Ahmed
    Ito, Takayuki
    [J]. PRICAI 2019: TRENDS IN ARTIFICIAL INTELLIGENCE, PT III, 2019, 11672 : 337 - 349
  • [3] Multi-agent deep reinforcement learning with traffic flow for traffic signal control
    Hou, Liang
    Huang, Dailin
    Cao, Jie
    Ma, Jialin
    [J]. JOURNAL OF CONTROL AND DECISION, 2023,
  • [4] A Multi-agent Reinforcement Learning Perspective on Distributed Traffic Engineering
    Geng, Nan
    Lan, Tian
    Aggarwal, Vaneet
    Yang, Yuan
    Xu, Mingwei
    [J]. 2020 IEEE 28TH INTERNATIONAL CONFERENCE ON NETWORK PROTOCOLS (IEEE ICNP 2020), 2020,
  • [5] Multi-agent Reinforcement Learning for Traffic Signal Control
    Prabuchandran, K. J.
    Kumar, Hemanth A. N.
    Bhatnagar, Shalabh
    [J]. 2014 IEEE 17TH INTERNATIONAL CONFERENCE ON INTELLIGENT TRANSPORTATION SYSTEMS (ITSC), 2014, : 2529 - 2534
  • [6] Distributed Transmission Control for Wireless Networks using Multi-Agent Reinforcement Learning
    Farquhar, Collin
    Kumar, Prem
    Jagannath, Anu
    Jagannath, Jithin
    [J]. BIG DATA IV: LEARNING, ANALYTICS, AND APPLICATIONS, 2022, 12097
  • [7] STAS: Spatial-Temporal Return Decomposition for Solving Sparse Rewards Problems in Multi-agent Reinforcement Learning
    Chen, Sirui
    Zhang, Zhaowei
    Yang, Yaodong
    Du, Yali
    [J]. THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 16, 2024, : 17337 - 17345
  • [8] Spatial-Temporal Graph Attention-based Multi-Agent Reinforcement Learning in Cooperative Edge Caching
    Hou, Jiacheng
    Nayak, Amiya
    [J]. ICC 2023-IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS, 2023, : 3078 - 3083
  • [9] Multi-agent Traffic Signal Control via Distributed RL with Spatial and Temporal Feature Extraction
    Zhang, Yifeng
    Damani, Mehul
    Sartoretti, Guillaume
    [J]. AUTONOMOUS AGENTS AND MULTIAGENT SYSTEMS: BEST AND VISIONARY PAPERS, AAMAS 2022, 2022, 13441 : 106 - 113
  • [10] Intelligent Traffic Light Control Using Distributed Multi-agent Q Learning
    Liu, Ying
    Liu, Lei
    Chen, Wei-Peng
    [J]. 2017 IEEE 20TH INTERNATIONAL CONFERENCE ON INTELLIGENT TRANSPORTATION SYSTEMS (ITSC), 2017,