Traffic signal control using a cooperative EWMA-based multi-agent reinforcement learning

Cited by: 3
Authors
Qiao, Zhimin [1]
Ke, Liangjun [1]
Wang, Xiaoqiang [1]
Affiliations
[1] Xi An Jiao Tong Univ, Sch Automat Sci & Engn, Xian 710049, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
Mean-field; Traffic signal control; TD3; Multi-agent reinforcement learning; NETWORK; ALGORITHM; COORDINATION;
DOI
10.1007/s10489-022-03643-9
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification code
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
In contemporary cities, traffic signal control remains enormously difficult. Multi-agent reinforcement learning (MARL) is a promising way to solve this problem. However, most MARL algorithms cannot effectively transfer learned strategies when the number of agents increases or decreases. This paper proposes a new MARL algorithm, cooperative dynamic delay updating twin delayed deep deterministic policy gradient based on the exponentially weighted moving average (CoTD3-EWMA), to solve this problem. By introducing mean-field theory, the algorithm implicitly models the interaction between agents and the environment, which reduces the dimension of the action space and improves the scalability of the algorithm. In addition, we propose a dynamic delay updating method based on the exponentially weighted moving average (EWMA), which mitigates the Q-value overestimation problem of the traditional TD3 algorithm. Moreover, a joint reward allocation mechanism and a state sharing mechanism are proposed to improve the global strategy learning ability and robustness of the agents. Simulation results show that the new algorithm outperforms current state-of-the-art algorithms, effectively reducing vehicle delay time and improving the traffic efficiency of the traffic network.
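For readers unfamiliar with the EWMA named in the abstract, the following is a minimal sketch of a generic exponentially weighted moving average. It is not the authors' implementation: the function name `ewma`, the smoothing parameter `beta`, and the choice to initialize with the first sample are assumptions made for illustration, and the abstract does not say how the paper maps the averaged quantity to the TD3 update delay.

```python
def ewma(values, beta=0.9):
    """Return the running exponentially weighted moving average of `values`.

    Update rule (generic, not taken from the paper):
        v_t = beta * v_{t-1} + (1 - beta) * x_t,  with v_0 = x_0
    `beta` is a hypothetical smoothing parameter; larger values give a
    smoother average that reacts more slowly to new samples.
    """
    if not 0.0 <= beta < 1.0:
        raise ValueError("beta must be in [0, 1)")
    averaged = []
    current = None
    for x in values:
        # The first sample initializes the average; later samples are blended in.
        current = x if current is None else beta * current + (1.0 - beta) * x
        averaged.append(current)
    return averaged
```

Calling `ewma(samples, beta=0.5)` on a list of scalar measurements returns a list of the same length, with each entry giving the smoothed value up to that sample.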
Pages: 4483 - 4498
Number of pages: 16
Related papers
50 in total
  • [1] Traffic signal control using a cooperative EWMA-based multi-agent reinforcement learning
    Zhimin Qiao
    Liangjun Ke
    Xiaoqiang Wang
    Applied Intelligence, 2023, 53 : 4483 - 4498
  • [2] Cooperative Traffic Signal Control Based on Multi-agent Reinforcement Learning
    Gao, Ruowen
    Liu, Zhihan
    Li, Jinglin
    Yuan, Quan
    BLOCKCHAIN AND TRUSTWORTHY SYSTEMS, BLOCKSYS 2019, 2020, 1156 : 787 - 793
  • [3] Multi-Agent Reinforcement Learning for Traffic Signal Control: A Cooperative Approach
    Kolat, Mate
    Kovari, Balint
    Becsi, Tamas
    Aradi, Szilard
    SUSTAINABILITY, 2023, 15 (04)
  • [4] Multiple intersections traffic signal control based on cooperative multi-agent reinforcement learning
    Liu, Junxiu
    Qin, Sheng
    Su, Min
    Luo, Yuling
    Wang, Yanhu
    Yang, Su
    INFORMATION SCIENCES, 2023, 647
  • [5] Swarm Reinforcement Learning for traffic signal control based on cooperative multi-agent framework
    Tahifa, Mohammed
    Boumhidi, Jaouad
    Yahyaouy, Ali
    2015 INTELLIGENT SYSTEMS AND COMPUTER VISION (ISCV), 2015,
  • [6] Multi-Agent Deep Reinforcement Learning for Decentralized Cooperative Traffic Signal Control
    Zhao, Yang
    Hu, Jian-Ming
    Gao, Ming-Yang
    Zhang, Zuo
    CICTP 2020: TRANSPORTATION EVOLUTION IMPACTING FUTURE MOBILITY, 2020, : 458 - 470
  • [7] Multi-agent Reinforcement Learning for Traffic Signal Control
    Prabuchandran, K. J.
    Kumar, Hemanth A. N.
    Bhatnagar, Shalabh
    2014 IEEE 17TH INTERNATIONAL CONFERENCE ON INTELLIGENT TRANSPORTATION SYSTEMS (ITSC), 2014, : 2529 - 2534
  • [8] An Improved Traffic Signal Control Method Based on Multi-agent Reinforcement Learning
    Xu, Jianyou
    Zhang, Zhichao
    Zhang, Shuo
    Miao, Jiayao
    2021 PROCEEDINGS OF THE 40TH CHINESE CONTROL CONFERENCE (CCC), 2021, : 6612 - 6616
  • [9] A multi-agent reinforcement learning based approach for intelligent traffic signal control
    Benhamza, Karima
    Seridi, Hamid
    Agguini, Meriem
    Bentagine, Amel
    EVOLVING SYSTEMS, 2024, : 2383 - 2397
  • [10] CLlight: Enhancing representation of multi-agent reinforcement learning with contrastive learning for cooperative traffic signal control
    Fu, Xiang
    Ren, Yilong
    Jiang, Han
    Lv, Jiancheng
    Cui, Zhiyong
    Yu, Haiyang
    Expert Systems with Applications, 2025, 262