共 22 条
- [11] ZHANG L, WU Q, SHEN J, Et al., Expression might be enough: Representing pressure and demand for reinforcement learning based traffic signal control, Proceedings of the 39th International Conference on Machine Learning, (2022)
- [12] MAO F, LI Z H, LI L., A comparison of deep reinforcement learning models for isolated traffic signal control, IEEE Intelligent Transportation Systems Magazine, 15, 1, pp. 160-180, (2022)
- [13] XU M, WU J P, HUANG L, Et al., Network-wide traffic signal control based on the discovery of critical nodes and deep reinforcement learning, Journal of Intelligent Transportation Systems, 24, 1, pp. 1-10, (2020)
- [14] WEN M N, KUBA J G, LIN R J, Et al., Multi-agent reinforcement learning is a sequence modeling problem, Advances in Neural Information Processing Systems, 35, pp. 16509-16521, (2022)
- [15] SCHULMAN J, WOLSKI F, DHARIWAL P, Et al., Proximal policy optimization algorithms, ArXiv Preprint ArXiv, 1707, (2017)
- [16] KUBA J G, WEN M N, MENG L H, Et al., Settling the variance of multi-agent policy gradients, Advances in Neural Information Processing Systems, 34, pp. 13458-13470, (2021)
- [17] SCHULMAN J, MORITZ P, LEVINE S, Et al., High-dimensional continuous control using generalized advantage estimation, ArXiv Preprint ArXiv, 1506, (2015)
- [18] ZHANG H C, FENG S Y, LIU C, Et al., CityFlow: A multi-agent reinforcement learning environment for large scale city traffic scenario, Proceedings of the World Wide Web Conference (WWW 2019), (2019)
- [19] KOONCE P, RODEGERDTS L, LEE K, Et al., Traffic signal timing manual, (2008)
- [20] COOLS S B, GERSHENSON C, D'HOOGHE B., Self-organizing traffic lights: A realistic simulation, (2013)