Using Reinforcement Learning to Control Traffic Signals in a Real-World Scenario: An Approach Based on Linear Function Approximation

被引：14

作者：

Alegre, Lucas N. ^{[1
]}

Ziemke, Theresa ^{[2
]}

Bazzan, Ana L. C. ^{[1
]}

机构：

[1] Univ Fed Rio Grande do Sul, Inst Informat, BR-91501970 Porto Alegre, RS, Brazil

[2] Tech Univ Berlin, Transport Syst Planning & Transport Telemat Dept, D-10623 Berlin, Germany

来源：

IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS | 2022年 / 23卷 / 07期

关键词：

Traffic signal control; reinforcement learning; function approximation; multiagent systems; SIMULATION; NETWORK;

D O I：

10.1109/TITS.2021.3091014

中图分类号：

TU [建筑科学];

学科分类号：

0813 ;

摘要：

Reinforcement learning is an efficient, widely used machine learning technique that performs well in problems with a reasonable number of states and actions. This is rarely the case regarding control-related problems, as for instance controlling traffic signals, where the state space can be very large. One way to deal with the curse of dimensionality is to use generalization techniques such as function approximation. In this paper, a linear function approximation is used by traffic signal agents in a network of signalized intersections. Specifically, a true online SARSA(lambda) algorithm with Fourier basis functions (TOS(lambda)-FB) is employed. This method has the advantage of having convergence guarantees and error bounds, a drawback of non-linear function approximation. In order to evaluate TOS(lambda)-FB, we perform experiments in variations of an isolated intersection scenario and a scenario of the city of Cottbus, Germany, with 22 signalized intersections, implemented in MATSim. We compare our results not only to fixed-time controllers, but also to a state-of-the-art rule-based adaptive method, showing that TOS(lambda)-FB shows a performance that is highly superior to the fixed-time, while also being at least as efficient as the rule-based approach. For more than half of the intersections, our approach leads to less congestion and delay, without the need for the knowledge that underlies the rule-based approach.

引用

页码：9126 / 9135

页数：10

共 50 条

[41] A machine-learning approach to illuminant estimation using statistical regularities in photoreceptor signals from real-world surfaces
Hexley, Allie C.
Morimoto, Takuma
Uchikawa, Keiji
Smithson, Hannah E.
[J]. PERCEPTION, 2021, 50 (1_SUPPL) : 53 - 53
[42] A Lightweight Simulation Framework for Learning Control Policies for Autonomous Vehicles in Real-World Traffic Condition
Al-Qizwini, Mohammed
Bulan, Orhan
Qi, Xuewei
Mengistu, Yehenew
Mahesh, Sheetal
Hwang, Joon
Clifford, David
[J]. IEEE SENSORS JOURNAL, 2021, 21 (14) : 15762 - 15774
[43] Velocity control in a right-turn across traffic scenario for autonomous vehicles using kernel-based reinforcement learning
Zhang, Yuxiang
Gao, Bingzhao
Zhou, Jinghua
Guo, Lulu
Chen, Hong
[J]. 2017 CHINESE AUTOMATION CONGRESS (CAC), 2017, : 6211 - 6216
[44] Defining Reasonably Foreseeable Parameter Ranges Using Real-World Traffic Data for Scenario-Based Safety Assessment of Automated Vehicles
Nakamura, Hiroki
Muslim, H.
Kato, R.
Prefontaine-Watanabe, Sandra
Nakamura, H.
Kaneko, H.
Imanaga, H.
Antona-Makoshi, J.
Kitajima, S.
Uchida, N.
Kitahara, E.
Ozawa, K.
Taniguchi, S.
[J]. IEEE ACCESS, 2022, 10 : 37743 - 37760
[45] An online prediction algorithm for reinforcement learning with linear function approximation using cross entropy method
Joseph, Ajin George
Bhatnagar, Shalabh
[J]. MACHINE LEARNING, 2018, 107 (8-10) : 1385 - 1429
[46] An online prediction algorithm for reinforcement learning with linear function approximation using cross entropy method
Ajin George Joseph
Shalabh Bhatnagar
[J]. Machine Learning, 2018, 107 : 1385 - 1429
[47] Development Environment of Reinforcement Learning-based Controllers for Real-world Physical Systems Using LW-RCP
Lee T.
Ju D.
Lee Y.S.
[J]. Journal of Institute of Control, Robotics and Systems, 2023, 29 (07) : 543 - 549
[48] Enabling Efficient, Reliable Real-World Reinforcement Learning with Approximate Physics-Based Models
Westenbroek, Tyler
Levy, Jacob
Fridovich-Keil, David
[J]. CONFERENCE ON ROBOT LEARNING, VOL 229, 2023, 229
[49] RL4RS: A Real-World Dataset for Reinforcement Learning based Recommender System
Wang, Kai
Zou, Zhene
Zhao, Minghao
Deng, Qilin
Shang, Yue
Liang, Yile
Wu, Runze
Shen, Xudong
Lyu, Tangjie
Fan, Changjie
[J]. PROCEEDINGS OF THE 46TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, SIGIR 2023, 2023, : 2935 - 2944
[50] Traffic light control using deep policy-gradient and value-function-based reinforcement learning
Mousavi, Seyed Sajad
Schukat, Michael
Howley, Enda
[J]. IET INTELLIGENT TRANSPORT SYSTEMS, 2017, 11 (07) : 417 - 423

← 1 2 3 4 5 →