Using Reinforcement Learning to Control Traffic Signals in a Real-World Scenario: An Approach Based on Linear Function Approximation

被引:14
|
作者
Alegre, Lucas N. [1 ]
Ziemke, Theresa [2 ]
Bazzan, Ana L. C. [1 ]
机构
[1] Univ Fed Rio Grande do Sul, Inst Informat, BR-91501970 Porto Alegre, RS, Brazil
[2] Tech Univ Berlin, Transport Syst Planning & Transport Telemat Dept, D-10623 Berlin, Germany
关键词
Traffic signal control; reinforcement learning; function approximation; multiagent systems; SIMULATION; NETWORK;
D O I
10.1109/TITS.2021.3091014
中图分类号
TU [建筑科学];
学科分类号
0813 ;
摘要
Reinforcement learning is an efficient, widely used machine learning technique that performs well in problems with a reasonable number of states and actions. This is rarely the case regarding control-related problems, as for instance controlling traffic signals, where the state space can be very large. One way to deal with the curse of dimensionality is to use generalization techniques such as function approximation. In this paper, a linear function approximation is used by traffic signal agents in a network of signalized intersections. Specifically, a true online SARSA(lambda) algorithm with Fourier basis functions (TOS(lambda)-FB) is employed. This method has the advantage of having convergence guarantees and error bounds, a drawback of non-linear function approximation. In order to evaluate TOS(lambda)-FB, we perform experiments in variations of an isolated intersection scenario and a scenario of the city of Cottbus, Germany, with 22 signalized intersections, implemented in MATSim. We compare our results not only to fixed-time controllers, but also to a state-of-the-art rule-based adaptive method, showing that TOS(lambda)-FB shows a performance that is highly superior to the fixed-time, while also being at least as efficient as the rule-based approach. For more than half of the intersections, our approach leads to less congestion and delay, without the need for the knowledge that underlies the rule-based approach.
引用
收藏
页码:9126 / 9135
页数:10
相关论文
共 50 条
  • [41] A machine-learning approach to illuminant estimation using statistical regularities in photoreceptor signals from real-world surfaces
    Hexley, Allie C.
    Morimoto, Takuma
    Uchikawa, Keiji
    Smithson, Hannah E.
    [J]. PERCEPTION, 2021, 50 (1_SUPPL) : 53 - 53
  • [42] A Lightweight Simulation Framework for Learning Control Policies for Autonomous Vehicles in Real-World Traffic Condition
    Al-Qizwini, Mohammed
    Bulan, Orhan
    Qi, Xuewei
    Mengistu, Yehenew
    Mahesh, Sheetal
    Hwang, Joon
    Clifford, David
    [J]. IEEE SENSORS JOURNAL, 2021, 21 (14) : 15762 - 15774
  • [43] Velocity control in a right-turn across traffic scenario for autonomous vehicles using kernel-based reinforcement learning
    Zhang, Yuxiang
    Gao, Bingzhao
    Zhou, Jinghua
    Guo, Lulu
    Chen, Hong
    [J]. 2017 CHINESE AUTOMATION CONGRESS (CAC), 2017, : 6211 - 6216
  • [44] Defining Reasonably Foreseeable Parameter Ranges Using Real-World Traffic Data for Scenario-Based Safety Assessment of Automated Vehicles
    Nakamura, Hiroki
    Muslim, H.
    Kato, R.
    Prefontaine-Watanabe, Sandra
    Nakamura, H.
    Kaneko, H.
    Imanaga, H.
    Antona-Makoshi, J.
    Kitajima, S.
    Uchida, N.
    Kitahara, E.
    Ozawa, K.
    Taniguchi, S.
    [J]. IEEE ACCESS, 2022, 10 : 37743 - 37760
  • [45] An online prediction algorithm for reinforcement learning with linear function approximation using cross entropy method
    Joseph, Ajin George
    Bhatnagar, Shalabh
    [J]. MACHINE LEARNING, 2018, 107 (8-10) : 1385 - 1429
  • [46] An online prediction algorithm for reinforcement learning with linear function approximation using cross entropy method
    Ajin George Joseph
    Shalabh Bhatnagar
    [J]. Machine Learning, 2018, 107 : 1385 - 1429
  • [47] Development Environment of Reinforcement Learning-based Controllers for Real-world Physical Systems Using LW-RCP
    Lee T.
    Ju D.
    Lee Y.S.
    [J]. Journal of Institute of Control, Robotics and Systems, 2023, 29 (07) : 543 - 549
  • [48] Enabling Efficient, Reliable Real-World Reinforcement Learning with Approximate Physics-Based Models
    Westenbroek, Tyler
    Levy, Jacob
    Fridovich-Keil, David
    [J]. CONFERENCE ON ROBOT LEARNING, VOL 229, 2023, 229
  • [49] RL4RS: A Real-World Dataset for Reinforcement Learning based Recommender System
    Wang, Kai
    Zou, Zhene
    Zhao, Minghao
    Deng, Qilin
    Shang, Yue
    Liang, Yile
    Wu, Runze
    Shen, Xudong
    Lyu, Tangjie
    Fan, Changjie
    [J]. PROCEEDINGS OF THE 46TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, SIGIR 2023, 2023, : 2935 - 2944
  • [50] Traffic light control using deep policy-gradient and value-function-based reinforcement learning
    Mousavi, Seyed Sajad
    Schukat, Michael
    Howley, Enda
    [J]. IET INTELLIGENT TRANSPORT SYSTEMS, 2017, 11 (07) : 417 - 423