An Optimization Method for Single Intersection's Signal Timing Based on SARSA(λ) Algorithm

被引:2
|
作者
Lu Kai [1 ]
Xu Jian-min [1 ]
Li Yi-shun [1 ]
机构
[1] S China Univ Technol, Coll Traff & Commun, Guangzhou 510640, Guangdong, Peoples R China
关键词
Traffic Engineering; Signal Timing; Reinforcement Learning; SARSA(lambda) Algorithm; Signal Cycle; Split;
D O I
10.1109/CCDC.2008.4598311
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Considering the operating characteristic of intersection signal control system, the reinforcement learning method was introduced to the intersection signal control system due to its powerful adaptability. By defining the state sets, action sets and reward function for a reinforcement learning agent, a new intersection signal control model was established based on reinforcement learning, and an optimization method for single intersection's signal timing was presented using SARSA(lambda) algorithm. Analyses show that this new signal timing optimization method can make the intersection saturation degree approach to the optimum section as close as possible, improve the efficiency of the crossing, increase utilization rate of green light, and decrease intersection traffic delay and number of stops.
引用
收藏
页码:5146 / 5150
页数:5
相关论文
共 4 条
  • [1] Reinforcement learning: A survey
    Kaelbling, LP
    Littman, ML
    Moore, AW
    [J]. JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 1996, 4 : 237 - 285
  • [2] FUZZY LOGIC-CONTROLLER FOR A TRAFFIC JUNCTION
    PAPPIS, CP
    MAMDANI, EH
    [J]. IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS, 1977, 7 (10): : 707 - 717
  • [3] Sutton R. S., 1998, Reinforcement Learning: An Introduction, V22447
  • [4] Zhao Xiao-hua, 2006, Journal of System Simulation, V18, P2889