Optimal train control by approximate dynamic programming: Comparison of three value function approximation methods

被引:0
|
作者
Liu, Tong [1 ]
Xun, Jing [1 ]
Yin, Jiateng [1 ]
Xiao, Xiao [2 ]
机构
[1] Beijing Jiaotong Univ, State Key Lab Rail Traff Control & Safety, Beijing 100044, Peoples R China
[2] Traff Control Technol Co Ltd, Beijing 100070, Peoples R China
基金
中国国家自然科学基金;
关键词
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Due to the exponential growth of states and variables, traditional exact dynamic programming suffers from the curse of dimensionality in computing the optimal train control strategy. To address this problem, this paper first proposes a complete discrete model for depicting train control process, and the optimal train control problem is reformulated into a Markov decision process through defining state variables with three dimensionalities. To enhance the computational efficiency of dynamic programming, we design three value function approximation methods to estimate the optimal value functions, which are rollout algorithm, interpolation method and neural network with back propagation, respectively. In particular, the rollout algorithm uses one step forward prediction structure to generate the optimal train control law, while the interpolation method employs a lattice partitioning process for every stage in dynamic programming. The simulation experiments on Beijing Subway show that, 1) rollout algorithm could achieve the best performance compared with the other two algorithms in computing the approximate optimal control strategies, and 2) a simple neural network approximation can not always achieve a solid performance compared with other algorithms.
引用
收藏
页码:2741 / 2746
页数:6
相关论文
共 50 条