Deep Deterministic Policy Gradient for High-Speed Train Trajectory Optimization

被引:22
|
作者
Ning, Lingbin [1 ]
Zhou, Min [1 ]
Hou, Zhuopu [1 ]
Goverde, Rob M. P. [2 ]
Wang, Fei-Yue [3 ]
Dong, Hairong [1 ]
机构
[1] Beijing Jiaotong Univ, State Key Lab Rail Traff Control & Safety, Beijing 100044, Peoples R China
[2] Delft Univ Technol, Dept Transport & Planning, NL-2628 CN Delft, Netherlands
[3] Chinese Acad Sci, Inst Automat, State Key Lab Management & Control Complex Syst, Beijing 100190, Peoples R China
基金
中国国家自然科学基金;
关键词
Rail transportation; Training; Heuristic algorithms; Resistance; Optimal control; Trajectory optimization; Switches; High-speed railway; train trajectory optimization; deep deterministic policy gradient; energy efficiency; TRAFFIC MANAGEMENT; LEARNING APPROACH; MODEL; INTEGRATION; OPERATION; ALGORITHM; SYSTEM; DELAY;
D O I
10.1109/TITS.2021.3105380
中图分类号
TU [建筑科学];
学科分类号
0813 ;
摘要
This paper proposes a novel train trajectory optimization approach for high-speed railways. We restrict our attention to single train operation scenarios with different scheduled/rescheduled running times aiming at generating optimal train recommended trajectories in real time, which can ensure punctuality and energy efficiency of train operation. A learning-based approach deep deterministic policy gradient (DDPG) is designed to generate optimal train trajectories based on the offline training from the interaction between the agent and the trajectory simulation environment. An allocating running time and selecting operation modes (ARTSOM) algorithm is proposed to improve train punctuality and give a series of discrete operation modes (full traction, cruising, coasting, full braking), and thus to produce a feasible training set for DDPG, which can speed up the training process. Numerical experiments show that an optimized speed profile can be generated by DDPG within seconds on a realistic railway line. In addition, the results demonstrate the generalization ability of trained DDPG in solving TTO problems with different running times and line conditions.
引用
收藏
页码:11562 / 11574
页数:13
相关论文
共 50 条
  • [31] The future of high-speed train
    Endo, T
    [J]. IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2005, E88D (12) : 2625 - 2629
  • [32] HIGH-SPEED ICE TRAIN
    EISENSTADT, MM
    RILEY, JD
    [J]. MECHANICAL ENGINEERING, 1971, 93 (06): : 14 - +
  • [33] The future of high-speed train
    Ishida, Y
    [J]. ISADS 2005: International Symposium on Autonomous Decentralized Systems,Proceedings, 2005, : 415 - 418
  • [34] HIGH-SPEED TRAIN OPERATION
    KIRK, WB
    [J]. MECHANICAL ENGINEERING, 1966, 88 (02): : 75 - &
  • [35] High-speed train for everyone
    Navarri, A.
    [J]. Rail International, 1998, (9-10): : 18 - 19
  • [36] Italian high-speed train
    Bertorelli, D.
    [J]. Mechanical Incorporated Engineer, 1993, 5 (04):
  • [37] High-speed train bids are in
    [J]. ENR, 20 (21):
  • [38] Integrated optimization of train rescheduling decisions and train speeds based on satisfactory optimization for high-speed railway
    Long, Sihui
    Meng, Lingyun
    Wang, Yihui
    Miao, Jianrui
    Luan, Xiaojie
    [J]. 2022 IEEE 25TH INTERNATIONAL CONFERENCE ON INTELLIGENT TRANSPORTATION SYSTEMS (ITSC), 2022, : 4117 - 4123
  • [39] Trajectory Optimization and Analytic Solutions for High-Speed Dynamic Soaring
    Sachs, Gottfried
    Grueter, Benedikt
    [J]. AEROSPACE, 2020, 7 (04)
  • [40] A Deep Deterministic Policy Gradient Approach for Vehicle Speed Tracking Control With a Robotic Driver
    Hao, Gaofeng
    Fu, Zhuang
    Feng, Xin
    Gong, Zening
    Chen, Peng
    Wang, Dan
    Wang, Weibin
    Si, Yang
    [J]. IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING, 2022, 19 (03) : 2514 - 2525